AcapeLlama/AcapeLlama_v2.0_guidance_induce_align
Source: Hugging Face. Updated 2024-06-03; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/AcapeLlama/AcapeLlama_v2.0_guidance_induce_align
Resource description:
---
dataset_info:
- config_name: line
  features:
  - name: title
    dtype: string
  - name: mungchi
    sequence: int64
  - name: output
    dtype: string
  - name: guidance
    struct:
    - name: former
      sequence: string
    - name: latter
      sequence: string
  - name: lyrics
    dtype: string
  - name: instruction
    dtype: string
  splits:
  - name: train
    num_bytes: 1052913479.4
    num_examples: 430056
  - name: test
    num_bytes: 116990386.6
    num_examples: 47784
  download_size: 510858742
  dataset_size: 1169903866.0
- config_name: total
  features:
  - name: title
    dtype: string
  - name: mungchi
    sequence: int64
  - name: output
    dtype: string
  - name: lyrics
    dtype: string
  - name: instruction
    dtype: string
  splits:
  - name: train
    num_bytes: 19887797.31551596
    num_examples: 4849
  - name: test
    num_bytes: 2210666.6844840385
    num_examples: 539
  download_size: 7229334
  dataset_size: 22098464.0
- config_name: verse
  features:
  - name: title
    dtype: string
  - name: mungchi
    sequence: int64
  - name: output
    dtype: string
  - name: guidance
    struct:
    - name: former
      sequence: string
    - name: latter
      sequence: string
  - name: lyrics
    dtype: string
  - name: instruction
    dtype: string
  splits:
  - name: train
    num_bytes: 100532890.75546885
    num_examples: 31597
  - name: test
    num_bytes: 11171028.24453116
    num_examples: 3511
  download_size: 46457684
  dataset_size: 111703919.0
configs:
- config_name: line
  data_files:
  - split: train
    path: line/train-*
  - split: test
    path: line/test-*
- config_name: total
  data_files:
  - split: train
    path: total/train-*
  - split: test
    path: total/test-*
- config_name: verse
  data_files:
  - split: train
    path: verse/train-*
  - split: test
    path: verse/test-*
---
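The split sizes in the metadata above are consistent with a roughly 90/10 train/test partition of each config, and the fractional `num_bytes` values look like proportional estimates (dataset size scaled by each split's share of examples) rather than measured file sizes. The following sketch checks that arithmetic in plain Python. The `CARD_STATS` table only restates numbers from the card; the helper names `train_fraction` and `bytes_are_proportional` are illustrative and not part of any library.

```python
# Numbers copied from the dataset card metadata:
# (num_examples, num_bytes) per split, plus dataset_size per config.
CARD_STATS = {
    "line": {
        "train": (430056, 1052913479.4),
        "test": (47784, 116990386.6),
        "dataset_size": 1169903866.0,
    },
    "total": {
        "train": (4849, 19887797.31551596),
        "test": (539, 2210666.6844840385),
        "dataset_size": 22098464.0,
    },
    "verse": {
        "train": (31597, 100532890.75546885),
        "test": (3511, 11171028.24453116),
        "dataset_size": 111703919.0,
    },
}


def train_fraction(config: str) -> float:
    """Share of examples that fall in the train split."""
    train_n, _ = CARD_STATS[config]["train"]
    test_n, _ = CARD_STATS[config]["test"]
    return train_n / (train_n + test_n)


def bytes_are_proportional(config: str, tol: float = 1.0) -> bool:
    """Check that each split's num_bytes equals
    dataset_size * (split examples / all examples), within `tol` bytes."""
    stats = CARD_STATS[config]
    total_n = stats["train"][0] + stats["test"][0]
    for split in ("train", "test"):
        n, reported_bytes = stats[split]
        expected_bytes = stats["dataset_size"] * n / total_n
        if abs(expected_bytes - reported_bytes) > tol:
            return False
    return True


if __name__ == "__main__":
    for name in CARD_STATS:
        print(name, round(train_fraction(name), 4), bytes_are_proportional(name))
```

If the proportionality check holds, the per-split byte counts should be read as estimates, so `download_size` is the more reliable figure when budgeting disk space.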
Provider:
AcapeLlama
Summary of original information
Dataset overview
Config name: line
- Features:
  - title: string
  - mungchi: sequence of integers (int64)
  - output: string
  - guidance: struct containing
    - former: sequence of strings
    - latter: sequence of strings
  - lyrics: string
  - instruction: string
- Splits:
  - train: 1052913479.4 bytes, 430056 examples
  - test: 116990386.6 bytes, 47784 examples
- Download size: 510858742 bytes
- Dataset size: 1169903866.0 bytes
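To make the record shape of the `line` config concrete (the `verse` config lists the same features), the sketch below writes the schema as Python `TypedDict` classes and adds a small structural check. The class names, the helper `matches_line_schema`, and the placeholder record are all hypothetical. The card does not document what each field means, so the values are dummies and not real dataset content.

```python
from typing import List, TypedDict


class Guidance(TypedDict):
    former: List[str]
    latter: List[str]


class LineRecord(TypedDict):
    title: str
    mungchi: List[int]
    output: str
    guidance: Guidance
    lyrics: str
    instruction: str


def _is_list_of(value, item_type) -> bool:
    """True if `value` is a list whose items are all `item_type`."""
    return isinstance(value, list) and all(isinstance(v, item_type) for v in value)


def matches_line_schema(record: dict) -> bool:
    """Structural check against the feature list of the `line` config."""
    try:
        guidance = record["guidance"]
        return (
            isinstance(record["title"], str)
            and _is_list_of(record["mungchi"], int)
            and isinstance(record["output"], str)
            and isinstance(guidance, dict)
            and _is_list_of(guidance["former"], str)
            and _is_list_of(guidance["latter"], str)
            and isinstance(record["lyrics"], str)
            and isinstance(record["instruction"], str)
        )
    except KeyError:
        # A missing field means the record does not match.
        return False


# Placeholder values only; not taken from the dataset.
placeholder: LineRecord = {
    "title": "<title>",
    "mungchi": [0, 1],
    "output": "<output>",
    "guidance": {"former": ["<former line>"], "latter": ["<latter line>"]},
    "lyrics": "<lyrics>",
    "instruction": "<instruction>",
}

if __name__ == "__main__":
    print(matches_line_schema(placeholder))
```

Dropping the `guidance` key from these classes gives the shape of a `total` record, which the card lists with the remaining five features only.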
Config name: total
- Features:
  - title: string
  - mungchi: sequence of integers (int64)
  - output: string
  - lyrics: string
  - instruction: string
- Splits:
  - train: 19887797.31551596 bytes, 4849 examples
  - test: 2210666.6844840385 bytes, 539 examples
- Download size: 7229334 bytes
- Dataset size: 22098464.0 bytes
Config name: verse
- Features:
  - title: string
  - mungchi: sequence of integers (int64)
  - output: string
  - guidance: struct containing
    - former: sequence of strings
    - latter: sequence of strings
  - lyrics: string
  - instruction: string
- Splits:
  - train: 100532890.75546885 bytes, 31597 examples
  - test: 11171028.24453116 bytes, 3511 examples
- Download size: 46457684 bytes
- Dataset size: 111703919.0 bytes
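A minimal sketch of loading one config and split with the Hugging Face `datasets` library, assuming the repository is still publicly available under the ID shown in the title. The wrapper `load_split` and the constants are illustrative names; only `datasets.load_dataset` is a real library call.

```python
REPO_ID = "AcapeLlama/AcapeLlama_v2.0_guidance_induce_align"
CONFIG_NAMES = ("line", "total", "verse")  # configs listed on the card
SPLIT_NAMES = ("train", "test")            # splits listed on the card


def load_split(config: str, split: str = "train"):
    """Load one split of one config, validating the names first."""
    if config not in CONFIG_NAMES:
        raise ValueError(f"unknown config {config!r}; expected one of {CONFIG_NAMES}")
    if split not in SPLIT_NAMES:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLIT_NAMES}")
    # Imported lazily so the name validation works even where
    # the `datasets` package is not installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)


if __name__ == "__main__":
    # Downloads data on first use; `total` is the smallest config.
    ds = load_split("total", "test")
    print(ds.column_names)
```

Starting with the `total` config keeps the first download small (about 7 MB according to the card) before committing to the roughly 511 MB `line` config.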



