AcapeLlama/AcapeLlama_v2.0_guidance
收藏Hugging Face2024-06-03 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/AcapeLlama/AcapeLlama_v2.0_guidance
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: line
features:
- name: title
dtype: string
- name: mungchi
sequence: int64
- name: output
dtype: string
- name: guidance
struct:
- name: former
sequence: string
- name: latter
sequence: string
- name: lyrics
dtype: string
- name: instruction
dtype: string
splits:
- name: train
num_bytes: 1055689194.6
num_examples: 430056
- name: test
num_bytes: 117298799.4
num_examples: 47784
download_size: 509788729
dataset_size: 1172987994.0
- config_name: total
features:
- name: title
dtype: string
- name: mungchi
sequence: int64
- name: output
dtype: string
- name: lyrics
dtype: string
- name: instruction
dtype: string
splits:
- name: train
num_bytes: 18327660.364328135
num_examples: 4849
- name: test
num_bytes: 2037246.6356718633
num_examples: 539
download_size: 6720934
dataset_size: 20364907.0
- config_name: verse
features:
- name: title
dtype: string
- name: mungchi
sequence: int64
- name: output
dtype: string
- name: guidance
struct:
- name: former
sequence: string
- name: latter
sequence: string
- name: lyrics
dtype: string
- name: instruction
dtype: string
splits:
- name: train
num_bytes: 97067036.99333486
num_examples: 31597
- name: test
num_bytes: 10785909.006665148
num_examples: 3511
download_size: 44924462
dataset_size: 107852946.0
configs:
- config_name: line
data_files:
- split: train
path: line/train-*
- split: test
path: line/test-*
- config_name: total
data_files:
- split: train
path: total/train-*
- split: test
path: total/test-*
- config_name: verse
data_files:
- split: train
path: verse/train-*
- split: test
path: verse/test-*
---
提供机构:
AcapeLlama
原始信息汇总
数据集概述
配置名称:line
-
特征:
- title: 字符串类型
- mungchi: 整数序列类型
- output: 字符串类型
- guidance: 结构类型,包含former和latter,均为字符串序列
- lyrics: 字符串类型
- instruction: 字符串类型
-
分割:
- train: 大小为1055689194.6字节,包含430056个样本
- test: 大小为117298799.4字节,包含47784个样本
-
下载大小: 509788729字节
-
数据集大小: 1172987994.0字节
配置名称:total
-
特征:
- title: 字符串类型
- mungchi: 整数序列类型
- output: 字符串类型
- lyrics: 字符串类型
- instruction: 字符串类型
-
分割:
- train: 大小为18327660.364328135字节,包含4849个样本
- test: 大小为2037246.6356718633字节,包含539个样本
-
下载大小: 6720934字节
-
数据集大小: 20364907.0字节
配置名称:verse
-
特征:
- title: 字符串类型
- mungchi: 整数序列类型
- output: 字符串类型
- guidance: 结构类型,包含former和latter,均为字符串序列
- lyrics: 字符串类型
- instruction: 字符串类型
-
分割:
- train: 大小为97067036.99333486字节,包含31597个样本
- test: 大小为10785909.006665148字节,包含3511个样本
-
下载大小: 44924462字节
-
数据集大小: 107852946.0字节



