bhavyagoyal-lexsi/harper-valley-pre-processed-splits
收藏Hugging Face2026-03-24 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/bhavyagoyal-lexsi/harper-valley-pre-processed-splits
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
config_name: multi-turn
features:
- name: conversation_id
dtype: string
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: response
dtype: string
splits:
- name: train
num_bytes: 1811911
num_examples: 3934
- name: validation
num_bytes: 226970
num_examples: 496
- name: test
num_bytes: 230741
num_examples: 505
download_size: 251192
dataset_size: 2269622
configs:
- config_name: multi-turn
data_files:
- split: train
path: multi-turn/train-*
- split: validation
path: multi-turn/validation-*
- split: test
path: multi-turn/test-*
---
数据集信息:
配置名称:多轮对话(multi-turn)
特征字段:
- 字段标识:conversation_id,数据类型:字符串
- 字段标识:messages,为列表类型,列表元素包含两个子字段:
- 子字段名:content,数据类型:字符串
- 子字段名:role,数据类型:字符串
- 字段标识:response,数据类型:字符串
数据集拆分:
- 拆分名称:train(训练集),占用字节数:1811911,样本数量:3934
- 拆分名称:validation(验证集),占用字节数:226970,样本数量:496
- 拆分名称:test(测试集),占用字节数:230741,样本数量:505
下载总大小:251192,数据集总存储大小:2269622
配置项:
- 配置名称:多轮对话(multi-turn),对应数据文件路径:
- 训练集拆分:multi-turn/train-*
- 验证集拆分:multi-turn/validation-*
- 测试集拆分:multi-turn/test-*
提供机构:
bhavyagoyal-lexsi



