anaonymous-aad/GenQA_writing
收藏Hugging Face2024-06-12 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/anaonymous-aad/GenQA_writing
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
dataset_info:
features:
- name: prompt
dtype: string
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
splits:
- name: train
num_bytes: 2421537454.0
num_examples: 932362
- name: test
num_bytes: 259720.73658085594
num_examples: 100
download_size: 1239598473
dataset_size: 2421797174.736581
---
# Dataset Card for "GenQA_writing"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset GenQA_writing includes two main configurations: default. The dataset is split into training and test sets, containing 932362 and 100 samples respectively. The main features of the dataset include prompt and messages, where messages is a list containing content and role as sub-features. The download size of the dataset is 1239598473 bytes, with a total size of 2421797174.736581 bytes.
提供机构:
anaonymous-aad
原始信息汇总
数据集概述
数据集名称
GenQA_writing
数据集配置
- 默认配置
数据文件
- 训练集(train):路径为
data/train-* - 测试集(test):路径为
data/test-*
数据特征
prompt:数据类型为stringmessages:包含以下子特征content:数据类型为stringrole:数据类型为string
数据集划分
- 训练集(train)
- 字节数:2421537454.0
- 样本数:932362
- 测试集(test)
- 字节数:259720.73658085594
- 样本数:100
数据集大小
- 下载大小:1239598473 字节
- 数据集总大小:2421797174.736581 字节



