emozilla/booksum-summary-analysis_gptneox-8192
Source: Hugging Face. Updated 2023-05-30. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/emozilla/booksum-summary-analysis_gptneox-8192
Resource description:
---
dataset_info:
  features:
  - name: input
    dtype: string
  - name: output
    dtype: string
  - name: type
    dtype: string
  splits:
  - name: train
    num_bytes: 194097976.97925937
    num_examples: 10659
  - name: test
    num_bytes: 25683201.043425813
    num_examples: 1570
  - name: validation
    num_bytes: 35799607.99283796
    num_examples: 1824
  download_size: 92249754
  dataset_size: 255580786.01552314
---
# Dataset Card for "booksum-summary-analysis-8192"
A subset of [emozilla/booksum-summary-analysis](https://huggingface.co/datasets/emozilla/booksum-summary-analysis) containing only entries shorter than 8,192 tokens under the [EleutherAI/gpt-neox-20b](https://huggingface.co/EleutherAI/gpt-neox-20b) tokenizer.
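The card does not say how the length filter was applied, so the following is only a minimal sketch of one way such a subset could be produced. It assumes the token count is taken over `input` and `output` together, which the card does not confirm. `keep_row`, `filter_rows`, and `whitespace_count` are hypothetical names, and the whitespace counter is a stand-in so the sketch runs without downloading a tokenizer.

```python
from typing import Callable, Dict, Iterable, List

MAX_TOKENS = 8192  # threshold stated in the dataset card


def keep_row(row: Dict[str, str],
             count_tokens: Callable[[str], int],
             max_tokens: int = MAX_TOKENS) -> bool:
    """Return True when the row is strictly under the token limit.

    Assumption: the length is measured over input and output combined;
    the card does not state which fields were counted.
    """
    total = count_tokens(row["input"]) + count_tokens(row["output"])
    return total < max_tokens


def filter_rows(rows: Iterable[Dict[str, str]],
                count_tokens: Callable[[str], int],
                max_tokens: int = MAX_TOKENS) -> List[Dict[str, str]]:
    """Keep only the rows that pass keep_row, preserving their order."""
    return [row for row in rows if keep_row(row, count_tokens, max_tokens)]


def whitespace_count(text: str) -> int:
    """Stand-in token counter (NOT the GPT-NeoX tokenizer)."""
    return len(text.split())
```

To count real tokens, `whitespace_count` could be swapped for a counter built on `transformers.AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")`, for example `lambda text: len(tokenizer(text)["input_ids"])`.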
Provider:
emozilla
Summary of original information
Dataset overview
Dataset name
- Name: booksum-summary-analysis-8192
Dataset features
- Feature 1: input
  - Data type: string
- Feature 2: output
  - Data type: string
- Feature 3: type
  - Data type: string
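As an illustration of the schema listed above, here is a small sketch that checks whether a row carries the three string features. `schema_problems` and `EXPECTED_FEATURES` are hypothetical names introduced for this example, not part of the dataset or any library.

```python
from typing import Any, Dict, List

# Feature names as listed in the dataset card; every feature is a string.
EXPECTED_FEATURES = ("input", "output", "type")


def schema_problems(row: Dict[str, Any]) -> List[str]:
    """Return a list of human-readable schema problems (empty if none)."""
    problems = []
    for name in EXPECTED_FEATURES:
        if name not in row:
            problems.append(f"missing feature: {name}")
        elif not isinstance(row[name], str):
            problems.append(f"{name} is {type(row[name]).__name__}, expected str")
    return problems
```

With the `datasets` library installed, rows obtained from `load_dataset("emozilla/booksum-summary-analysis_gptneox-8192", split="train")` could be passed to `schema_problems` one at a time.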
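Dataset splits
- Training set:
  - Number of examples: 10659
  - Storage size: 194097976.97925937 bytes
- Test set:
  - Number of examples: 1570
  - Storage size: 25683201.043425813 bytes
- Validation set:
  - Number of examples: 1824
  - Storage size: 35799607.99283796 bytes
Dataset size
- Download size: 92249754 bytes
- Total storage size: 255580786.01552314 bytes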
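The figures above can be cross-checked with a few lines of arithmetic. The sketch below copies the numbers from the card and confirms that the per-split byte counts add up to the reported total; `summarize` is a hypothetical helper, and the one-byte tolerance is an arbitrary choice to absorb floating-point rounding.

```python
# Split metadata copied from the dataset card.
SPLITS = {
    "train": {"num_bytes": 194097976.97925937, "num_examples": 10659},
    "test": {"num_bytes": 25683201.043425813, "num_examples": 1570},
    "validation": {"num_bytes": 35799607.99283796, "num_examples": 1824},
}
DOWNLOAD_SIZE = 92249754
DATASET_SIZE = 255580786.01552314


def summarize(splits, dataset_size, download_size):
    """Aggregate per-split metadata and compare it with the reported totals."""
    total_examples = sum(s["num_examples"] for s in splits.values())
    total_bytes = sum(s["num_bytes"] for s in splits.values())
    return {
        "total_examples": total_examples,
        "total_bytes": total_bytes,
        # True when the split sizes sum to the reported dataset size.
        "bytes_match": abs(total_bytes - dataset_size) < 1.0,
        # Download size as a fraction of the reported dataset size.
        "download_ratio": download_size / dataset_size,
        # Fraction of all examples that falls in each split.
        "share": {
            name: s["num_examples"] / total_examples
            for name, s in splits.items()
        },
    }


summary = summarize(SPLITS, DATASET_SIZE, DOWNLOAD_SIZE)
print(summary["total_examples"])  # 14053
print(summary["bytes_match"])     # True
```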



