Pokce/NewYorkTimes00-07
Source: Hugging Face. Updated 2024-03-18. Indexed 2024-06-11.
Download link:
https://hf-mirror.com/datasets/Pokce/NewYorkTimes00-07
Resource description:
---
dataset_info:
- config_name: NYT_length128
  features:
  - name: id
    dtype: string
  - name: year
    dtype: string
  - name: month
    dtype: string
  - name: day
    dtype: string
  - name: title
    dtype: string
  - name: lead_paragraph
    dtype: string
  - name: input
    dtype: string
  splits:
  - name: NYT_length128
    num_bytes: 805898669
    num_examples: 497220
  download_size: 543799180
  dataset_size: 805898669
- config_name: NYT_length256
  features:
  - name: id
    dtype: string
  - name: year
    dtype: string
  - name: month
    dtype: string
  - name: day
    dtype: string
  - name: title
    dtype: string
  - name: lead_paragraph
    dtype: string
  - name: input
    dtype: string
  splits:
  - name: NYT_length256
    num_bytes: 986506429
    num_examples: 407952
  download_size: 656741234
  dataset_size: 986506429
- config_name: NYT_length32
  features:
  - name: id
    dtype: string
  - name: year
    dtype: string
  - name: month
    dtype: string
  - name: day
    dtype: string
  - name: title
    dtype: string
  - name: lead_paragraph
    dtype: string
  - name: input
    dtype: string
  splits:
  - name: NYT_length32
    num_bytes: 632875181
    num_examples: 659496
  download_size: 427035594
  dataset_size: 632875181
- config_name: NYT_length512
  features:
  - name: id
    dtype: string
  - name: year
    dtype: string
  - name: month
    dtype: string
  - name: day
    dtype: string
  - name: title
    dtype: string
  - name: lead_paragraph
    dtype: string
  - name: input
    dtype: string
  splits:
  - name: NYT_length512
    num_bytes: 1255048663
    num_examples: 317512
  download_size: 818477539
  dataset_size: 1255048663
- config_name: NYT_length64
  features:
  - name: id
    dtype: string
  - name: year
    dtype: string
  - name: month
    dtype: string
  - name: day
    dtype: string
  - name: title
    dtype: string
  - name: lead_paragraph
    dtype: string
  - name: input
    dtype: string
  splits:
  - name: NYT_length64
    num_bytes: 715550864
    num_examples: 600752
  download_size: 484584815
  dataset_size: 715550864
configs:
- config_name: NYT_length128
  data_files:
  - split: NYT_length128
    path: NYT_length128/NYT_length128-*
- config_name: NYT_length256
  data_files:
  - split: NYT_length256
    path: NYT_length256/NYT_length256-*
- config_name: NYT_length32
  data_files:
  - split: NYT_length32
    path: NYT_length32/NYT_length32-*
- config_name: NYT_length512
  data_files:
  - split: NYT_length512
    path: NYT_length512/NYT_length512-*
- config_name: NYT_length64
  data_files:
  - split: NYT_length64
    path: NYT_length64/NYT_length64-*
---
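The metadata above lists five configurations, each with a single split whose name matches the configuration name. A minimal sketch of loading one of them follows, assuming the Hugging Face `datasets` library is installed and the Hub (or a mirror) is reachable. The helper names `config_name` and `load_config` are illustrative and not part of the dataset; the card does not say what the length suffix measures, so the code treats it only as part of the name.

```python
# Sketch: load one configuration of Pokce/NewYorkTimes00-07.
# Assumption: the `datasets` library is installed and the Hub is reachable.

REPO_ID = "Pokce/NewYorkTimes00-07"
LENGTHS = (32, 64, 128, 256, 512)  # suffixes of the five config names in the card


def config_name(length: int) -> str:
    """Return the config name for a length suffix, e.g. 128 -> 'NYT_length128'."""
    if length not in LENGTHS:
        raise ValueError(f"unknown length {length}; expected one of {LENGTHS}")
    return f"NYT_length{length}"


def load_config(length: int, streaming: bool = True):
    """Load one config. In this card the split name equals the config name."""
    # Imported lazily so config_name() works even without `datasets` installed.
    from datasets import load_dataset

    name = config_name(length)
    # streaming=True avoids downloading several hundred MB before the first row.
    return load_dataset(REPO_ID, name, split=name, streaming=streaming)
```

With streaming enabled, `ds = load_config(128)` followed by `next(iter(ds))` should yield a dict with the seven string fields listed in the card (`id`, `year`, `month`, `day`, `title`, `lead_paragraph`, `input`).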
Provider:
Pokce
Summary of original information
Dataset overview
Dataset configurations
- NYT_length128
  - Features:
    - id: string
    - year: string
    - month: string
    - day: string
    - title: string
    - lead_paragraph: string
    - input: string
  - Splits:
    - NYT_length128:
      - Bytes: 805898669
      - Examples: 497220
  - Download size: 543799180 bytes
  - Dataset size: 805898669 bytes
- NYT_length256
  - Features:
    - id: string
    - year: string
    - month: string
    - day: string
    - title: string
    - lead_paragraph: string
    - input: string
  - Splits:
    - NYT_length256:
      - Bytes: 986506429
      - Examples: 407952
  - Download size: 656741234 bytes
  - Dataset size: 986506429 bytes
- NYT_length32
  - Features:
    - id: string
    - year: string
    - month: string
    - day: string
    - title: string
    - lead_paragraph: string
    - input: string
  - Splits:
    - NYT_length32:
      - Bytes: 632875181
      - Examples: 659496
  - Download size: 427035594 bytes
  - Dataset size: 632875181 bytes
- NYT_length512
  - Features:
    - id: string
    - year: string
    - month: string
    - day: string
    - title: string
    - lead_paragraph: string
    - input: string
  - Splits:
    - NYT_length512:
      - Bytes: 1255048663
      - Examples: 317512
  - Download size: 818477539 bytes
  - Dataset size: 1255048663 bytes
- NYT_length64
  - Features:
    - id: string
    - year: string
    - month: string
    - day: string
    - title: string
    - lead_paragraph: string
    - input: string
  - Splits:
    - NYT_length64:
      - Bytes: 715550864
      - Examples: 600752
  - Download size: 484584815 bytes
  - Dataset size: 715550864 bytes
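The per-configuration figures are easier to compare once normalized. The sketch below derives the average stored bytes per example and the download-to-dataset size ratio from the numbers in the card; the `CONFIGS` dict and `summarize` helper are illustrative names, and the values are copied from the metadata rather than measured.

```python
# Figures copied from the dataset card metadata (all sizes in bytes).
CONFIGS = {
    "NYT_length32": {"num_examples": 659496, "dataset_size": 632875181, "download_size": 427035594},
    "NYT_length64": {"num_examples": 600752, "dataset_size": 715550864, "download_size": 484584815},
    "NYT_length128": {"num_examples": 497220, "dataset_size": 805898669, "download_size": 543799180},
    "NYT_length256": {"num_examples": 407952, "dataset_size": 986506429, "download_size": 656741234},
    "NYT_length512": {"num_examples": 317512, "dataset_size": 1255048663, "download_size": 818477539},
}


def summarize(configs):
    """Return average bytes per example and download/dataset ratio per config."""
    summary = {}
    for name, meta in configs.items():
        summary[name] = {
            "avg_bytes_per_example": meta["dataset_size"] / meta["num_examples"],
            "download_ratio": meta["download_size"] / meta["dataset_size"],
        }
    return summary


SUMMARY = summarize(CONFIGS)
for name, row in SUMMARY.items():
    print(
        f"{name}: {row['avg_bytes_per_example']:.0f} bytes/example, "
        f"download/dataset = {row['download_ratio']:.2f}"
    )
```

Read this way, the example count falls and the average record size grows as the suffix increases from 32 to 512, while the download is roughly two thirds of the in-memory size for every configuration.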
Data files
- NYT_length128:
  - Split: NYT_length128
  - Path: NYT_length128/NYT_length128-*
- NYT_length256:
  - Split: NYT_length256
  - Path: NYT_length256/NYT_length256-*
- NYT_length32:
  - Split: NYT_length32
  - Path: NYT_length32/NYT_length32-*
- NYT_length512:
  - Split: NYT_length512
  - Path: NYT_length512/NYT_length512-*
- NYT_length64:
  - Split: NYT_length64
  - Path: NYT_length64/NYT_length64-*
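Each path above is a glob pattern of the form `<config>/<config>-*` that selects the shard files for one split. The sketch below shows how such a pattern picks files out of a repository listing, using Python's `fnmatch` as a stand-in; the Hub's own pattern resolution may differ in detail, and the shard file names in `example_files` are hypothetical, since the card does not list them.

```python
from fnmatch import fnmatchcase


def data_file_pattern(config: str) -> str:
    """Build the glob pattern used in the card, e.g. 'NYT_length32/NYT_length32-*'."""
    return f"{config}/{config}-*"


def files_for_config(config: str, repo_files: list[str]) -> list[str]:
    """Return the files from repo_files that match the config's glob pattern."""
    pattern = data_file_pattern(config)
    return [path for path in repo_files if fnmatchcase(path, pattern)]


# Hypothetical listing: the real shard names and counts are not given in the card.
example_files = [
    "NYT_length32/NYT_length32-00000-of-00002.parquet",
    "NYT_length32/NYT_length32-00001-of-00002.parquet",
    "NYT_length64/NYT_length64-00000-of-00002.parquet",
    "README.md",
]

print(files_for_config("NYT_length32", example_files))
```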



