paoloitaliani/news_articles
收藏Hugging Face2024-01-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/paoloitaliani/news_articles
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: corriere_autunno
features:
- name: author
dtype: string
- name: journal
dtype: string
- name: body
dtype: string
- name: date
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 339578
num_examples: 90
download_size: 237083
dataset_size: 339578
- config_name: corriere_primavera
features:
- name: author
dtype: string
- name: journal
dtype: string
- name: body
dtype: string
- name: date
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 319422
num_examples: 105
download_size: 206264
dataset_size: 319422
- config_name: fattoq_autunno
features:
- name: author
dtype: string
- name: journal
dtype: string
- name: body
dtype: string
- name: date
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 519012
num_examples: 133
download_size: 338948
dataset_size: 519012
- config_name: fattoq_primavera
features:
- name: author
dtype: string
- name: journal
dtype: string
- name: body
dtype: string
- name: date
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 508621
num_examples: 152
download_size: 331977
dataset_size: 508621
- config_name: ukraine
features:
- name: date
dtype: timestamp[ns]
- name: body
dtype: string
- name: author
dtype: string
- name: journal
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 81923456
num_examples: 27449
download_size: 0
dataset_size: 81923456
configs:
- config_name: corriere_autunno
data_files:
- split: train
path: corriere_autunno/train-*
- config_name: corriere_primavera
data_files:
- split: train
path: corriere_primavera/train-*
- config_name: fattoq_autunno
data_files:
- split: train
path: fattoq_autunno/train-*
- config_name: fattoq_primavera
data_files:
- split: train
path: fattoq_primavera/train-*
- config_name: ukraine
data_files:
- split: train
path: ukraine/train-*
---
# Dataset Card for "news_articles"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
paoloitaliani
原始信息汇总
数据集概述
数据集配置
corriere_autunno
- 特征:
- author: string
- journal: string
- body: string
- date: string
- index_level_0: int64
- 分割:
- train:
- 字节数: 339578
- 样本数: 90
- train:
- 下载大小: 237083
- 数据集大小: 339578
- 数据文件:
- train: corriere_autunno/train-*
corriere_primavera
- 特征:
- author: string
- journal: string
- body: string
- date: string
- index_level_0: int64
- 分割:
- train:
- 字节数: 319422
- 样本数: 105
- train:
- 下载大小: 206264
- 数据集大小: 319422
- 数据文件:
- train: corriere_primavera/train-*
fattoq_autunno
- 特征:
- author: string
- journal: string
- body: string
- date: string
- index_level_0: int64
- 分割:
- train:
- 字节数: 519012
- 样本数: 133
- train:
- 下载大小: 338948
- 数据集大小: 519012
- 数据文件:
- train: fattoq_autunno/train-*
fattoq_primavera
- 特征:
- author: string
- journal: string
- body: string
- date: string
- index_level_0: int64
- 分割:
- train:
- 字节数: 508621
- 样本数: 152
- train:
- 下载大小: 331977
- 数据集大小: 508621
- 数据文件:
- train: fattoq_primavera/train-*
ukraine
- 特征:
- date: timestamp[ns]
- body: string
- author: string
- journal: string
- index_level_0: int64
- 分割:
- train:
- 字节数: 81923456
- 样本数: 27449
- train:
- 下载大小: 0
- 数据集大小: 81923456
- 数据文件:
- train: ukraine/train-*



