hacktoberfest-corpus-es/colmbian_spanish_news
收藏Hugging Face2023-10-10 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/hacktoberfest-corpus-es/colmbian_spanish_news
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-2.0
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
- split: valid
path: data/valid-*
dataset_info:
features:
- name: news_id
dtype: string
- name: news_url_absolute
dtype: string
- name: news_init_date
dtype: string
- name: news_final_date
dtype: string
- name: news_title
dtype: string
- name: news_text_content
dtype: string
- name: entailment
dtype: float64
- name: category
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 262518060.61903325
num_examples: 60920
- name: test
num_bytes: 13130212.257160116
num_examples: 3047
- name: valid
num_bytes: 52503612.12380665
num_examples: 12184
download_size: 195538787
dataset_size: 328151885.0
---
提供机构:
hacktoberfest-corpus-es
原始信息汇总
数据集概述
许可证
- 数据集许可证:cc-by-2.0
配置
- 配置名称:default
- 数据文件:
- 训练集(train):路径为
data/train-* - 测试集(test):路径为
data/test-* - 验证集(valid):路径为
data/valid-*
- 训练集(train):路径为
- 数据文件:
数据集信息
-
特征:
news_id:字符串类型news_url_absolute:字符串类型news_init_date:字符串类型news_final_date:字符串类型news_title:字符串类型news_text_content:字符串类型entailment:浮点数类型(float64)category:字符串类型__index_level_0__:整数类型(int64)
-
数据分割:
- 训练集(train):
- 字节数:262518060.61903325
- 样本数:60920
- 测试集(test):
- 字节数:13130212.257160116
- 样本数:3047
- 验证集(valid):
- 字节数:52503612.12380665
- 样本数:12184
- 训练集(train):
-
数据集大小:
- 下载大小:195538787 字节
- 数据集大小:328151885.0 字节



