WikiQuality/all_methods_hi
Source: Hugging Face · Updated 2024-06-15 · Indexed 2024-06-29
Download link:
https://hf-mirror.com/datasets/WikiQuality/all_methods_hi
Resource description:
---
dataset_info:
- config_name: ha
  features:
  - name: id
    dtype: string
  - name: url
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 6223159.543542851
    num_examples: 2968
  - name: test
    num_bytes: 329190.0432399688
    num_examples: 157
  download_size: 17421528
  dataset_size: 6552349.58678282
- config_name: ig
  features:
  - name: id
    dtype: string
  - name: url
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 6819725.051282051
    num_examples: 2394
  - name: test
    num_bytes: 358932.89743589744
    num_examples: 126
  download_size: 13789639
  dataset_size: 7178657.948717948
- config_name: pcm
  features:
  - name: id
    dtype: string
  - name: url
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 255445.5
    num_examples: 171
  - name: test
    num_bytes: 14938.333333333334
    num_examples: 10
  download_size: 484427
  dataset_size: 270383.8333333333
- config_name: sw
  features:
  - name: id
    dtype: string
  - name: url
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 4405812.048501815
    num_examples: 4839
  - name: test
    num_bytes: 232172.3646141688
    num_examples: 255
  download_size: 16709066
  dataset_size: 4637984.413115984
- config_name: yo
  features:
  - name: id
    dtype: string
  - name: url
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 291244.733814741
    num_examples: 618
  - name: test
    num_bytes: 15551.903261952191
    num_examples: 33
  download_size: 2763763
  dataset_size: 306796.63707669324
configs:
- config_name: ha
  data_files:
  - split: train
    path: ha/train-*
  - split: test
    path: ha/test-*
- config_name: ig
  data_files:
  - split: train
    path: ig/train-*
  - split: test
    path: ig/test-*
- config_name: pcm
  data_files:
  - split: train
    path: pcm/train-*
  - split: test
    path: pcm/test-*
- config_name: sw
  data_files:
  - split: train
    path: sw/train-*
  - split: test
    path: sw/test-*
- config_name: yo
  data_files:
  - split: train
    path: yo/train-*
  - split: test
    path: yo/test-*
---
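The metadata above names five configurations, each with a `train` and a `test` split. A minimal sketch of loading one of them follows, assuming the Hugging Face `datasets` package is installed and the Hub (or a mirror) is reachable. The helper name `load_split` and the constants are illustrative and not part of the dataset card.

```python
# Sketch: load one split of one configuration of the dataset.
# Assumption: the `datasets` package is installed and the Hub is reachable.

REPO_ID = "WikiQuality/all_methods_hi"
CONFIGS = ["ha", "ig", "pcm", "sw", "yo"]  # config names listed in the card
SPLITS = ["train", "test"]                 # split names listed in the card


def load_split(config: str, split: str = "train"):
    """Return one split of one configuration as a `datasets.Dataset`.

    `load_split` is a hypothetical convenience wrapper, not an API of the dataset.
    """
    if config not in CONFIGS:
        raise ValueError(f"unknown config {config!r}; expected one of {CONFIGS}")
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so this module can be imported without the dependency.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)
```

Calling `load_split("ha", "test")` should return a dataset whose columns correspond to the `id`, `url`, `title`, and `text` features declared in the card.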
Provider:
WikiQuality
Summary of original information
Dataset overview
Dataset configurations
Configuration: ha
- Features:
  - id: string
  - url: string
  - title: string
  - text: string
- Splits:
  - train:
    - Bytes: 6223159.543542851
    - Examples: 2968
  - test:
    - Bytes: 329190.0432399688
    - Examples: 157
- Download size (bytes): 17421528
- Dataset size (bytes): 6552349.58678282
Configuration: ig
- Features:
  - id: string
  - url: string
  - title: string
  - text: string
- Splits:
  - train:
    - Bytes: 6819725.051282051
    - Examples: 2394
  - test:
    - Bytes: 358932.89743589744
    - Examples: 126
- Download size (bytes): 13789639
- Dataset size (bytes): 7178657.948717948
Configuration: pcm
- Features:
  - id: string
  - url: string
  - title: string
  - text: string
- Splits:
  - train:
    - Bytes: 255445.5
    - Examples: 171
  - test:
    - Bytes: 14938.333333333334
    - Examples: 10
- Download size (bytes): 484427
- Dataset size (bytes): 270383.8333333333
Configuration: sw
- Features:
  - id: string
  - url: string
  - title: string
  - text: string
- Splits:
  - train:
    - Bytes: 4405812.048501815
    - Examples: 4839
  - test:
    - Bytes: 232172.3646141688
    - Examples: 255
- Download size (bytes): 16709066
- Dataset size (bytes): 4637984.413115984
Configuration: yo
- Features:
  - id: string
  - url: string
  - title: string
  - text: string
- Splits:
  - train:
    - Bytes: 291244.733814741
    - Examples: 618
  - test:
    - Bytes: 15551.903261952191
    - Examples: 33
- Download size (bytes): 2763763
- Dataset size (bytes): 306796.63707669324
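The example counts listed per configuration can be cross-checked with a little arithmetic. The sketch below copies the counts from the card and computes the share of examples held out for `test` in each configuration, plus the overall totals. The function names are illustrative only.

```python
# Example counts copied from the split metadata in the dataset card.
SPLIT_SIZES = {
    "ha": {"train": 2968, "test": 157},
    "ig": {"train": 2394, "test": 126},
    "pcm": {"train": 171, "test": 10},
    "sw": {"train": 4839, "test": 255},
    "yo": {"train": 618, "test": 33},
}


def held_out_fraction(config: str) -> float:
    """Fraction of a configuration's examples that belong to the test split."""
    sizes = SPLIT_SIZES[config]
    return sizes["test"] / (sizes["train"] + sizes["test"])


def totals() -> tuple[int, int]:
    """Total (train, test) example counts across all configurations."""
    train = sum(sizes["train"] for sizes in SPLIT_SIZES.values())
    test = sum(sizes["test"] for sizes in SPLIT_SIZES.values())
    return train, test


for name in SPLIT_SIZES:
    print(f"{name}: {held_out_fraction(name):.3f}")
print(totals())  # (10990, 581)
```

Every configuration holds out roughly 5% of its examples for testing; `pcm` is the smallest, with only 10 test examples.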
Data file paths
Configuration: ha
- train: ha/train-*
- test: ha/test-*
Configuration: ig
- train: ig/train-*
- test: ig/test-*
Configuration: pcm
- train: pcm/train-*
- test: pcm/test-*
Configuration: sw
- train: sw/train-*
- test: sw/test-*
Configuration: yo
- train: yo/train-*
- test: yo/test-*
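The paths above are glob patterns, so each split may be stored as one or more files sharing a prefix. The following sketch illustrates the matching semantics with Python's standard `fnmatch` module. It is not how the Hub or the `datasets` library resolves files internally, and the shard file name used in the usage note is hypothetical.

```python
from fnmatch import fnmatchcase

# Patterns copied from the card: one directory per configuration,
# one file-name prefix per split.
DATA_FILES = {
    config: {"train": f"{config}/train-*", "test": f"{config}/test-*"}
    for config in ["ha", "ig", "pcm", "sw", "yo"]
}


def split_for(path: str):
    """Return the (config, split) whose pattern matches `path`, or None."""
    for config, patterns in DATA_FILES.items():
        for split, pattern in patterns.items():
            if fnmatchcase(path, pattern):
                return config, split
    return None
```

With a hypothetical shard name, `split_for("sw/train-00000-of-00001.parquet")` returns `("sw", "train")`, while a path such as `sw/README.md` matches no pattern and returns `None`.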



