mbzuai-ugrip-statement-tuning/paws-x
Source: Hugging Face. Updated 2024-06-06; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/mbzuai-ugrip-statement-tuning/paws-x
Description:
---
dataset_info:
- config_name: de
  features:
  - name: id
    dtype: int32
  - name: statement
    dtype: string
  - name: is_true
    dtype: int64
  splits:
  - name: train
    num_bytes: 13738169
    num_examples: 49401
  - name: test
    num_bytes: 561887
    num_examples: 2000
  - name: validation
    num_bytes: 552084
    num_examples: 2000
  download_size: 7118579
  dataset_size: 14852140
- config_name: en
  features:
  - name: id
    dtype: int32
  - name: statement
    dtype: string
  - name: is_true
    dtype: int64
  splits:
  - name: train
    num_bytes: 13152301
    num_examples: 49401
  - name: test
    num_bytes: 532784
    num_examples: 2000
  - name: validation
    num_bytes: 529969
    num_examples: 2000
  download_size: 6713379
  dataset_size: 14215054
- config_name: es
  features:
  - name: id
    dtype: int32
  - name: statement
    dtype: string
  - name: is_true
    dtype: int64
  splits:
  - name: train
    num_bytes: 13742951
    num_examples: 49401
  - name: test
    num_bytes: 556917
    num_examples: 2000
  - name: validation
    num_bytes: 552015
    num_examples: 2000
  download_size: 7157169
  dataset_size: 14851883
- config_name: fr
  features:
  - name: id
    dtype: int32
  - name: statement
    dtype: string
  - name: is_true
    dtype: int64
  splits:
  - name: train
    num_bytes: 14230932
    num_examples: 49401
  - name: test
    num_bytes: 572935
    num_examples: 2000
  - name: validation
    num_bytes: 570804
    num_examples: 2000
  download_size: 7392705
  dataset_size: 15374671
- config_name: ja
  features:
  - name: id
    dtype: int32
  - name: statement
    dtype: string
  - name: is_true
    dtype: int64
  splits:
  - name: train
    num_bytes: 15978230
    num_examples: 49401
  - name: test
    num_bytes: 706386
    num_examples: 2000
  - name: validation
    num_bytes: 699700
    num_examples: 2000
  download_size: 8155950
  dataset_size: 17384316
- config_name: ko
  features:
  - name: id
    dtype: int32
  - name: statement
    dtype: string
  - name: is_true
    dtype: int64
  splits:
  - name: train
    num_bytes: 14870075
    num_examples: 49401
  - name: test
    num_bytes: 600293
    num_examples: 2000
  - name: validation
    num_bytes: 592748
    num_examples: 2000
  download_size: 8096378
  dataset_size: 16063116
- config_name: zh
  features:
  - name: id
    dtype: int32
  - name: statement
    dtype: string
  - name: is_true
    dtype: int64
  splits:
  - name: train
    num_bytes: 11750720
    num_examples: 49401
  - name: test
    num_bytes: 512516
    num_examples: 2000
  - name: validation
    num_bytes: 511056
    num_examples: 2000
  download_size: 6842723
  dataset_size: 12774292
configs:
- config_name: de
  data_files:
  - split: train
    path: de/train-*
  - split: test
    path: de/test-*
  - split: validation
    path: de/validation-*
- config_name: en
  data_files:
  - split: train
    path: en/train-*
  - split: test
    path: en/test-*
  - split: validation
    path: en/validation-*
- config_name: es
  data_files:
  - split: train
    path: es/train-*
  - split: test
    path: es/test-*
  - split: validation
    path: es/validation-*
- config_name: fr
  data_files:
  - split: train
    path: fr/train-*
  - split: test
    path: fr/test-*
  - split: validation
    path: fr/validation-*
- config_name: ja
  data_files:
  - split: train
    path: ja/train-*
  - split: test
    path: ja/test-*
  - split: validation
    path: ja/validation-*
- config_name: ko
  data_files:
  - split: train
    path: ko/train-*
  - split: test
    path: ko/test-*
  - split: validation
    path: ko/validation-*
- config_name: zh
  data_files:
  - split: train
    path: zh/train-*
  - split: test
    path: zh/test-*
  - split: validation
    path: zh/validation-*
---
Provider:
mbzuai-ugrip-statement-tuning
Summary of the original metadata
Dataset overview
Dataset configurations (one per language)
- de
- en
- es
- fr
- ja
- ko
- zh
Features
- id
  - dtype: int32
- statement
  - dtype: string
- is_true
  - dtype: int64
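
The three columns above can be checked with a small row validator. This is only a sketch: the `StatementExample` class, the `validate` helper, and the sample row used with them are hypothetical, and the card does not say which values `is_true` takes, so the check only enforces the declared integer widths.

```python
from dataclasses import dataclass

# Value ranges implied by the declared dtypes.
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1
INT64_MIN, INT64_MAX = -2**63, 2**63 - 1


@dataclass(frozen=True)
class StatementExample:
    """One row with the three columns listed in the card (hypothetical helper)."""
    id: int          # declared as int32
    statement: str   # declared as string
    is_true: int     # declared as int64


def validate(row: dict) -> StatementExample:
    """Check a plain dict against the declared column names and dtypes."""
    expected = {"id", "statement", "is_true"}
    if set(row) != expected:
        raise ValueError(f"unexpected columns: {sorted(set(row) ^ expected)}")
    if not isinstance(row["id"], int) or not INT32_MIN <= row["id"] <= INT32_MAX:
        raise ValueError("id must be an integer that fits in int32")
    if not isinstance(row["statement"], str):
        raise ValueError("statement must be a string")
    if not isinstance(row["is_true"], int) or not INT64_MIN <= row["is_true"] <= INT64_MAX:
        raise ValueError("is_true must be an integer that fits in int64")
    return StatementExample(row["id"], row["statement"], row["is_true"])


if __name__ == "__main__":
    # Made-up row for illustration; not taken from the dataset.
    print(validate({"id": 1, "statement": "placeholder statement", "is_true": 1}))
```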
Dataset splits
- train
  - Number of examples: 49401 (per language)
  - Size in bytes:
    - de: 13738169
    - en: 13152301
    - es: 13742951
    - fr: 14230932
    - ja: 15978230
    - ko: 14870075
    - zh: 11750720
- test
  - Number of examples: 2000 (per language)
  - Size in bytes:
    - de: 561887
    - en: 532784
    - es: 556917
    - fr: 572935
    - ja: 706386
    - ko: 600293
    - zh: 512516
- validation
  - Number of examples: 2000 (per language)
  - Size in bytes:
    - de: 552084
    - en: 529969
    - es: 552015
    - fr: 570804
    - ja: 699700
    - ko: 592748
    - zh: 511056
Download and dataset sizes
- Download size (bytes)
  - de: 7118579
  - en: 6713379
  - es: 7157169
  - fr: 7392705
  - ja: 8155950
  - ko: 8096378
  - zh: 6842723
- Dataset size (bytes)
  - de: 14852140
  - en: 14215054
  - es: 14851883
  - fr: 15374671
  - ja: 17384316
  - ko: 16063116
  - zh: 12774292
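
The byte counts above can be cross-checked with a few lines of arithmetic: for each configuration, the three split sizes should add up to the dataset size. The sketch below copies the numbers from the metadata; the `SIZES` table and `check_sizes` helper are illustrative names, and reading the dataset-to-download ratio as a storage ratio assumes the usual Hugging Face meaning of the two fields (downloaded files versus prepared tables), which this card does not spell out.

```python
# Byte counts copied from the card metadata:
# language -> ((train, test, validation), download_size, dataset_size)
SIZES = {
    "de": ((13738169, 561887, 552084), 7118579, 14852140),
    "en": ((13152301, 532784, 529969), 6713379, 14215054),
    "es": ((13742951, 556917, 552015), 7157169, 14851883),
    "fr": ((14230932, 572935, 570804), 7392705, 15374671),
    "ja": ((15978230, 706386, 699700), 8155950, 17384316),
    "ko": ((14870075, 600293, 592748), 8096378, 16063116),
    "zh": ((11750720, 512516, 511056), 6842723, 12774292),
}


def check_sizes(sizes: dict) -> dict:
    """Report, per language, whether the split sizes sum to dataset_size,
    plus the ratio of dataset_size to download_size."""
    report = {}
    for lang, (splits, download, total) in sizes.items():
        report[lang] = {
            "sum_matches": sum(splits) == total,
            "size_ratio": total / download,
        }
    return report


if __name__ == "__main__":
    for lang, row in check_sizes(SIZES).items():
        print(f"{lang}: sum_matches={row['sum_matches']} ratio={row['size_ratio']:.2f}")
```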
Data file paths
- train
  - Path pattern: [language]/train-*
- test
  - Path pattern: [language]/test-*
- validation
  - Path pattern: [language]/validation-*
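
The path patterns above can be generated per language, and a configuration can be loaded by name. In this sketch, `data_files_for` and `load_config` are hypothetical helper names. The loading step assumes the `datasets` library is installed and that the repository `mbzuai-ugrip-statement-tuning/paws-x` is reachable on the Hugging Face Hub; it is not called when the script runs.

```python
LANGS = ("de", "en", "es", "fr", "ja", "ko", "zh")
SPLITS = ("train", "test", "validation")


def data_files_for(lang: str) -> dict:
    """Build the split -> glob pattern mapping listed in the card for one language."""
    if lang not in LANGS:
        raise ValueError(f"unknown config {lang!r}; expected one of {LANGS}")
    return {split: f"{lang}/{split}-*" for split in SPLITS}


def load_config(lang: str, split: str = "train"):
    """Load one language config from the Hub (needs `datasets` and network access)."""
    from datasets import load_dataset  # imported lazily so the rest runs offline

    data_files_for(lang)  # reuse the config-name check
    return load_dataset("mbzuai-ugrip-statement-tuning/paws-x", lang, split=split)


if __name__ == "__main__":
    print(data_files_for("de"))
```

Because the card already declares these patterns under `configs`, passing the config name to `load_dataset` should be enough; the pattern builder is mainly useful when working with a local copy of the files.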



