five

mbzuai-ugrip-statement-tuning/paws-x

收藏
Hugging Face2024-06-06 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/mbzuai-ugrip-statement-tuning/paws-x
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: de features: - name: id dtype: int32 - name: statement dtype: string - name: is_true dtype: int64 splits: - name: train num_bytes: 13738169 num_examples: 49401 - name: test num_bytes: 561887 num_examples: 2000 - name: validation num_bytes: 552084 num_examples: 2000 download_size: 7118579 dataset_size: 14852140 - config_name: en features: - name: id dtype: int32 - name: statement dtype: string - name: is_true dtype: int64 splits: - name: train num_bytes: 13152301 num_examples: 49401 - name: test num_bytes: 532784 num_examples: 2000 - name: validation num_bytes: 529969 num_examples: 2000 download_size: 6713379 dataset_size: 14215054 - config_name: es features: - name: id dtype: int32 - name: statement dtype: string - name: is_true dtype: int64 splits: - name: train num_bytes: 13742951 num_examples: 49401 - name: test num_bytes: 556917 num_examples: 2000 - name: validation num_bytes: 552015 num_examples: 2000 download_size: 7157169 dataset_size: 14851883 - config_name: fr features: - name: id dtype: int32 - name: statement dtype: string - name: is_true dtype: int64 splits: - name: train num_bytes: 14230932 num_examples: 49401 - name: test num_bytes: 572935 num_examples: 2000 - name: validation num_bytes: 570804 num_examples: 2000 download_size: 7392705 dataset_size: 15374671 - config_name: ja features: - name: id dtype: int32 - name: statement dtype: string - name: is_true dtype: int64 splits: - name: train num_bytes: 15978230 num_examples: 49401 - name: test num_bytes: 706386 num_examples: 2000 - name: validation num_bytes: 699700 num_examples: 2000 download_size: 8155950 dataset_size: 17384316 - config_name: ko features: - name: id dtype: int32 - name: statement dtype: string - name: is_true dtype: int64 splits: - name: train num_bytes: 14870075 num_examples: 49401 - name: test num_bytes: 600293 num_examples: 2000 - name: validation num_bytes: 592748 num_examples: 2000 download_size: 8096378 dataset_size: 16063116 - config_name: zh features: - name: id dtype: int32 - name: statement dtype: string - name: is_true dtype: int64 splits: - name: train num_bytes: 11750720 num_examples: 49401 - name: test num_bytes: 512516 num_examples: 2000 - name: validation num_bytes: 511056 num_examples: 2000 download_size: 6842723 dataset_size: 12774292 configs: - config_name: de data_files: - split: train path: de/train-* - split: test path: de/test-* - split: validation path: de/validation-* - config_name: en data_files: - split: train path: en/train-* - split: test path: en/test-* - split: validation path: en/validation-* - config_name: es data_files: - split: train path: es/train-* - split: test path: es/test-* - split: validation path: es/validation-* - config_name: fr data_files: - split: train path: fr/train-* - split: test path: fr/test-* - split: validation path: fr/validation-* - config_name: ja data_files: - split: train path: ja/train-* - split: test path: ja/test-* - split: validation path: ja/validation-* - config_name: ko data_files: - split: train path: ko/train-* - split: test path: ko/test-* - split: validation path: ko/validation-* - config_name: zh data_files: - split: train path: zh/train-* - split: test path: zh/test-* - split: validation path: zh/validation-* ---
提供机构:
mbzuai-ugrip-statement-tuning
原始信息汇总

数据集概述

数据集配置

  • de
  • en
  • es
  • fr
  • ja
  • ko
  • zh

特征信息

  • id
    • 数据类型:int32
  • statement
    • 数据类型:string
  • is_true
    • 数据类型:int64

数据集拆分

  • train
    • 示例数量:49401
    • 字节数:
      • de: 13738169
      • en: 13152301
      • es: 13742951
      • fr: 14230932
      • ja: 15978230
      • ko: 14870075
      • zh: 11750720
  • test
    • 示例数量:2000
    • 字节数:
      • de: 561887
      • en: 532784
      • es: 556917
      • fr: 572935
      • ja: 706386
      • ko: 600293
      • zh: 512516
  • validation
    • 示例数量:2000
    • 字节数:
      • de: 552084
      • en: 529969
      • es: 552015
      • fr: 570804
      • ja: 699700
      • ko: 592748
      • zh: 511056

下载与数据集大小

  • 下载大小
    • de: 7118579
    • en: 6713379
    • es: 7157169
    • fr: 7392705
    • ja: 8155950
    • ko: 8096378
    • zh: 6842723
  • 数据集大小
    • de: 14852140
    • en: 14215054
    • es: 14851883
    • fr: 15374671
    • ja: 17384316
    • ko: 16063116
    • zh: 12774292

数据文件路径

  • train
    • 路径格式:[语言]/train-*
  • test
    • 路径格式:[语言]/test-*
  • validation
    • 路径格式:[语言]/validation-*
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作