five

ibragim-bad/arcc_multilang

收藏
Hugging Face2024-02-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ibragim-bad/arcc_multilang
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: ar features: - name: id dtype: string - name: question dtype: string - name: choices sequence: - name: text dtype: string - name: label dtype: string - name: answerKey dtype: string splits: - name: train num_bytes: 515979 num_examples: 1117 - name: validation num_bytes: 146393 num_examples: 298 - name: test num_bytes: 555344 num_examples: 1169 download_size: 559228 dataset_size: 1217716 - config_name: de features: - name: id dtype: string - name: question dtype: string - name: choices sequence: - name: text dtype: string - name: label dtype: string - name: answerKey dtype: string splits: - name: train num_bytes: 416218 num_examples: 1116 - name: validation num_bytes: 116268 num_examples: 298 - name: test num_bytes: 445928 num_examples: 1169 download_size: 513244 dataset_size: 978414 - config_name: es features: - name: id dtype: string - name: question dtype: string - name: choices sequence: - name: text dtype: string - name: label dtype: string - name: answerKey dtype: string splits: - name: train num_bytes: 415815 num_examples: 1118 - name: validation num_bytes: 116298 num_examples: 297 - name: test num_bytes: 444815 num_examples: 1170 download_size: 499409 dataset_size: 976928 - config_name: fr features: - name: id dtype: string - name: question dtype: string - name: choices sequence: - name: text dtype: string - name: label dtype: string - name: answerKey dtype: string splits: - name: train num_bytes: 431884 num_examples: 1118 - name: validation num_bytes: 121206 num_examples: 298 - name: test num_bytes: 460727 num_examples: 1169 download_size: 519321 dataset_size: 1013817 - config_name: he features: - name: index dtype: int64 - name: ind dtype: int64 - name: question dtype: string - name: choices struct: - name: label sequence: string - name: text sequence: string - name: id dtype: string - name: answerKey dtype: string - name: split dtype: string splits: - name: validation num_bytes: 116970 num_examples: 270 download_size: 60796 dataset_size: 116970 - config_name: it features: - name: id dtype: string - name: question dtype: string - name: choices sequence: - name: text dtype: string - name: label dtype: string - name: answerKey dtype: string splits: - name: train num_bytes: 411526 num_examples: 1118 - name: validation num_bytes: 114977 num_examples: 297 - name: test num_bytes: 439356 num_examples: 1169 download_size: 506239 dataset_size: 965859 - config_name: ru features: - name: id dtype: string - name: question dtype: string - name: choices sequence: - name: text dtype: string - name: label dtype: string - name: answerKey dtype: string splits: - name: train num_bytes: 617514 num_examples: 1118 - name: validation num_bytes: 171795 num_examples: 297 - name: test num_bytes: 660294 num_examples: 1169 download_size: 669039 dataset_size: 1449603 configs: - config_name: ar data_files: - split: train path: ar/train-* - split: validation path: ar/validation-* - split: test path: ar/test-* - config_name: de data_files: - split: train path: de/train-* - split: validation path: de/validation-* - split: test path: de/test-* - config_name: es data_files: - split: train path: es/train-* - split: validation path: es/validation-* - split: test path: es/test-* - config_name: fr data_files: - split: train path: fr/train-* - split: validation path: fr/validation-* - split: test path: fr/test-* - config_name: he data_files: - split: validation path: he/validation-* - config_name: it data_files: - split: train path: it/train-* - split: validation path: it/validation-* - split: test path: it/test-* - config_name: ru data_files: - split: train path: ru/train-* - split: validation path: ru/validation-* - split: test path: ru/test-* ---
提供机构:
ibragim-bad
原始信息汇总

数据集概述

数据集配置

阿拉伯语 (ar)

  • 特征:
    • id: 字符串
    • question: 字符串
    • choices: 序列
      • text: 字符串
      • label: 字符串
    • answerKey: 字符串
  • 分割:
    • train: 515979 字节, 1117 个样本
    • validation: 146393 字节, 298 个样本
    • test: 555344 字节, 1169 个样本
  • 下载大小: 559228 字节
  • 数据集大小: 1217716 字节

德语 (de)

  • 特征:
    • id: 字符串
    • question: 字符串
    • choices: 序列
      • text: 字符串
      • label: 字符串
    • answerKey: 字符串
  • 分割:
    • train: 416218 字节, 1116 个样本
    • validation: 116268 字节, 298 个样本
    • test: 445928 字节, 1169 个样本
  • 下载大小: 513244 字节
  • 数据集大小: 978414 字节

西班牙语 (es)

  • 特征:
    • id: 字符串
    • question: 字符串
    • choices: 序列
      • text: 字符串
      • label: 字符串
    • answerKey: 字符串
  • 分割:
    • train: 415815 字节, 1118 个样本
    • validation: 116298 字节, 297 个样本
    • test: 444815 字节, 1170 个样本
  • 下载大小: 499409 字节
  • 数据集大小: 976928 字节

法语 (fr)

  • 特征:
    • id: 字符串
    • question: 字符串
    • choices: 序列
      • text: 字符串
      • label: 字符串
    • answerKey: 字符串
  • 分割:
    • train: 431884 字节, 1118 个样本
    • validation: 121206 字节, 298 个样本
    • test: 460727 字节, 1169 个样本
  • 下载大小: 519321 字节
  • 数据集大小: 1013817 字节

希伯来语 (he)

  • 特征:
    • index: 整数64位
    • ind: 整数64位
    • question: 字符串
    • choices: 结构
      • label: 序列字符串
      • text: 序列字符串
    • id: 字符串
    • answerKey: 字符串
    • split: 字符串
  • 分割:
    • validation: 116970 字节, 270 个样本
  • 下载大小: 60796 字节
  • 数据集大小: 116970 字节

意大利语 (it)

  • 特征:
    • id: 字符串
    • question: 字符串
    • choices: 序列
      • text: 字符串
      • label: 字符串
    • answerKey: 字符串
  • 分割:
    • train: 411526 字节, 1118 个样本
    • validation: 114977 字节, 297 个样本
    • test: 439356 字节, 1169 个样本
  • 下载大小: 506239 字节
  • 数据集大小: 965859 字节

俄语 (ru)

  • 特征:
    • id: 字符串
    • question: 字符串
    • choices: 序列
      • text: 字符串
      • label: 字符串
    • answerKey: 字符串
  • 分割:
    • train: 617514 字节, 1118 个样本
    • validation: 171795 字节, 297 个样本
    • test: 660294 字节, 1169 个样本
  • 下载大小: 669039 字节
  • 数据集大小: 1449603 字节

数据文件路径

  • 阿拉伯语 (ar):

    • train: ar/train-*
    • validation: ar/validation-*
    • test: ar/test-*
  • 德语 (de):

    • train: de/train-*
    • validation: de/validation-*
    • test: de/test-*
  • 西班牙语 (es):

    • train: es/train-*
    • validation: es/validation-*
    • test: es/test-*
  • 法语 (fr):

    • train: fr/train-*
    • validation: fr/validation-*
    • test: fr/test-*
  • 希伯来语 (he):

    • validation: he/validation-*
  • 意大利语 (it):

    • train: it/train-*
    • validation: it/validation-*
    • test: it/test-*
  • 俄语 (ru):

    • train: ru/train-*
    • validation: ru/validation-*
    • test: ru/test-*
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作