Thanmay/commonsense_qa-translated
收藏Hugging Face2024-02-12 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Thanmay/commonsense_qa-translated
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: en
features:
- name: id
dtype: string
- name: question
dtype: string
- name: question_concept
dtype: string
- name: choices
sequence:
- name: label
dtype: string
- name: text
dtype: string
- name: answerKey
dtype: string
splits:
- name: test
num_bytes: 257842
num_examples: 1140
- name: validation
num_bytes: 273848
num_examples: 1221
download_size: 311467
dataset_size: 531690
- config_name: gu
features:
- name: id
dtype: string
- name: question_concept
dtype: string
- name: answerKey
dtype: string
- name: question
dtype: string
- name: choices
struct:
- name: label
sequence: string
- name: text
sequence: string
splits:
- name: validation
num_bytes: 509284
num_examples: 1221
- name: test
num_bytes: 481005
num_examples: 1140
download_size: 411754
dataset_size: 990289
- config_name: hi
features:
- name: id
dtype: string
- name: question_concept
dtype: string
- name: answerKey
dtype: string
- name: question
dtype: string
- name: choices
struct:
- name: label
sequence: string
- name: text
sequence: string
splits:
- name: validation
num_bytes: 519155
num_examples: 1221
- name: test
num_bytes: 490275
num_examples: 1140
download_size: 410911
dataset_size: 1009430
- config_name: ml
features:
- name: id
dtype: string
- name: question_concept
dtype: string
- name: answerKey
dtype: string
- name: question
dtype: string
- name: choices
struct:
- name: label
sequence: string
- name: text
sequence: string
splits:
- name: validation
num_bytes: 611370
num_examples: 1221
- name: test
num_bytes: 579108
num_examples: 1140
download_size: 453273
dataset_size: 1190478
- config_name: mr
features:
- name: id
dtype: string
- name: question_concept
dtype: string
- name: answerKey
dtype: string
- name: question
dtype: string
- name: choices
struct:
- name: label
sequence: string
- name: text
sequence: string
splits:
- name: validation
num_bytes: 523687
num_examples: 1221
- name: test
num_bytes: 495642
num_examples: 1140
download_size: 417463
dataset_size: 1019329
- config_name: ta
features:
- name: id
dtype: string
- name: question_concept
dtype: string
- name: answerKey
dtype: string
- name: question
dtype: string
- name: choices
struct:
- name: label
sequence: string
- name: text
sequence: string
splits:
- name: validation
num_bytes: 621423
num_examples: 1221
- name: test
num_bytes: 588898
num_examples: 1140
download_size: 445098
dataset_size: 1210321
configs:
- config_name: en
data_files:
- split: test
path: en/test-*
- split: validation
path: en/validation-*
- config_name: gu
data_files:
- split: validation
path: gu/validation-*
- split: test
path: gu/test-*
- config_name: hi
data_files:
- split: validation
path: hi/validation-*
- split: test
path: hi/test-*
- config_name: ml
data_files:
- split: validation
path: ml/validation-*
- split: test
path: ml/test-*
- config_name: mr
data_files:
- split: validation
path: mr/validation-*
- split: test
path: mr/test-*
- config_name: ta
data_files:
- split: validation
path: ta/validation-*
- split: test
path: ta/test-*
---
提供机构:
Thanmay
原始信息汇总
数据集概述
配置信息
英文配置 (en)
- 特征:
id: 字符串类型question: 字符串类型question_concept: 字符串类型choices: 序列类型,包含以下字段:label: 字符串类型text: 字符串类型
answerKey: 字符串类型
- 分割:
test: 257842 字节,1140 个样本validation: 273848 字节,1221 个样本
- 下载大小: 311467 字节
- 数据集大小: 531690 字节
古吉拉特语配置 (gu)
- 特征:
id: 字符串类型question_concept: 字符串类型answerKey: 字符串类型question: 字符串类型choices: 结构类型,包含以下字段:label: 序列类型,字符串text: 序列类型,字符串
- 分割:
validation: 509284 字节,1221 个样本test: 481005 字节,1140 个样本
- 下载大小: 411754 字节
- 数据集大小: 990289 字节
印地语配置 (hi)
- 特征:
id: 字符串类型question_concept: 字符串类型answerKey: 字符串类型question: 字符串类型choices: 结构类型,包含以下字段:label: 序列类型,字符串text: 序列类型,字符串
- 分割:
validation: 519155 字节,1221 个样本test: 490275 字节,1140 个样本
- 下载大小: 410911 字节
- 数据集大小: 1009430 字节
马拉雅拉姆语配置 (ml)
- 特征:
id: 字符串类型question_concept: 字符串类型answerKey: 字符串类型question: 字符串类型choices: 结构类型,包含以下字段:label: 序列类型,字符串text: 序列类型,字符串
- 分割:
validation: 611370 字节,1221 个样本test: 579108 字节,1140 个样本
- 下载大小: 453273 字节
- 数据集大小: 1190478 字节
马拉地语配置 (mr)
- 特征:
id: 字符串类型question_concept: 字符串类型answerKey: 字符串类型question: 字符串类型choices: 结构类型,包含以下字段:label: 序列类型,字符串text: 序列类型,字符串
- 分割:
validation: 523687 字节,1221 个样本test: 495642 字节,1140 个样本
- 下载大小: 417463 字节
- 数据集大小: 1019329 字节
泰米尔语配置 (ta)
- 特征:
id: 字符串类型question_concept: 字符串类型answerKey: 字符串类型question: 字符串类型choices: 结构类型,包含以下字段:label: 序列类型,字符串text: 序列类型,字符串
- 分割:
validation: 621423 字节,1221 个样本test: 588898 字节,1140 个样本
- 下载大小: 445098 字节
- 数据集大小: 1210321 字节
数据文件路径
- 英文配置 (en):
test: en/test-*validation: en/validation-*
- 古吉拉特语配置 (gu):
validation: gu/validation-*test: gu/test-*
- 印地语配置 (hi):
validation: hi/validation-*test: hi/test-*
- 马拉雅拉姆语配置 (ml):
validation: ml/validation-*test: ml/test-*
- 马拉地语配置 (mr):
validation: mr/validation-*test: mr/test-*
- 泰米尔语配置 (ta):
validation: ta/validation-*test: ta/test-*



