commonsense_qa
收藏数据集概述
数据集描述
- 名称: CommonsenseQA
- 语言: 英语 (
en) - 许可证: MIT
- 多语言性: 单语种
- 大小类别: 1K<n<10K
- 源数据集: 原始数据
- 任务类别: 问答
- 任务ID: 开放领域问答
- PapersWithCode ID: commonsenseqa
- 别名: CommonsenseQA
数据集结构
特征
id(string): 唯一IDquestion(string): 问题question_concept(string): 与问题相关的ConceptNet概念choices(字典):label(string): 选项标签text(string): 选项文本
answerKey(string): 答案
分割
train- 字节数: 2207794
- 样本数: 9741
validation- 字节数: 273848
- 样本数: 1221
test- 字节数: 257842
- 样本数: 1140
配置
default- 数据文件:
train:data/train-*validation:data/validation-*test:data/test-*
- 数据文件:
数据集创建
许可证信息
该数据集在MIT许可证下发布。
引用信息
@inproceedings{talmor-etal-2019-commonsenseqa, title = "{C}ommonsense{QA}: A Question Answering Challenge Targeting Commonsense Knowledge", author = "Talmor, Alon and Herzig, Jonathan and Lourie, Nicholas and Berant, Jonathan", booktitle = "Proceedings of the 2019 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)", month = jun, year = "2019", address = "Minneapolis, Minnesota", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/N19-1421", doi = "10.18653/v1/N19-1421", pages = "4149--4158", archivePrefix = "arXiv", eprint = "1811.00937", primaryClass = "cs", }




