five

heloisy/cosmos_qa_ptbr

收藏
Hugging Face2024-02-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/heloisy/cosmos_qa_ptbr
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: CosmosQA Português language: - pt license: - cc-by-4.0 source_datasets: - cosmos_qa size_categories: - 10K<n<100K dataset_info: features: - name: contexto dtype: string - name: pergunta dtype: string - name: A dtype: string - name: B dtype: string - name: C dtype: string - name: D dtype: string - name: Label dtype: string - name: id dtype: string splits: - name: train num_examples: 25262 - name: validation num_examples: 2985 - name: test num_examples: 6963 task_categories: - multiple-choice --- # Cosmos QA Português Este dataset é uma tradução para português do Cosmos QA, que originalmente é na língua inglesa. A tradução foi feita automaticamente usando o GPT-3.5-turbo, logo pode ter erros que não foram notados numa análise superficial. Se atente ao uso. # Dataset Card for cosmos_qa ## Licensing Information The data is distributed under the [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) license. ## Source Data Citation INformation ``` @inproceedings{huang-etal-2019-cosmos, title = "Cosmos {QA}: Machine Reading Comprehension with Contextual Commonsense Reasoning", author = "Huang, Lifu and Le Bras, Ronan and Bhagavatula, Chandra and Choi, Yejin", booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)", month = nov, year = "2019", address = "Hong Kong, China", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/D19-1243", doi = "10.18653/v1/D19-1243", pages = "2391--2401", } ```
提供机构:
heloisy
原始信息汇总

CosmosQA Português 数据集概述

基本信息

  • 语言: 葡萄牙语 (pt)
  • 许可: CC BY 4.0
  • 源数据集: Cosmos QA
  • 数据规模: 10K<n<100K

数据集结构

特征

  • contexto: 字符串类型
  • pergunta: 字符串类型
  • A: 字符串类型
  • B: 字符串类型
  • C: 字符串类型
  • D: 字符串类型
  • Label: 字符串类型
  • id: 字符串类型

分割

  • 训练集 (train): 25262 个样本
  • 验证集 (validation): 2985 个样本
  • 测试集 (test): 6963 个样本

任务类别

  • 多选题 (multiple-choice)
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作