heloisy/cosmos_qa_ptbr
收藏Hugging Face2024-02-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/heloisy/cosmos_qa_ptbr
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: CosmosQA Português
language:
- pt
license:
- cc-by-4.0
source_datasets:
- cosmos_qa
size_categories:
- 10K<n<100K
dataset_info:
features:
- name: contexto
dtype: string
- name: pergunta
dtype: string
- name: A
dtype: string
- name: B
dtype: string
- name: C
dtype: string
- name: D
dtype: string
- name: Label
dtype: string
- name: id
dtype: string
splits:
- name: train
num_examples: 25262
- name: validation
num_examples: 2985
- name: test
num_examples: 6963
task_categories:
- multiple-choice
---
# Cosmos QA Português
Este dataset é uma tradução para português do Cosmos QA, que originalmente é na língua inglesa.
A tradução foi feita automaticamente usando o GPT-3.5-turbo, logo pode ter erros que não foram notados numa análise superficial.
Se atente ao uso.
# Dataset Card for cosmos_qa
## Licensing Information
The data is distributed under the [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) license.
## Source Data Citation INformation
```
@inproceedings{huang-etal-2019-cosmos,
title = "Cosmos {QA}: Machine Reading Comprehension with Contextual Commonsense Reasoning",
author = "Huang, Lifu and
Le Bras, Ronan and
Bhagavatula, Chandra and
Choi, Yejin",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)",
month = nov,
year = "2019",
address = "Hong Kong, China",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/D19-1243",
doi = "10.18653/v1/D19-1243",
pages = "2391--2401",
}
```
提供机构:
heloisy
原始信息汇总
CosmosQA Português 数据集概述
基本信息
- 语言: 葡萄牙语 (pt)
- 许可: CC BY 4.0
- 源数据集: Cosmos QA
- 数据规模: 10K<n<100K
数据集结构
特征
- contexto: 字符串类型
- pergunta: 字符串类型
- A: 字符串类型
- B: 字符串类型
- C: 字符串类型
- D: 字符串类型
- Label: 字符串类型
- id: 字符串类型
分割
- 训练集 (train): 25262 个样本
- 验证集 (validation): 2985 个样本
- 测试集 (test): 6963 个样本
任务类别
- 多选题 (multiple-choice)



