XQuAD
收藏OpenCSG2024-04-11 更新2026-01-19 收录
下载链接:
https://opencsg.com/datasets/OpenDataLab/XQuAD?tab=summary
下载链接
链接失效反馈官方服务:
资源简介:
XQuAD(Cross-lingual Question Answering Dataset)是评估跨语言问答性能的基准数据集。该数据集包含来自 SQuAD v1.1(Rajpurkar 等人,2016 年)开发集的 240 个段落和 1190 个问答对的子集,以及它们的十种语言的专业翻译:西班牙语、德语、希腊语、俄语、土耳其语、阿拉伯语、越南语、泰语、汉语和印地语。因此,数据集在 11 种语言中完全平行。
XQuAD (Cross-lingual Question Answering Dataset) is a benchmark dataset for evaluating cross-lingual question answering performance. It contains a subset of 240 paragraphs and 1190 question-answer pairs from the development set of SQuAD v1.1 (Rajpurkar et al., 2016), along with their professional translations into ten languages: Spanish, German, Greek, Russian, Turkish, Arabic, Vietnamese, Thai, Chinese, and Hindi. Consequently, the dataset is fully parallel across 11 languages.
提供机构:
OpenDataLab
创建时间:
2024-04-11



