SQAC
收藏huggingface.co2025-03-25 收录
下载链接:
https://huggingface.co/datasets/PlanTL-GOB-ES/SQAC
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains 6,247 contexts and 18,817 questions with their answers, 1 to 5 for each fragment.
The sources of the contexts are:
* Encyclopedic articles from [Wikipedia in Spanish](https://es.wikipedia.org/), used under [CC-by-sa licence](https://creativecommons.org/licenses/by-sa/3.0/legalcode).
* News from [Wikinews in Spanish](https://es.wikinews.org/), used under [CC-by licence](https://creativecommons.org/licenses/by/2.5/).
* Text from the Spanish corpus [AnCora](http://clic.ub.edu/corpus/en), which is a mix from diferent newswire and literature sources, used under [CC-by licence] (https://creativecommons.org/licenses/by/4.0/legalcode).
This dataset can be used to build extractive-QA.
本数据集包含6,247个上下文和18,817个问题及其答案,每个片段包含1至5个答案。上下文的来源包括:来自[西班牙语维基百科](https://es.wikipedia.org/)的百科文章,在[CC-by-sa许可协议](https://creativecommons.org/licenses/by-sa/3.0/legalcode)下使用;来自[西班牙语维基新闻](https://es.wikinews.org/)的新闻报道,在[CC-by许可协议](https://creativecommons.org/licenses/by/2.5/)下使用;以及来自西班牙语语料库[AnCora](http://clic.ub.edu/corpus/en),该语料库汇集了不同新闻通讯和文学来源的文本,在[CC-by许可协议](https://creativecommons.org/licenses/by/4.0/legalcode)下使用。此数据集可用于构建提取式问答系统。
提供机构:
huggingface.co



