KOJKO/hotpot_qa
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/KOJKO/hotpot_qa
下载链接
链接失效反馈官方服务:
资源简介:
HotpotQA是一个新的数据集,包含113,000个基于维基百科的问题-答案对,具有四个关键特征:(1)问题需要通过查找和推理多个支持文档来回答;(2)问题多样且不受任何现有知识库或知识模式的限制;(3)提供句子级的支持事实,允许问答系统在强监督下进行推理并解释预测;(4)提供一种新类型的事实比较问题,以测试问答系统提取相关事实和进行必要比较的能力。数据集分为distractor和fullwiki两种配置,每种配置都有特定的特征和分割。数据集为英文,通过众包创建,采用CC BY-SA 4.0许可证。
HotpotQA is a new dataset with 113k Wikipedia-based question-answer pairs with four key features: (1) the questions require finding and reasoning over multiple supporting documents to answer; (2) the questions are diverse and not constrained to any pre-existing knowledge bases or knowledge schemas; (3) we provide sentence-level supporting facts required for reasoning, allowing QA systems to reason with strong supervision and explain the predictions; (4) we offer a new type of factoid comparison questions to test QA systems’ ability to extract relevant facts and perform necessary comparison. The dataset is available in two configurations: distractor and fullwiki, each with specific features and splits. The dataset is in English, crowdsourced, and licensed under CC BY-SA 4.0.
提供机构:
KOJKO



