BioASQ
Source: OpenDataLab | Updated: 2026-05-17 | Indexed: 2024-05-09
Download link:
https://opendatalab.org.cn/OpenDataLab/BioASQ
Resource description:
The BioASQ question answering (QA) benchmark dataset contains questions in English, along with gold-standard (reference) answers and related material. The dataset is designed to reflect the real information needs of biomedical experts and is therefore more realistic and challenging than most existing datasets. Unlike most previous QA benchmarks, which contain only exact answers, the BioASQ-QA dataset also includes ideal answers (in effect, summaries), which are particularly useful for research on multi-document summarization. The dataset combines structured and unstructured data. The material linked to each question comprises documents and snippets, which are useful for information retrieval and passage retrieval experiments, as well as concepts, which are useful in concept-to-text natural language generation. Researchers working on paraphrasing and textual entailment can also measure the degree to which their methods improve the performance of biomedical QA systems. Finally, the dataset is continuously extended as the BioASQ challenge runs and new data are generated.
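The description above lists several kinds of material attached to each question (exact answers, ideal answers, documents, and so on). As a minimal sketch of how such records might be inspected, the Python snippet below counts questions by type and by which answer fields are present. The JSON layout and every field name (`questions`, `type`, `body`, `documents`, `snippets`, `exact_answer`, `ideal_answer`) are assumptions for illustration and are not confirmed by this page; the two records are made-up placeholders, not real BioASQ data.

```python
import json
from collections import Counter

# Hypothetical placeholder records. The layout and field names are
# assumptions for illustration only; check the downloaded files for
# the real schema before relying on any of them.
SAMPLE = json.loads("""
{"questions": [
  {"id": "q1", "type": "factoid", "body": "Placeholder question 1?",
   "documents": ["doc-a", "doc-b"],
   "snippets": [{"document": "doc-a", "text": "Placeholder snippet."}],
   "exact_answer": [["placeholder"]],
   "ideal_answer": ["Placeholder summary answer."]},
  {"id": "q2", "type": "summary", "body": "Placeholder question 2?",
   "documents": ["doc-c"],
   "snippets": [],
   "ideal_answer": ["Another placeholder summary."]}
]}
""")


def summarize(dataset):
    """Return simple counts describing a question-answering dataset dict."""
    questions = dataset.get("questions", [])
    return {
        "n_questions": len(questions),
        # How many questions fall under each (assumed) question type.
        "by_type": dict(Counter(q.get("type", "unknown") for q in questions)),
        # Questions carrying a non-empty exact answer vs. ideal answer.
        "with_exact_answer": sum(1 for q in questions if q.get("exact_answer")),
        "with_ideal_answer": sum(1 for q in questions if q.get("ideal_answer")),
        # Total number of snippet entries across all questions.
        "n_snippets": sum(len(q.get("snippets", [])) for q in questions),
    }


stats = summarize(SAMPLE)
print(stats)
```

To try this on a downloaded file, replace `SAMPLE` with `json.load(open(path, encoding="utf-8"))`, where `path` is wherever the file was saved, and adjust the field names to match the actual schema.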
Provider:
OpenDataLab
Created:
2023-09-04
Collected summary
Dataset introduction

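Background and challenges
Background overview
BioASQ is a question answering benchmark dataset for the biomedical domain. It contains English questions, gold-standard answers, and related material, and it is designed to reflect the real information needs of experts, which makes it more challenging than most existing datasets. Besides exact answers it also provides ideal answers (summaries), so it is suited to research on multi-document summarization, information retrieval, and natural language generation. The dataset combines structured and unstructured data and keeps growing as the challenge runs.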
The content above was collected and summarized by 遇见数据集.



