THUMedInfo/PMC-Patients-ReCDS
Source: Hugging Face. Updated: 2024-11-28. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/THUMedInfo/PMC-Patients-ReCDS
Description:
PMC-Patients is a first-of-its-kind dataset of 167k patient summaries extracted from case reports in PubMed Central (PMC), together with 3.1M patient-article relevance annotations and 293k patient-patient similarity annotations defined by the PubMed citation graph. It supports two tasks, Patient-to-Article Retrieval (PAR) and Patient-to-Patient Retrieval (PPR), for benchmarking retrieval-based clinical decision support (ReCDS) systems. The dataset is organized into queries, a corpus, and qrels (annotations), stored in jsonl and tsv formats; the dataset card also provides example data instances and splits.
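Since the description mentions queries, a corpus, and qrels stored as jsonl and tsv, a minimal sketch of reading such files may help. The field names (`_id`, `text`) and the qrels header (`query-id`, `corpus-id`, `score`) are assumptions made for illustration and are not stated on this page; the toy records are invented and are not taken from the dataset.

```python
import csv
import io
import json
from collections import defaultdict


def read_jsonl(stream):
    """Parse one JSON object per non-empty line."""
    return [json.loads(line) for line in stream if line.strip()]


def read_qrels(stream):
    """Parse a tab-separated qrels file with a header row.

    Returns {query_id: {doc_id: score}}. The column names are assumed,
    not confirmed by the dataset description.
    """
    qrels = defaultdict(dict)
    for row in csv.DictReader(stream, delimiter="\t"):
        qrels[row["query-id"]][row["corpus-id"]] = int(row["score"])
    return dict(qrels)


# Toy in-memory stand-ins for a queries file and a qrels file (made-up content).
toy_queries = io.StringIO('{"_id": "q1", "text": "toy patient summary"}\n')
toy_qrels = io.StringIO("query-id\tcorpus-id\tscore\nq1\td1\t1\nq1\td2\t2\n")

queries = read_jsonl(toy_queries)
qrels = read_qrels(toy_qrels)

# For each query, list the annotated documents from highest to lowest score.
for query in queries:
    annotated = qrels.get(query["_id"], {})
    ranked = sorted(annotated, key=annotated.get, reverse=True)
    print(query["_id"], ranked)
```

With real files, the `io.StringIO` objects would be replaced by file handles opened with `open(path, encoding="utf-8", newline="")`, after checking the actual field names in the downloaded data.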
Provider:
THUMedInfo



