
MedRedQA

Research Data Australia, indexed 2024-12-14
Download link:
https://researchdata.edu.au/medredqa/3378885
Description:
A large non-factoid English consumer Question Answering (QA) dataset containing 51,000 pairs of consumer questions and their corresponding expert answers. This dataset is useful for benchmarking or training systems on more difficult real-world questions and responses, which may contain spelling or formatting errors, or lexical gaps between consumer and expert vocabularies.

By downloading this dataset, you agree to have obtained ethics approval from your institution.

Lineage: We collected data from posts and comments on the subreddit /r/askdocs, published between July 10, 2013, and April 2, 2022, totalling 600,000 submissions (original posts) and 1,700,000 comments (replies). We generated question-answer pairs by taking the highest-scoring answer from a verified medical expert to a Reddit question. Questions with only images are removed, and all links and authors are removed.

We provide two separate datasets in this collection, with the following schemas.

MedRedQA - Reddit medical question and answer pairs from /r/askdocs. CSV format.
i. The poster's question (Body)
ii. Title of the post
iii. The filtered answer from a verified physician comment (Response)
iv. Occupation indicated for verification status
v. Any PMCIDs found in the post

MedRedQA+PubMed - PubMed-enriched subset of MedRedQA. JSON format.
i. Question: The user's original question. This is equivalent to the Body field in MedRedQA.
ii. Document: The abstract of the PubMed document (if it exists and contains an abstract) for that particular post. Note: this does not necessarily mean the answer references this document, but at least one other verified physician in the responses has mentioned that particular document.
iii. The filtered response. This is equivalent to the Response field in MedRedQA.
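The pair-generation step described under Lineage (take the highest-scoring answer from a verified medical expert) can be illustrated with a minimal sketch. This is not the dataset authors' code. The `Comment` class, its field names, and the helper functions are hypothetical, and the output keys `Occupation` and the treatment of an empty body as an image-only question are assumptions; only `Body`, `Title`, and `Response` are named in the schema above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Comment:
    """Hypothetical stand-in for one Reddit reply."""
    body: str
    score: int
    # None when the commenter is not a verified medical expert (assumption).
    verified_occupation: Optional[str]


def pick_expert_answer(comments):
    """Return the highest-scoring comment from a verified expert, or None."""
    verified = [c for c in comments if c.verified_occupation is not None]
    if not verified:
        return None
    # On ties, max() keeps the first comment encountered (an arbitrary choice).
    return max(verified, key=lambda c: c.score)


def make_pair(title, body, comments):
    """Build one question-answer row, or None if the post does not qualify."""
    answer = pick_expert_answer(comments)
    # Assumption: an image-only question shows up as an empty text body.
    if answer is None or not body.strip():
        return None
    return {
        "Title": title,
        "Body": body,
        "Response": answer.body,
        "Occupation": answer.verified_occupation,  # hypothetical column name
    }


comments = [
    Comment("Layperson reply", 50, None),
    Comment("See your GP", 12, "Physician"),
    Comment("Likely benign", 30, "Nurse"),
]
pair = make_pair("Rash", "I have a rash on my arm", comments)
```

In this sketch the unverified reply is ignored even though it has the highest score, so the selected answer comes from the verified commenter with score 30. Link and author removal are not shown.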
Provider:
Commonwealth Scientific and Industrial Research Organisation