Kakyoin03/Health_QA_English
收藏Hugging Face2026-04-28 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Kakyoin03/Health_QA_English
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含高度结构化的医学问答对,经过提取、清理和标准化处理,语言为英语。它是BRAIN HEALTH (HELIX-FT)项目的基准数据集,旨在微调大型语言模型(LLMs)作为医疗助手。数据集包含18,876个高质量的医学问答对,每个条目包含问题、上下文问题、答案、专业领域、紧急程度、实体和文章标题等字段。数据集经过严格的评估,包括RAG Triad Metrics、词汇和数据多样性指标以及自动化NLP管道评分。
This dataset contains highly structured medical Questions and Answers extracted, cleaned, and standardized in English. It serves as the baseline dataset for the BRAIN HEALTH (HELIX-FT) project, designed to fine-tune Large Language Models (LLMs) to act as medical assistants. The dataset includes 18,876 high-quality medical Q&A pairs, with each item containing fields such as question, context_question, answer, speciality, urgency, entities, and article_title. The dataset underwent rigorous evaluation, including RAG Triad Metrics, Lexical & Data Diversity Metrics, and Automated NLP Pipeline Scores.
提供机构:
Kakyoin03



