Gaykar/DrugData
Source: Hugging Face | Updated: 2025-12-17 | Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/Gaykar/DrugData
Description:
The Drug Description Dataset (Educational Use) contains 985 samples, each pairing a drug name with a description. It is intended strictly for educational and research purposes, in particular for fine-tuning large language models (LLMs) on drug-related text understanding, summarization, or question answering. The author manually selected the drug names from publicly available medical information websites; the descriptions were generated synthetically with Google Gemini from those names and have not been verified by medical professionals. The dataset has two columns: DrugName (the drug name) and Description (the drug description). Permitted uses are educational experiments, NLP research, LLM fine-tuning, and text generation and understanding tasks. The dataset must not be used for clinical decision-making, medical diagnosis, treatment recommendations, or real-world healthcare applications.
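Because the dataset is described as two columns aimed at LLM fine-tuning, a minimal sketch of turning rows into prompt/completion records may help. Only the column names DrugName and Description come from the description above; the prompt wording, the "prompt"/"completion" keys, the helper names, and the sample rows are illustrative assumptions, not part of the dataset.

```python
import json

# Column names come from the dataset description; the prompt wording and the
# "prompt"/"completion" keys are illustrative assumptions.
PROMPT_TEMPLATE = "Describe the drug {name} for educational purposes."


def to_instruction_records(rows):
    """Turn rows with DrugName/Description columns into prompt/completion pairs."""
    records = []
    for row in rows:
        name = (row.get("DrugName") or "").strip()
        description = (row.get("Description") or "").strip()
        # Skip rows with a missing name or description.
        if not name or not description:
            continue
        records.append({
            "prompt": PROMPT_TEMPLATE.format(name=name),
            "completion": description,
        })
    return records


def to_jsonl(records):
    """Serialize records as JSON Lines, one object per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


# Placeholder rows for illustration only; they are not taken from the dataset.
sample_rows = [
    {"DrugName": "ExampleDrugA", "Description": "Placeholder description A."},
    {"DrugName": "ExampleDrugB", "Description": ""},
]
records = to_instruction_records(sample_rows)
print(to_jsonl(records))
```

With real data, the rows would come from the downloaded dataset rather than from `sample_rows`. Since the descriptions are unverified synthetic text, any model trained on such records inherits the same usage restrictions.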
Provided by:
Gaykar



