FrenYou/medical
收藏Hugging Face2026-04-23 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/FrenYou/medical
下载链接
链接失效反馈官方服务:
资源简介:
medical是一个中文医疗数据集,可用于医疗领域大模型训练。数据集包含预训练数据集(pretrain)、指令微调数据集(finetune)和奖励模型数据集(reward)三个部分。预训练数据集包含医疗百科数据和医疗教材文本数据,用于预训练注入医疗知识;指令微调数据集包含中文和英文的医疗问诊对话数据,用于监督微调;奖励模型数据集包含中文医疗对话数据集的提问和答复,用于奖励模型训练。数据集的结构、字段解释、数据分割和来源等信息也在README中详细说明。
medical is a Chinese Medical dataset designed for training large models in the medical field. The dataset includes three main parts: pretrain, finetune, and reward. The pretrain dataset contains medical encyclopedia data and medical textbook text data for pretraining to inject medical knowledge. The finetune dataset includes Chinese and English medical consultation dialogue data for supervised fine-tuning. The reward dataset consists of questions and answers from Chinese medical dialogue datasets for reward model training. The README provides detailed information on the dataset structure, field explanations, data splits, and sources.
提供机构:
FrenYou



