medical-r1-distill-data

Name: medical-r1-distill-data
Creator: maas
Published: 2026-01-06 16:23:24
License: 暂无描述

魔搭社区2026-01-06 更新2025-03-01 收录

下载链接：

https://modelscope.cn/datasets/FreedomIntelligence/medical-r1-distill-data

下载链接

链接失效反馈

官方服务：

资源简介：

## Introduction This dataset is an SFT dataset distilled from **Deepseek-R1 (Full Power Version)**, based on [medical verifiable problems](https://huggingface.co/datasets/FreedomIntelligence/medical-o1-verifiable-problem) from HuatuoGPT-o1. The **Chinese version** of the dataset is available at [FreedomIntelligence/Medical-R1-Distill-Data-Chinese](https://huggingface.co/datasets/FreedomIntelligence/Medical-R1-Distill-Data-Chinese). The distillation originates from the native Deepseek-R1 API requests. We hope this distilled dataset can help initialize your models with the reasoning chain from R1. You can also use our previously built medical verified long reasoning chains based on GPT-4o on [medical-o1-reasoning-SFT](https://huggingface.co/datasets/FreedomIntelligence/medical-o1-reasoning-SFT). For details, see our [paper](https://arxiv.org/pdf/2412.18925) and [GitHub repository](https://github.com/FreedomIntelligence/HuatuoGPT-o1). ## Citation If you find our data useful, please consider citing our work! ``` @misc{chen2024huatuogpto1medicalcomplexreasoning, title={HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs}, author={Junying Chen and Zhenyang Cai and Ke Ji and Xidong Wang and Wanlong Liu and Rongsheng Wang and Jianye Hou and Benyou Wang}, year={2024}, eprint={2412.18925}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2412.18925}, } ```

## 简介本数据集是从**Deepseek-R1（全功能版）**中蒸馏得到的**监督微调（Supervised Fine-Tuning，SFT）**数据集，其数据基础源自HuatuoGPT-o1的[医疗可验证问题](https://huggingface.co/datasets/FreedomIntelligence/medical-o1-verifiable-problem)。本数据集的**中文版本**可通过[FreedomIntelligence/Medical-R1-Distill-Data-Chinese](https://huggingface.co/datasets/FreedomIntelligence/Medical-R1-Distill-Data-Chinese)获取。本次蒸馏基于原生Deepseek-R1 API请求构建。我们期望该蒸馏数据集能够助力开发者基于R1的推理链条初始化模型。此外，您也可使用我们此前基于GPT-4o构建的医疗验证长推理链数据集[medical-o1-reasoning-SFT](https://huggingface.co/datasets/FreedomIntelligence/medical-o1-reasoning-SFT)。如需了解更多细节，请参阅我们的[论文](https://arxiv.org/pdf/2412.18925)与[GitHub仓库](https://github.com/FreedomIntelligence/HuatuoGPT-o1)。 ## 引用若您认为本数据集对您的研究有所助益，请考虑引用我们的工作！ @misc{chen2024huatuogpto1medicalcomplexreasoning, title={HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs}, author={Junying Chen and Zhenyang Cai and Ke Ji and Xidong Wang and Wanlong Liu and Rongsheng Wang and Jianye Hou and Benyou Wang}, year={2024}, eprint={2412.18925}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2412.18925}, }

提供机构：

maas

创建时间：

2025-02-22

搜集汇总

数据集介绍