five

HuatuoGPT2-SFT-GPT4-140K

收藏
魔搭社区2025-12-05 更新2025-01-25 收录
下载链接:
https://modelscope.cn/datasets/FreedomIntelligence/HuatuoGPT2-SFT-GPT4-140K
下载链接
链接失效反馈
官方服务:
资源简介:
## HuatuoGPT2-SFT-GPT4-140K 140K Chinese medical instructions generated by **GPT-4**, based on questions from [HuatuoGPT Dataset](https://huggingface.co/datasets/FreedomIntelligence/HuatuoGPT-sft-data-v1). This dataset contains supervised fine-tuning instructions for HuatuoGPT2, designed to enhance the model's ability to follow instructions in real medical scenarios. We have made all the data (142,248 entries) in this dataset publicly available. ## Repository - **Github:** https://github.com/FreedomIntelligence/HuatuoGPT-II ## Citation ``` @misc{chen2023huatuogptii, title={HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs}, author={Junying Chen and Xidong Wang and Anningzhe Gao and Feng Jiang and Shunian Chen and Hongbo Zhang and Dingjie Song and Wenya Xie and Chuyi Kong and Jianquan Li and Xiang Wan and Haizhou Li and Benyou Wang}, year={2023}, eprint={2311.09774}, archivePrefix={arXiv}, primaryClass={cs.CL} } @article{huatuogpt-2023, title={HuatuoGPT, Towards Taming Language Models To Be a Doctor}, author={Hongbo Zhang and Junying Chen and Feng Jiang and Fei Yu and Zhihong Chen and Jianquan Li and Guiming Chen and Xiangbo Wu and Zhiyi Zhang and Qingying Xiao and Xiang Wan and Benyou Wang and Haizhou Li}, journal={arXiv preprint arXiv:2305.15075}, year={2023} } ```

## HuatuoGPT2-SFT-GPT4-140K 该数据集包含14万条由GPT-4生成的中文医疗指令,样本来源于[HuatuoGPT数据集(HuatuoGPT Dataset)](https://huggingface.co/datasets/FreedomIntelligence/HuatuoGPT-sft-data-v1)。 本数据集专为HuatuoGPT2构建监督微调指令集,旨在提升模型在真实医疗场景下遵循指令的能力。本次公开了该数据集内全部142,248条数据条目。 ## 仓库 - **GitHub:** https://github.com/FreedomIntelligence/HuatuoGPT-II ## 引用 @misc{chen2023huatuogptii, title={HuatuoGPT-II:面向大语言模型(Large Language Model)医疗适配的单阶段训练方法}, author={Junying Chen、Xidong Wang、Anningzhe Gao、Feng Jiang、Shunian Chen、Hongbo Zhang、Dingjie Song、Wenya Xie、Chuyi Kong、Jianquan Li、Xiang Wan、Haizhou Li、Benyou Wang}, year={2023}, eprint={2311.09774}, archivePrefix={arXiv}, primaryClass={cs.CL} } @article{huatuogpt-2023, title={HuatuoGPT:驯服语言模型以成为专业医生}, author={Hongbo Zhang、Junying Chen、Feng Jiang、Fei Yu、Zhihong Chen、Jianquan Li、Guiming Chen、Xiangbo Wu、Zhiyi Zhang、Qingying Xiao、Xiang Wan、Benyou Wang、Haizhou Li}, journal={arXiv预印本}, year={2023}, eprint={2305.15075} }
提供机构:
maas
创建时间:
2025-01-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作