o1_medical
收藏魔搭社区2025-12-05 更新2025-06-14 收录
下载链接:
https://modelscope.cn/datasets/UCSC-VLAA/o1_medical
下载链接
链接失效反馈官方服务:
资源简介:
# A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
- [🌐 Project Page](https://ucsc-vlaa.github.io/o1_medicine/)
- [💻 Github Repo](https://github.com/UCSC-VLAA/o1_medical)
- [🖨️ arXiv Paper](https://arxiv.org/abs/2409.15277)
This dataset is part of the work presented in the paper [A Preliminary Study of O1 in Medicine: Are We Closer to an AI Doctor?](https://arxiv.org/abs/2409.15277). It contains the prompts and questions used in the experiments described in the paper, aiming to evaluate the performance of AI models in medical question-answering tasks. This dataset serves as a valuable resource for researchers and developers working on AI-driven medical solutions.
## Usage
To use this dataset, we recommend integrating it with our evaluation framework, which is available in the following repository: [O1 Evaluation Framework](https://github.com/UCSC-VLAA/o1_eval). The framework provides the necessary tools and scripts to conduct experiments and analyze results efficiently.
For the specific dataset setup and further instructions, please refer to the `setup.sh` script available in the evaluation framework repository: [Dataset Setup](https://github.com/UCSC-VLAA/o1_eval).
Please note that **LancetQA** and **nejmQA** are not included in this dataset due to copyright restrictions.
## License and Citation
Please respect the individual licenses for each dataset as specified in the table above. When using these datasets in your research, make sure to cite the original sources and comply with their respective terms of use.
If you find our work useful for your research and applications, please cite using this BibTeX:
```bibtex
@misc{xie2024preliminarystudyo1medicine,
title={A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?},
author={Yunfei Xie and Juncheng Wu and Haoqin Tu and Siwei Yang and Bingchen Zhao and Yongshuo Zong and Qiao Jin and Cihang Xie and Yuyin Zhou},
year={2024},
eprint={2409.15277},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2409.15277},
}
```
## Disclaimer
o1_medical is a compilation of publicly available datasets, created with the intent to contribute back to the community and provide researchers and developers with a resource for academic and technical research. Any individual or organization (hereinafter referred to as "User") utilizing this dataset must adhere to the following disclaimer:
1. **Dataset Origin**: This dataset is composed of multiple publicly available datasets, the sources of which are clearly identified in the preprint paper. Users are obligated to comply with the relevant licenses and terms of use of the original datasets.
2. **Data Accuracy**: While every effort has been made to ensure the accuracy and completeness of the dataset, we cannot guarantee its absolute accuracy. Users assume all risks and responsibilities associated with the use of this dataset.
3. **Limitation of Liability**: Under no circumstances shall the dataset providers or contributors be held liable for any actions or outcomes resulting from the User's utilization of this dataset.
4. **Usage Constraints**: Users must comply with applicable laws, regulations, and ethical standards when using this dataset. The dataset must not be used for illegal, privacy-infringing, defamatory, discriminatory, or other unlawful or unethical purposes.
5. **Intellectual Property**: The intellectual property rights of this dataset belong to the respective rights holders of the original datasets. Users shall not infringe upon the intellectual property rights of the dataset in any manner.
As a non-profit organization, our team advocates for a harmonious and friendly open-source exchange environment. If you discover any content within the open-source dataset that infringes upon your legal rights, please send an email to (yxie126@ucsc.edu). In your email, please provide a detailed description of the alleged infringement and furnish us with relevant proof of ownership. We will initiate an investigation process within 3 working days and take necessary measures to address the issue (such as removing the relevant data). However, please ensure the veracity of your complaint, as any adverse consequences resulting from measures taken based on false claims will be solely your responsibility.
By downloading, copying, accessing, or using this dataset, the User acknowledges that they have read, understood, and agreed to abide by all terms and conditions set forth in this disclaimer. If the User is unable to accept any part of this disclaimer, they should refrain from using this dataset.
For any further inquiries or issues regarding the dataset, please feel free to raise an issue in the GitHub repository or contact the authors.
# 医学领域o1的初步研究:我们距离AI医生更近了吗?
- [🌐 项目主页](https://ucsc-vlaa.github.io/o1_medicine/)
- [💻 GitHub 仓库](https://github.com/UCSC-VLAA/o1_medical)
- [🖨️ arXiv 论文](https://arxiv.org/abs/2409.15277)
本数据集隶属于论文《医学领域o1的初步研究:我们距离AI医生更近了吗?》(https://arxiv.org/abs/2409.15277)的相关研究工作。数据集收录了论文中实验所使用的提示词(Prompt)与问题,旨在评估AI模型在医疗问答任务中的表现。本数据集可为从事AI驱动医疗解决方案研发的研究人员与开发者提供宝贵的研究资源。
## 使用方式
若要使用本数据集,我们推荐将其与配套的评估框架集成,该框架可从以下仓库获取:[O1 评估框架](https://github.com/UCSC-VLAA/o1_eval)。该框架提供了开展实验与高效分析结果所需的工具与脚本。
关于数据集的具体配置与进一步说明,请参阅评估框架仓库中的`setup.sh`脚本:[数据集配置指南](https://github.com/UCSC-VLAA/o1_eval)。
请注意,**LancetQA**与**nejmQA**因版权限制未包含在本数据集中。
## 许可与引用
请遵守上表中各数据集对应的单独许可条款。在研究中使用这些数据集时,请务必引用原始来源并遵守其各自的使用条款。
若您的研究与应用中用到了本团队的工作,请使用以下BibTeX格式进行引用:
bibtex
@misc{xie2024preliminarystudyo1medicine,
title={A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?},
author={Yunfei Xie and Juncheng Wu and Haoqin Tu and Siwei Yang and Bingchen Zhao and Yongshuo Zong and Qiao Jin and Cihang Xie and Yuyin Zhou},
year={2024},
eprint={2409.15277},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2409.15277},
}
## 免责声明
o1_medical是公开数据集的汇编集,旨在回馈社区并为研究人员与开发者提供用于学术与技术研究的资源。任何使用本数据集的个人或组织(以下简称“用户”)必须遵守以下免责声明:
1. **数据集来源**:本数据集由多个公开数据集汇编而成,其来源已在预印本论文中明确标注。用户必须遵守原始数据集的相关许可与使用条款。
2. **数据准确性**:尽管已尽最大努力确保数据集的准确性与完整性,但我们无法保证其绝对无误。用户需自行承担使用本数据集的全部风险与责任。
3. **责任限制**:在任何情况下,数据集提供者或贡献者均不对用户使用本数据集所产生的任何行为或后果承担责任。
4. **使用限制**:用户在使用本数据集时必须遵守适用的法律法规与伦理标准。本数据集不得用于非法、侵犯隐私、诽谤、歧视或其他违法或不道德的用途。
5. **知识产权**:本数据集的知识产权归属于原始数据集的相应权利持有人。用户不得以任何方式侵犯数据集的知识产权。
作为非营利组织,本团队倡导和谐友好的开源交流环境。若您发现开源数据集中存在侵犯您合法权益的内容,请发送邮件至(yxie126@ucsc.edu)。邮件中请详细说明涉嫌侵权的内容,并提供相关的所有权证明材料。我们将在3个工作日内启动调查流程,并采取必要措施解决问题(例如移除相关数据)。但请确保您的投诉属实,因虚假声明导致的措施所产生的任何不利后果将由您自行承担。
通过下载、复制、访问或使用本数据集,即表示用户已阅读、理解并同意遵守本免责声明中的所有条款与条件。若用户无法接受本免责声明的任何部分,请请勿使用本数据集。
若对本数据集有任何进一步的疑问或问题,请随时在GitHub仓库中提交Issue,或联系论文作者。
提供机构:
maas
创建时间:
2025-04-21



