axiong/pmc_llama_instructions
收藏Hugging Face2023-11-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/axiong/pmc_llama_instructions
下载链接
链接失效反馈官方服务:
资源简介:
---
license: openrail
task_categories:
- question-answering
- text-generation
language:
- en
tags:
- biology
- med
---
This repo provides part of the dataset used for PMC-LLaMA-13B's instruction tuning.
| Data | Size | Link |
| --- | --- | --- |
| ChatDoctor | 100K | https://www.yunxiangli.top/ChatDoctor/ |
| MedQA | 10.2K | https://huggingface.co/datasets/GBaker/MedQA-USMLE-4-options |
| MedMCQA | 183K | https://huggingface.co/datasets/medmcqa |
| PubmedQA | 211K | https://huggingface.co/datasets/pubmed_qa |
| LiveQA | 635 | https://huggingface.co/datasets/truehealth/liveqa |
| MedicationQA | 690 | https://huggingface.co/datasets/truehealth/medicationqa |
| UMLS | 99K | https://www.nlm.nih.gov/research/umls/index.html |
The whole instruction dataset is composed of 7 parts. We have covered them in this dataset repo except for *ChatDoctor*.
You should consider merge ChatDoctor's data for complete dataset.
提供机构:
axiong
原始信息汇总
数据集概述
数据集组成部分
该数据集用于PMC-LLaMA-13B的指令调优,包含以下部分:
| 数据集名称 | 数据量 | 链接 |
|---|---|---|
| ChatDoctor | 100K | https://www.yunxiangli.top/ChatDoctor/ |
| MedQA | 10.2K | https://huggingface.co/datasets/GBaker/MedQA-USMLE-4-options |
| MedMCQA | 183K | https://huggingface.co/datasets/medmcqa |
| PubmedQA | 211K | https://huggingface.co/datasets/pubmed_qa |
| LiveQA | 635 | https://huggingface.co/datasets/truehealth/liveqa |
| MedicationQA | 690 | https://huggingface.co/datasets/truehealth/medicationqa |
| UMLS | 99K | https://www.nlm.nih.gov/research/umls/index.html |
数据集完整性
整个指令数据集由7部分组成,本数据集仓库中已包含除ChatDoctor外的所有部分。建议合并ChatDoctor的数据以获得完整数据集。



