five

axiong/pmc_llama_instructions

收藏
Hugging Face2023-11-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/axiong/pmc_llama_instructions
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: openrail task_categories: - question-answering - text-generation language: - en tags: - biology - med --- This repo provides part of the dataset used for PMC-LLaMA-13B's instruction tuning. | Data | Size | Link | | --- | --- | --- | | ChatDoctor | 100K | https://www.yunxiangli.top/ChatDoctor/ | | MedQA | 10.2K | https://huggingface.co/datasets/GBaker/MedQA-USMLE-4-options | | MedMCQA | 183K | https://huggingface.co/datasets/medmcqa | | PubmedQA | 211K | https://huggingface.co/datasets/pubmed_qa | | LiveQA | 635 | https://huggingface.co/datasets/truehealth/liveqa | | MedicationQA | 690 | https://huggingface.co/datasets/truehealth/medicationqa | | UMLS | 99K | https://www.nlm.nih.gov/research/umls/index.html | The whole instruction dataset is composed of 7 parts. We have covered them in this dataset repo except for *ChatDoctor*. You should consider merge ChatDoctor's data for complete dataset.
提供机构:
axiong
原始信息汇总

数据集概述

数据集组成部分

该数据集用于PMC-LLaMA-13B的指令调优,包含以下部分:

数据集名称 数据量 链接
ChatDoctor 100K https://www.yunxiangli.top/ChatDoctor/
MedQA 10.2K https://huggingface.co/datasets/GBaker/MedQA-USMLE-4-options
MedMCQA 183K https://huggingface.co/datasets/medmcqa
PubmedQA 211K https://huggingface.co/datasets/pubmed_qa
LiveQA 635 https://huggingface.co/datasets/truehealth/liveqa
MedicationQA 690 https://huggingface.co/datasets/truehealth/medicationqa
UMLS 99K https://www.nlm.nih.gov/research/umls/index.html

数据集完整性

整个指令数据集由7部分组成,本数据集仓库中已包含除ChatDoctor外的所有部分。建议合并ChatDoctor的数据以获得完整数据集。

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作