drugchat_liang_zhang_et_al
收藏魔搭社区2025-10-09 更新2025-05-31 收录
下载链接:
https://modelscope.cn/datasets/jablonkagroup/drugchat_liang_zhang_et_al
下载链接
链接失效反馈官方服务:
资源简介:
## Dataset Details
### Dataset Description
Instruction tuning dataset used for the LLM component of DrugChat.
10,834 compounds (3,8962 from ChEMBL and 6,942 from PubChem) containing
descriptive drug information were collected. 143,517 questions were generated
using the molecules' classification, properties and descriptions from ChEBI, LOTUS & YMDB.
- **Curated by:**
- **License:** BSD-3-Clause
### Dataset Sources
- [corresponding publication](https://www.techrxiv.org/articles/preprint/DrugChat_Towards_Enabling_ChatGPT-Like_Capabilities_on_Drug_Molecule_Graphs/22945922)
- [rep & data source](https://github.com/UCSD-AI4H/drugchat)
## Citation
**BibTeX:**
```bibtex
@article{Liang2023,
author = "Youwei Liang and Ruiyi Zhang and Li Zhang and Pengtao Xie",
title = "{DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs}",
year = "2023",
month = "5",
url = "https://www.techrxiv.org/articles/preprint/DrugChat_Towards_Enabling_ChatGPT-Like_Capabilities_on_Drug_Molecule_Graphs/22945922",
doi = "10.36227/techrxiv.22945922.v1"}
```
## 数据集详情
### 数据集描述
本数据集为适用于DrugChat的大语言模型(LLM)组件的指令微调数据集。共收集了10834个包含药物描述信息的化合物(其中3896个来自ChEMBL,6942个来自PubChem)。研究人员基于ChEBI、LOTUS与YMDB数据库中化合物的分类、属性及描述信息,生成了143517条问题。
- **数据整理方:**
- **授权协议:** BSD-3-Clause
### 数据集来源
- [相关研究论文](https://www.techrxiv.org/articles/preprint/DrugChat_Towards_Enabling_ChatGPT-Like_Capabilities_on_Drug_Molecule_Graphs/22945922)
- [代码与数据集来源](https://github.com/UCSD-AI4H/drugchat)
## 引用信息
**BibTeX格式引用:**
bibtex
@article{Liang2023,
author = "Youwei Liang and Ruiyi Zhang and Li Zhang and Pengtao Xie",
title = "{DrugChat: Towards Enabling ChatGPT-Like_Capabilities_on_Drug_Molecule_Graphs}",
year = "2023",
month = "5",
url = "https://www.techrxiv.org/articles/preprint/DrugChat_Towards_Enabling_ChatGPT-Like_Capabilities_on_Drug_Molecule_Graphs/22945922",
doi = "10.36227/techrxiv.22945922.v1"}
提供机构:
maas
创建时间:
2025-05-27



