AI4Chem/ChemPref-DPO-for-Chemistry-data-en
Source: Hugging Face. Updated 2024-04-21; indexed 2024-04-19.
Download link:
https://hf-mirror.com/datasets/AI4Chem/ChemPref-DPO-for-Chemistry-data-en
Resource description:
---
dataset_info:
  features:
  - name: instruction
    dtype: string
  - name: output
    sequence: string
  - name: input
    dtype: string
  - name: history
    sequence: 'null'
  splits:
  - name: train
    num_bytes: 30375810
    num_examples: 10703
  download_size: 13847446
  dataset_size: 30375810
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
license: mit
task_categories:
- text-generation
- question-answering
language:
- en
tags:
- chemistry
---
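The metadata above declares four fields per record: `instruction`, `input`, `output` (a sequence of strings), and `history`. The following is a minimal sketch of how such a record might be turned into a prompt/chosen/rejected triple for DPO training. It rests on an assumption the card does not state: that `output[0]` is the preferred response and `output[1]` the rejected one. The helper names `to_preference_pair` and `load_pairs` and the sample record are hypothetical, introduced only for illustration.

```python
from typing import Any, Dict, List


def to_preference_pair(record: Dict[str, Any]) -> Dict[str, str]:
    """Convert one record into a prompt/chosen/rejected triple.

    ASSUMPTION (not stated in the dataset card): output[0] is the
    preferred response and output[1] is the rejected one.
    """
    outputs: List[str] = record["output"]
    if len(outputs) < 2:
        raise ValueError("expected at least two candidate responses in 'output'")
    prompt = record["instruction"]
    # 'input' carries optional extra context; append it only when non-empty.
    if record.get("input"):
        prompt = f"{prompt}\n{record['input']}"
    return {"prompt": prompt, "chosen": outputs[0], "rejected": outputs[1]}


def load_pairs() -> List[Dict[str, str]]:
    """Download the train split and convert every record (needs network access)."""
    from datasets import load_dataset  # third-party: pip install datasets

    ds = load_dataset("AI4Chem/ChemPref-DPO-for-Chemistry-data-en", split="train")
    return [to_preference_pair(row) for row in ds]


# Hypothetical record, invented for illustration; not taken from the dataset.
example = {
    "instruction": "What is the chemical formula of water?",
    "input": "",
    "output": ["H2O", "HO2"],
    "history": [],
}
pair = to_preference_pair(example)
print(pair["chosen"])
```

Before relying on the ordering assumption, it is worth inspecting a few real records by eye; if the preferred response turns out to sit in a different position, only the two index expressions in `to_preference_pair` need to change.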
## Citation
```
@misc{zhang2024chemllm,
title={ChemLLM: A Chemical Large Language Model},
author={Di Zhang and Wei Liu and Qian Tan and Jingdan Chen and Hang Yan and Yuliang Yan and Jiatong Li and Weiran Huang and Xiangyu Yue and Dongzhan Zhou and Shufei Zhang and Mao Su and Hansen Zhong and Yuqiang Li and Wanli Ouyang},
year={2024},
eprint={2402.06852},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
```
Provider:
AI4Chem
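Summary of original information
Dataset information
Features
- Name: instruction
  - Data type: string
- Name: output
  - Sequence type: string
- Name: input
  - Data type: string
- Name: history
  - Sequence type: null
Splits
- Name: train
  - Bytes: 30375810
  - Examples: 10703
Size
- Download size: 13847446 bytes
- Dataset size: 30375810 bytes
Configuration
- Config name: default
- Data files:
  - Split: train
  - Path: data/train-*
License
- License: MIT
Task categories
- Text generation
- Question answering
Language
- Language: en
Tags
- Chemistry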



