PubChemQA

Name: PubChemQA
Creator: PubChem
License: 暂无描述

arXiv2025-09-30 收录

下载链接：

https://github.com/pharmolix/openbiomed

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集旨在评估在分子理解任务中的幻觉水平，尤其是关注大型语言模型在处理分子SMILES和蛋白质序列方面的表现。此外，该数据集还作为评估不同大型语言模型在分子理解任务中产生幻觉现象的一个基准。

This dataset is designed to evaluate the level of hallucination in molecular understanding tasks, with a particular focus on the performance of large language models (LLMs) when processing molecular SMILES and protein sequences. Furthermore, this dataset serves as a benchmark for assessing the hallucination phenomena generated by different large language models in molecular understanding tasks.

提供机构：

PubChem

5,000+

优质数据集

54 个

任务类型

进入经典数据集