PubChemQA
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/pharmolix/openbiomed
下载链接
链接失效反馈官方服务:
资源简介:
该数据集旨在评估在分子理解任务中的幻觉水平,尤其是关注大型语言模型在处理分子SMILES和蛋白质序列方面的表现。此外,该数据集还作为评估不同大型语言模型在分子理解任务中产生幻觉现象的一个基准。
This dataset is designed to evaluate the level of hallucination in molecular understanding tasks, with a particular focus on the performance of large language models (LLMs) when processing molecular SMILES and protein sequences. Furthermore, this dataset serves as a benchmark for assessing the hallucination phenomena generated by different large language models in molecular understanding tasks.
提供机构:
PubChem



