five

Low-cost prediction of molecular and transition state partition functions via machine learning

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6326559
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the vibrational, rotational, translational, and electronic partition functions for 35,883 organic chemistry molecular structures taken from the Grambow et. al dataset [1]. It was used to train ML deep neural networks to predict unknown transition state partition functions as well as partition functions for known molecular structures [2] The partition functions were computed at temperatures in the range T= [50, 2000] K with the rigid rotor, rigid body, harmonic oscillator approximations. Reactions involve no more than 7 C, N, or O atoms. Frequencies for the vibrational partition functions were taken from [1] where they were computed with DFT at the ωB97X-D3/def2-TZVP level of theory. For the rotational partition function, symmetry numbers were obtained by evaluating proper and improper invariant rotations of the structures. We note that structures involving two molecules were not separated: vibrational frequencies and symmetry numbers were computed for the aggregate structure. For each reaction, partition functions were calculated at 50 temperatures sampled uniformly from the inverse temperature range 1/T = [1/2000, 1/50] K-1. This corresponds to 11,961 reactions, 35,883 total structures, and 1,794,150 total partition function examples. The file Partition_Functions.tar.gz contains directories entitled “rxnXXXXXX” where XXXXXX is a reaction number identifier. Each contain three files “rXXXXXX.csv”, “pXXXXXX.csv”, “tsXXXXXX.csv” corresponding to data from the reactant (“r”), product (“p”), and transition state (“ts”) for reaction XXXXXX. Note that the directory structure and the reaction identifiers are the same as used in the original structure dataset by Grambow et al. and the corresponding structures can easily be extracted from that dataset. Each comma separated value (csv) file contains 50 rows and the following columns:  Column label  Values  T [K]  Temperature  qpart_ele [unitless]  Electronic partition function  qpart_trans [unitless]  Translational partition function  qpart_vib [unitless]  Vibrational partition function  qpart_rot [unitless]  Rotational partition function  qpart [unitless]  Partition function  log_qpart_trans [unitless]   Natural logarithm of translational partition function  log_qpart_rot [unitless]  Natural logarithm of rotational partition function  log_qpart_vib [unitless]  Natural logarithm of vibration partition function  log_qpart [unitless]  Natural logarithm of partition function   [1]      C. A. Grambow, L. Pattanaik, and W. H. Green, “Reactants, products, and transition states of elementary chemical reactions based on quantum chemistry,” Sci. Data, 7:1–8, 2020. [2]      Komp, E. Valleau, S. “Low-cost prediction of molecular and transition state partition functions via machine learning”, arXiv:, 2022.
创建时间:
2022-03-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作