Discrete Feature Representations of CHO Reaction Mechanisms as Quasireaction Subgraphs
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7905293
下载链接
链接失效反馈官方服务:
资源简介:
This data set contains 194778 quasireaction subgraphs extracted from CHO transition networks with 2-6 non-hydrogen atoms (CxHyOz, 2 <= x + z <= 6).
The complete table of subgraphs (including file locations) is in CHO-6-atoms-subgraphs.csv file. The subgraphs are in GraphML format (http://graphml.graphdrawing.org) and are compressed using bzip2. All subgraphs are undirected and unweighted. The reactant and product nodes (initial and final) are labeled in the "type" node attribute. The nodes are represented as multi-molecule SMILES strings. The edges are labeled by the reaction rules in SMARTS representation. The forward and backward reading of the SMARTS string should be considered equivalent.
The generation and analysis of this data set is described in
D. Rappoport, Statistics and Bias-Free Sampling of Reaction Mechanisms from Reaction Network Models, 2023, submitted. Preprint at ChemrXiv, DOI: 10.26434/chemrxiv-2023-wltcr
Simulation parameters
- CHO networks constructed using polar bond break/bond formation rule set for CHO.
- High-energy nodes were excluded using the following rules:
(i) more than 3 rings, (ii) triple and allene bonds in rings, (iii) double bonds at
bridge atoms,(iv) double bonds in fused 3-membered rings.
- Neutral nodes were defined as containing only neutral molecules.
- Shortest path lengths were determined for all pairs of neutral nodes.
- Pairs of neutral nodes with shortest-path length > 8 were excluded.
- Additionally, pairs of neutral nodes connected only by shortest paths passing through
additional neutral nodes (reducible paths) were excluded.
For background and additional details, see paper above.
创建时间:
2023-05-09



