Reconnected InChI-IUPAC Dataset for Metal-Containing Compounds
Source: DataONE | Updated: 2026-03-07 | Indexed: 2026-05-27
Download link:
https://search.dataone.org/view/sha256:f3a9e54a945dda500946c8bea8a6c230f7aef81d487e6ae24cb84aef61d8df0f
Description:
This dataset contains 1 million records for metal-containing compounds, each pairing a standard InChI and a reconnected InChI with an IUPAC systematic name. It is intended to support research on InChI-to-IUPAC translation. Each record includes category labels and metal annotations (such as category, has_metal and primary_metal) as well as three derived length fields: standard_inchi_len, reconnected_inchi_len and iupac_len. The dataset was cleaned by removing invalid or placeholder names, enforcing InChI and IUPAC format checks, applying length constraints, normalising whitespace, and deduplicating records so that each structure-name pair appears only once.
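The cleaning steps listed above could look roughly like the following Python sketch. It is an illustration under stated assumptions, not the dataset authors' pipeline: the column names standard_inchi, reconnected_inchi and iupac are inferred from the documented *_len fields, and the placeholder list, the length limits, the prefix-based format checks and the choice of deduplication key are all hypothetical.

```python
import re

# Hypothetical values: the description does not state the actual
# placeholder names or length limits used for the dataset.
PLACEHOLDER_NAMES = {"", "n/a", "na", "none", "unknown"}
MAX_INCHI_LEN = 2000
MAX_IUPAC_LEN = 1000


def normalise(text):
    """Collapse runs of whitespace into one space and trim both ends."""
    return re.sub(r"\s+", " ", text).strip()


def clean_records(records):
    """Yield cleaned copies of the input dicts with length fields added.

    Column names (standard_inchi, reconnected_inchi, iupac) are assumed.
    """
    seen = set()
    for rec in records:
        std = normalise(rec["standard_inchi"])
        recon = normalise(rec["reconnected_inchi"])
        name = normalise(rec["iupac"])

        # Drop invalid or placeholder names.
        if name.lower() in PLACEHOLDER_NAMES:
            continue
        # Simplified format checks (assumed): standard InChI strings start
        # with "InChI=1S/"; any "InChI=" prefix is accepted for the
        # reconnected form.
        if not std.startswith("InChI=1S/") or not recon.startswith("InChI="):
            continue
        # Length constraints.
        if max(len(std), len(recon)) > MAX_INCHI_LEN or len(name) > MAX_IUPAC_LEN:
            continue
        # Deduplicate on the structure-name pair (key choice is assumed).
        key = (std, name)
        if key in seen:
            continue
        seen.add(key)

        out = dict(rec)  # keeps category, has_metal, primary_metal, etc.
        out.update(
            standard_inchi=std,
            reconnected_inchi=recon,
            iupac=name,
            standard_inchi_len=len(std),
            reconnected_inchi_len=len(recon),
            iupac_len=len(name),
        )
        yield out
```

Because clean_records is a generator, it can be applied to a streamed file of one million rows without holding more than the set of seen keys in memory.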
Created:
2026-05-04



