Reconnected InChI-IUPAC Dataset for Metal-Containing Compounds

DataONE, updated 2026-03-07, indexed 2026-05-27
Download link:
https://search.dataone.org/view/sha256:f3a9e54a945dda500946c8bea8a6c230f7aef81d487e6ae24cb84aef61d8df0f
Description:
This dataset contains 1 million records for metal-containing compounds, each pairing a standard InChI and a reconnected InChI with an IUPAC systematic name. It is intended to support research on InChI-to-IUPAC translation. Each record carries a category label and metal annotations (category, has_metal and primary_metal), plus three derived length fields: standard_inchi_len, reconnected_inchi_len and iupac_len. The dataset was cleaned by removing invalid or placeholder names, enforcing InChI and IUPAC format checks, applying length constraints, normalising whitespace, and deduplicating records on unique structure-name pairs.
Created:
2026-05-04
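
The cleaning steps named in the description can be sketched roughly as below. This is an illustration only, not the dataset authors' pipeline: the input field names (standard_inchi, reconnected_inchi, iupac_name), the placeholder list, the length limit, and the prefix-based InChI check are all assumptions, since the description does not specify them. Only the three *_len field names come from the description.

```python
import re

# Hypothetical values: the dataset description does not state them.
PLACEHOLDER_NAMES = {"", "n/a", "na", "none", "unknown"}
MAX_LEN = 1024


def normalise(text):
    """Collapse runs of whitespace to one space and trim both ends."""
    return re.sub(r"\s+", " ", text).strip()


def looks_like_inchi(value):
    # Assumed format check: accept a version-1 InChI prefix,
    # standard ("InChI=1S/") or non-standard ("InChI=1/").
    return value.startswith(("InChI=1S/", "InChI=1/"))


def clean(records):
    """Return cleaned copies of the records that pass every check."""
    seen = set()
    cleaned = []
    for rec in records:
        std = normalise(rec.get("standard_inchi", ""))
        recon = normalise(rec.get("reconnected_inchi", ""))
        name = normalise(rec.get("iupac_name", ""))

        # Drop placeholder or empty names.
        if name.lower() in PLACEHOLDER_NAMES:
            continue
        # Drop records whose identifiers fail the (assumed) format check.
        if not (looks_like_inchi(std) and looks_like_inchi(recon)):
            continue
        # Apply the (assumed) length constraint to all three strings.
        if max(len(std), len(recon), len(name)) > MAX_LEN:
            continue
        # Deduplicate on the structure-name pair; using the standard InChI
        # as the structure key is an assumption.
        key = (std, name)
        if key in seen:
            continue
        seen.add(key)

        cleaned.append({
            **rec,
            "standard_inchi": std,
            "reconnected_inchi": recon,
            "iupac_name": name,
            "standard_inchi_len": len(std),
            "reconnected_inchi_len": len(recon),
            "iupac_len": len(name),
        })
    return cleaned
```

The length fields are computed after whitespace normalisation, so they describe the strings as stored rather than as received. Whether the published dataset does the same is not stated.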