five

ChEMBL3D: Quantum-Accurate 3D Conformers for ChEMBL at Scale

收藏
Figshare2026-03-10 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/_b_ChEMBL3D_Quantum-Accurate_3D_Conformers_for_ChEMBL_at_Scale_b_/31428449
下载链接
链接失效反馈
官方服务:
资源简介:
ChEMBL3D is a large-scale 3D molecular conformer dataset curated from ChEMBL v34. It provides hydrogen-complete molecular topologies and optimized conformer geometries, organized by atom-count groups for efficient access and processing.The dataset includes:topologies: grouped SDF/SMI files containing molecular topologies (group IDs such as 007, 008, …).zarr_database: size-grouped Zarr conformer data with aligned arrays for:atomic numbers (numbers)3D coordinates (coord)absolute energies (energy)relative energies (relative_energy)stereochemistry identifiers (stereo_id)molecule identifiers (mol_id)molecular charge (charge)Conformers were produced and curated through a multi-stage workflow including protomer/stereoisomer handling, conformer generation, geometry optimization with an AIMNet2 potential, and topology/conformer consistency checks. The release is intended to support molecular ML, conformer ranking, force-field development, and large-scale cheminformatics workflows.For distribution on platforms with file-count limits, this release is provided as compressed archives:topologies.tar.zstzarr_database.tar.zst.part-* (split compressed archive parts)Archive integrity checks are provided via ARCHIVE_SHA256SUMS.For full dataset structure, packaging/unpacking commands, and usage examples, see the included README.md.
创建时间:
2026-03-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作