NOMAD Chemical Formulas and Calculation IDs

Source: Figshare. Updated: 2022-03-07. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/NOMAD_Chemical_Formulas_and_Calculation_IDs/19319783
Description:
all-formula.csv contains two columns: calc_id (Calculation ID) and formula (Chemical Formula). The entries are restricted to VASP DFT calculations and include neither noble gases nor radioactive elements. Some calculation IDs have missing chemical formulas. No structural information is included directly in this data.

The list has also been filtered down to unique (non-reduced) chemical formulas in unique-formula.csv, which lists the calc_id values for each unique formula.

The file you are probably most interested in is unique-reduced-formula.csv, because it is the most curated and is directly usable with e.g. pymatgen. It contains three columns: calc_id, reduced_formula, and factor, which correspond to the Calculation ID, the reduced formula (e.g. Si2O4 --> SiO2), and the factor (e.g. for Si2O4 --> SiO2 the factor is 2). The formulas were first parsed via the pymatgen.core.Composition class.

Going from all-formula.csv to unique-formula.csv to unique-reduced-formula.csv gives 11680557 --> 764431 --> 695612 rows.

Finally, bad-formula.csv contains only the formulas that were skipped during processing (i.e. those that could not be processed with pymatgen.core.Composition for various reasons, 15 in total).

The data was downloaded on 2022-03-07. See the links below (especially the nomad-examples GitHub repository) for details on the data download and filtering process.
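To make the reduced_formula and factor columns concrete, here is a minimal standard-library sketch of the reduction step. It is not the dataset's actual pipeline, which parsed formulas with pymatgen.core.Composition. The helpers `reduce_formula` and `group_by_reduced` and the calc_id values are hypothetical, the parser assumes flat formulas with integer counts, and it keeps elements in input order, so the ordering may differ from pymatgen's.

```python
import re
from functools import reduce
from math import gcd

# One element symbol followed by an optional integer count, e.g. "Si2" or "O".
_TOKEN = re.compile(r"([A-Z][a-z]?)(\d*)")


def reduce_formula(formula):
    """Return (reduced_formula, factor) for a flat formula such as 'Si2O4'.

    Illustrative only: no parentheses, charges, or fractional counts.
    """
    tokens = _TOKEN.findall(formula)
    # Reject anything the simple tokenizer did not consume completely.
    if not tokens or "".join(el + n for el, n in tokens) != formula:
        raise ValueError(f"unsupported formula: {formula!r}")

    counts = {}
    for element, amount in tokens:
        counts[element] = counts.get(element, 0) + int(amount or 1)

    # The factor is the greatest common divisor of all element counts.
    factor = reduce(gcd, counts.values())
    parts = []
    for element, amount in counts.items():
        reduced_amount = amount // factor
        parts.append(element if reduced_amount == 1 else f"{element}{reduced_amount}")
    return "".join(parts), factor


def group_by_reduced(rows):
    """Map each reduced formula to the (calc_id, factor) pairs that share it."""
    groups = {}
    for calc_id, formula in rows:
        reduced, factor = reduce_formula(formula)
        groups.setdefault(reduced, []).append((calc_id, factor))
    return groups


# Hypothetical rows in the shape of all-formula.csv (calc_id, formula).
example_rows = [("id-a", "Si2O4"), ("id-b", "SiO2"), ("id-c", "Fe2O3")]
example_groups = group_by_reduced(example_rows)
```

Here `example_groups["SiO2"]` collects both the Si2O4 and SiO2 entries with factors 2 and 1, which mirrors how several non-reduced formulas collapse to one row on the way from unique-formula.csv to unique-reduced-formula.csv.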
Created:
2022-03-07