five

NOMAD Chemical Formulas and Calculation IDs

收藏
DataCite Commons2025-06-01 更新2024-07-29 收录
下载链接:
https://figshare.com/articles/dataset/NOMAD_Chemical_Formulas_and_Calculation_IDs/19319783/3
下载链接
链接失效反馈
官方服务:
资源简介:
<i>all-formula.csv</i> contains two columns: <i>calc_id</i> (Calculation ID) and <i>formula</i> (Chemical Formula). These were restricted to VASP DFT calculations, and do not include noble gases nor radioactive elements. Some calculation IDs have missing chemical formulas. The list has also been filtered down to unique (non-reduced) chemical formulas in <i>unique-formula.csv</i> along with the <i>calc_id</i>-s for each unique formula. No structural information is included directly in this data. REALLY, what you're probably interested most in is <i>unique-reduced-formula.csv</i>. because it is the most curated and is directly usable with e.g. <i>pymatgen</i>. What this contains is three columns: <i>calc_id</i>, <i>reduced_formula</i>, and <i>factor</i> which correspond to the Calculation ID, the reduced formula (e.g. Si2O4 --&gt; SiO2), and the factor (e.g. for Si2O4 --&gt; SiO2 the factor is 2). The formulas were first parsed via <i>pymatgen.core.Composition</i> class. Going from <i>all-formula.csv</i> to <i>unique-formula.csv</i> to <i>unique-reduced-formula.csv</i> gives 11680557 --&gt; 764431 --&gt; 695612 rows.<br>Finally, <i>bad-formula.csv</i> just contains the formulas that were skipped during processing (i.e. couldn't be processed with <i>pymatgen.core.Composition</i> for various reasons, 15 in total).<br><br>The data was downloaded on 2022-03-07. See the links below (esp. <i>nomad-examples</i> GitHub repository) for details on the data download and filtering process.
提供机构:
figshare
创建时间:
2022-03-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作