Compositional and Structural Fingerprints of Materials Project Time Split Data for use with Generative Materials Benchmarking Metrics
收藏Figshare2022-08-10 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Compositional_and_Structural_Fingerprints_of_Materials_Project_Time_Split_Data_for_use_with_Generative_Materials_Benchmarking_Metrics/20444109/2
下载链接
链接失效反馈官方服务:
资源简介:
This is a supporting dataset for <em><strong>matbench-genmetrics</strong></em> <sub>[docs]</sub> <sub>[repo]</sub>, a set of generative materials benchmarking metrics. The compositional (Magpie) and structural (CrystalNN) fingerprints* are produced in <sub>fingerprint_snapshot.py</sub> using <em><strong>mp-time-split</strong></em> data <sub>[docs]</sub> <sub>[repo]</sub> <sub>[figshare]</sub> and are given in <em>comp_fingerprints.csv</em> (132 features) and <em>struct_fingerprints.csv</em> (61 features), respectively. Each has an additional column in the first position, <em>material_id,</em> which contains the <sub>Materials Project <em>material_id</em></sub>. So, in total there are 133 and 62 columns, respectively. There are 40476 entries, plus a header row with labels, so 40477 rows in total. The primary purpose of these datasets is to avoid repeating lengthy calculations each time a <em>matbench-genmetrics</em> benchmark is computed; thus, only the generated structures need to be featurized. The total runtime for the compositional and structural fingerprinting using 6 physical cores (12 virtual cores as determined by <em>multiprocessing.cpu_count()</em>) is approximately 50 minutes and 140 min, respectively. The benchmarks can be used with materials generative models such as <sub><em>xtal2png</em></sub><em>+</em><sub><em>Imagen</em></sub>. <br> *The use of Magpie and CrystalNN featurizers are based on the coverage metric from CDVAE <sub>[repo]</sub> <sub>[paper].</sub> <br> A small set of dummy data for testing purposes is also included (<em>dummy_comp_fingerprints.csv</em> and <em>dummy_struct_fingerprints.csv</em>)
提供机构:
Sparks, Taylor; Jablonka, Kevin Maik; Baird, Sterling G.; Smit, Berend
创建时间:
2022-08-06



