five

Compositional and Structural Fingerprints of Materials Project Time Split Data for use with Generative Materials Benchmarking Metrics

收藏
DataCite Commons2022-08-06 更新2024-07-29 收录
下载链接:
https://figshare.com/articles/dataset/Compositional_and_Structural_Fingerprints_of_Materials_Project_Time_Split_Data_for_use_with_Generative_Materials_Benchmarking_Metrics/20444109/1
下载链接
链接失效反馈
官方服务:
资源简介:
This is a supporting dataset for <em><strong>matbench-genmetrics</strong></em> [docs] [repo], a set of generative materials benchmarking metrics. The compositional (Magpie) and structural (CrystalNN) fingerprints* are produced in fingerprint_snapshot.py using <em><strong>mp-time-split</strong></em> data [docs] [repo] [figshare] and are given in <em>comp_fingerprints.csv</em> (132 features) and <em>struct_fingerprints.csv</em> (61 features), respectively. Each has an additional column in the first position, <em>material_id,</em> which contains the Materials Project <em>material_id</em>. So, in total there are 133 and 62 columns, respectively. There are 40476 entries, plus a header row with labels, so 40477 rows in total. The primary purpose of these datasets is to avoid repeating lengthy calculations each time a <em>matbench-genmetrics</em> benchmark is computed; thus, only the generated structures need to be featurized. The total runtime for the compositional and structural fingerprinting using 6 physical cores (12 virtual cores as determined by <em>multiprocessing.cpu_count()</em>) is approximately 50 minutes and 140 min, respectively. The benchmarks can be used with materials generative models such as <em>xtal2png</em><em>+</em><em>Imagen</em>. <br> *The use of Magpie and CrystalNN featurizers are based on the coverage metric from CDVAE [repo] [paper].
提供机构:
figshare
创建时间:
2022-08-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作