
MD17 data for graph2mat

DataCite Commons | Updated: 2024-08-06 | Indexed: 2025-04-10
Download link:
https://data.dtu.dk/articles/dataset/MD17_data_for_graph2mat/26195285
Resource description:
Creators
------------

Pol Febrer (pol.febrer@icn2.cat, ORCID 0000-0003-0904-2234)
Peter Bjorn Jorgensen (peterbjorgensen@gmail.com, ORCID 0000-0003-4404-7276)
Arghya Bhowmik (arbh@dtu.dk, ORCID 0000-0003-3198-5116)

Related publication
-------------------

The dataset is published as part of the paper: "GRAPH2MAT: UNIVERSAL GRAPH TO MATRIX CONVERSION FOR ELECTRON DENSITY PREDICTION" (https://doi.org/10.26434/chemrxiv-2024-j4g21)

https://github.com/BIG-MAP/graph2mat

Short description
------------------

This dataset contains the Hamiltonian, Overlap, Density and Energy Density matrices from SIESTA calculations of a subset of the MD17 aspirin dataset. The subset is taken from the third split in (https://doi.org/10.6084/m9.figshare.12672038.v3).

SIESTA 5.0.0 was used to compute the dataset.

Contents
-----------------

The dataset has two directories:

- pseudos: Contains the pseudopotentials used for the calculation (obtained from http://www.pseudo-dojo.org/, type NC SR (ONCVPSP v0.5), PBE, standard accuracy)
- splits: The data splits used in the published paper. Each file "splits_X.json" contains the splits for training size X.

And then, three directories containing the calculations with different basis sets:

- matrix_dataset_defsplit: Uses the default split-valence DZP basis in SIESTA.
- matrix_dataset_optimsplit: Uses a split-valence DZP basis optimized for aspirin.
- matrix_dataset_defnodes: Uses the default nodes DZP basis in SIESTA.

Each of the basis directories has two subdirectories:

- basis: Contains the files specifying the basis used for each atom.
- runs: The results of running the SIESTA simulations. Contents are discussed next.

The "runs" directory contains one directory for each run, named with the index of the run. Each directory contains:

- RUN.fdf, geom.fdf: The input files used for the SIESTA calculation.
- RUN.out: The log of the SIESTA run, which apar
- siesta.TSDE: Contains the Density and Energy Density matrices.
- siesta.TSHS: Contains the Hamiltonian and Overlap matrices.

Each matrix can be read using the sisl python package (https://github.com/zerothi/sisl) like:

```python
import sisl

matrix = sisl.get_sile("RUN.fdf").read_X()
```

where X is hamiltonian, overlap, density_matrix or energy_density_matrix.

To reproduce the results presented in the paper, follow the documentation of the graph2mat package (https://github.com/BIG-MAP/graph2mat).

Cite this data
------------------

https://doi.org/10.11583/DTU.c.7310005
© 2024 Technical University of Denmark

License
-----------------

This dataset is published under the CC BY 4.0 license. This license allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator.
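The run-directory layout and the sisl reading call described under Contents can be combined into a small helper. The following is only a sketch under stated assumptions: the function names (`list_runs`, `missing_files`, `read_matrix`) are hypothetical and not part of graph2mat or sisl, run directories are assumed to carry integer names, and `sisl` is imported lazily so the file-system helpers work without it installed.

```python
from pathlib import Path

# File names listed in the dataset description for each run directory.
EXPECTED_FILES = ("RUN.fdf", "geom.fdf", "RUN.out", "siesta.TSDE", "siesta.TSHS")

# Matrix kinds named in the description; each corresponds to a read_<kind>() call.
MATRIX_KINDS = ("hamiltonian", "overlap", "density_matrix", "energy_density_matrix")


def list_runs(basis_dir):
    """Return the run directories under <basis_dir>/runs, ordered by run index."""
    runs_root = Path(basis_dir) / "runs"
    run_dirs = [p for p in runs_root.iterdir() if p.is_dir()]

    def sort_key(path):
        # Assumption: runs are named with integers; any other name sorts last.
        if path.name.isdigit():
            return (0, int(path.name), path.name)
        return (1, 0, path.name)

    return sorted(run_dirs, key=sort_key)


def missing_files(run_dir):
    """Return the expected file names that are absent from a run directory."""
    return [name for name in EXPECTED_FILES if not (Path(run_dir) / name).is_file()]


def read_matrix(run_dir, kind):
    """Read one matrix from a run directory through its RUN.fdf file."""
    if kind not in MATRIX_KINDS:
        raise ValueError(f"kind must be one of {MATRIX_KINDS}, got {kind!r}")
    import sisl  # lazy import: only needed when a matrix is actually read

    sile = sisl.get_sile(str(Path(run_dir) / "RUN.fdf"))
    # Equivalent to sile.read_hamiltonian(), sile.read_overlap(), and so on.
    return getattr(sile, f"read_{kind}")()
```

With the dataset unpacked and sisl installed, a call such as `read_matrix(list_runs("matrix_dataset_defsplit")[0], "hamiltonian")` would read the Hamiltonian of the lowest-indexed run; checking `missing_files` first helps to spot incomplete downloads.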
Provided by:
Technical University of Denmark
Created:
2024-08-06