five

Quantum mechanical electronic and geometric parameters for DNA k-mers as features for machine learning

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10865899
下载链接
链接失效反馈
官方服务:
资源简介:
With the development of advanced predictive modelling techniques, we are witnessing a steep increase in model development initiatives in genomics that employ high-end machine learning methodologies. Of particular interest are models that predict certain genomic or biological characteristics based solely on DNA sequence information. These models, however, treat the DNA sequence as a mere collection of four, A, T, G and C, letters, thus dismissing the past physico-chemical advancements in science that can enable the use of more intricate information about nucleic acid sequences. Here, we provide a comprehensive database of quantum mechanical and geometric features for all the permutations of 7-meric DNA in their representative B, A and Z conformations. The database is generated by employing the applicable high-cost and time-consuming quantum mechanical methodologies. This can thus make it seamless to associate a wealth of novel molecular features to any DNA sequence, by scanning it with a matching k-meric window and pulling the pre-computed values from our database for further use in modelling. We demonstrate the usefulness of our deposited features through their exclusive use in developing a model for A to C mutation rate constants. The DNA k-mer quantum mechanical parameters can also be found https://github.com/SahakyanLab/DNAkmerQM, the corresponding research and development code from https://github.com/SahakyanLab/NucleicAcidsQM, and the associated pre-print from https://doi.org/10.1101/2023.01.25.525597.
创建时间:
2024-03-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作