Genome-wide AnnotateMissense predictions for 90.6 million hg38 missense variants
Source: DataCite Commons. Updated: 2026-05-02. Indexed: 2026-05-07.
Download link:
https://zenodo.org/doi/10.5281/zenodo.19981866
Description:
This record contains the genome-wide AnnotateMissense prediction database and the model output accompanying the manuscript "AnnotateMissense: A Multi-source Annotation and Benchmarking Framework for Genome-wide Missense Pathogenicity Prediction".
AnnotateMissense is a scalable framework for genome-wide missense variant annotation, feature integration, benchmarking, and pathogenicity prediction. Starting from 90,643,830 hg38 missense single-nucleotide variants derived from dbNSFP v5.1/CAGI7 Annotate-All-Missense resources, the workflow integrates ANNOVAR-derived annotations, dbNSFP features, AlphaMissense scores, ESM-derived protein language model features, and engineered biological features.
The uploaded files include:
variants.duckdb.gz: compressed DuckDB database containing genome-wide missense variant annotations and AnnotateMissense prediction outputs.
UQ_BioSig_model_Final.tsv.gz: compressed final model-associated prediction/output table.
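
As a minimal sketch of how the two uploaded files might be inspected after download, the Python helpers below stream-decompress a .gz file, preview the first rows of the compressed TSV, and list the tables in the DuckDB database. The function names are illustrative and not part of the AnnotateMissense code base. The record does not document table or column names, so the sketch only discovers them rather than assuming any. The last helper assumes the third-party duckdb package is installed.

```python
import csv
import gzip
import shutil
from pathlib import Path


def decompress_gzip(src: Path, dst: Path) -> Path:
    """Stream-decompress a .gz file to dst without loading it into memory."""
    with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)
    return dst


def peek_tsv(path: Path, n_rows: int = 5):
    """Return the header and the first n_rows of a gzip-compressed TSV."""
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t")
        header = next(reader)
        # zip stops after n_rows, so the rest of the file is never read
        rows = [row for _, row in zip(range(n_rows), reader)]
    return header, rows


def list_duckdb_tables(db_path: Path):
    """List the tables in a DuckDB file (assumes `pip install duckdb`)."""
    import duckdb  # imported lazily so the helpers above work without it

    con = duckdb.connect(str(db_path), read_only=True)
    try:
        return con.execute("SHOW TABLES").fetchall()
    finally:
        con.close()
```

Assuming the files sit in the working directory, one would first call decompress_gzip(Path("variants.duckdb.gz"), Path("variants.duckdb")), then list_duckdb_tables(Path("variants.duckdb")) to see which tables exist before writing any query, and peek_tsv(Path("UQ_BioSig_model_Final.tsv.gz")) to check the column layout of the prediction table. Opening the database read-only avoids modifying the downloaded file by accident.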
The source code and workflow scripts are available at:
https://github.com/MuhammadMuneeb007/CAGI7_Annotate_All_Missense
These predictions are intended for research prioritisation, variant triage, and reproducible benchmarking. They are not standalone clinical classifications and should not be used as the sole basis for clinical decision-making.
Provider:
Zenodo
Created:
2026-05-02



