Supporting data for "Synonymous variants that disrupt mRNA structure are significantly constrained in the human population"
收藏DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/100878
下载链接
链接失效反馈官方服务:
资源简介:
The role of synonymous single nucleotide variants in human health and disease is poorly understood, yet there is a growing body of evidence to suggest that this class of silent genetic variation plays multiple regulatory roles in both transcription and translation. One mechanism by which synonymous codons direct and modulate the translational process is through alteration of the elaborate structure formed by single-stranded mRNA molecules. While tools to computationally predict the impact of non-synonymous variants on protein structure are plentiful, analogous tools to systematically assess how synonymous variants might disrupt mRNA structure are lacking. <br><br>To address this need, we developed novel software using a parallel processing framework for large-scale generation of secondary RNA structures and folding statistics for the transcriptome of any species. Focusing our analysis on the human transcriptome, we calculated 5 billion RNA folding statistics for 469 million single nucleotide variants in 45,800 transcripts. By considering the impact of all possible synonymous variants globally, we discover that synonymous variants predicted to disrupt mRNA structure have significantly lower rates of incidence in the human population.<br><br>These findings support the hypothesis that synonymous variants may play a role in genetic disorders due to their effects on mRNA structure. Given that the community lacks tools to evaluate the potential pathogenic impact of synonymous variants, we provide RNA stability, edge distance and diversity metrics for every nucleotide in the human transcriptome and introduce a Structural Predictivity Index (SPI) to quantify structural constraint operating on any synonymous variant. Because no single RNA-folding metric can capture the diversity of mechanisms by which a variant could alter secondary mRNA structure, we generated a SUmmarized RNA Folding (SURF) metric to provide a single measurement to predict the impact of secondary structure altering variants in human genetic studies.<br><br>To access the unique list of genomic coordinates and their associated scores download RNAStability_v10.5.1_hg38_distinct_SURF_SPI_Phred_GitHub_Export.tsv.gz
提供机构:
GigaScience Database
创建时间:
2021-03-09



