
m5C Motif Atlas of SARS-CoV-2: CG, CCA, and ACW Sites Across 9.36 Million Genomes

NIAID Data Ecosystem, indexed 2026-05-10
Download link:
https://doi.org/10.7910/DVN/4IALUH
Resource description:
This dataset provides a comprehensive annotation of the RNA m5C-associated motifs CG, CCA, and ACW across 9,356,279 high-quality SARS-CoV-2 genomes (aligned to reference NC_045512.2). For each genome, we report motif counts and 1-based genomic positions. The data are aggregated into 118 chunks (grouped into 24 compressed bundles) with per-chunk and global summaries.

Boundaries and notes for use: Motifs are identified in silico from sequence context and are not experimentally validated. Genomes are filtered for length ≥28,000 nt to exclude fragments. Positional data reflect alignment coordinates and are not adjusted for indels or structural variants. The dataset is designed for population-level motif frequency analysis, not single-genome functional inference. It does not include patient metadata, geography, or time; only sequence-derived features are provided.

Analysis done by: TahirHB@Hotmail.Com
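As an illustration of what in silico motif identification with 1-based positions can look like, here is a minimal Python sketch. It is not the pipeline used to build this dataset, which the description does not specify. It assumes that W is the IUPAC code for A or T, that sequences use the DNA alphabet (T rather than U), and that overlapping matches are counted. The names `motif_positions` and `annotate` are hypothetical. Positions here are offsets within the input string, whereas the dataset reports alignment coordinates relative to NC_045512.2.

```python
import re

# IUPAC codes needed for the three motifs; W = A or T (an assumption
# about how the dataset interprets "ACW").
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]"}

MOTIFS = ("CG", "CCA", "ACW")
MIN_LENGTH = 28_000  # length filter stated in the dataset description


def motif_positions(sequence, motif):
    """Return 1-based start positions of every match, including overlaps."""
    pattern = "".join(IUPAC[base] for base in motif)
    # A lookahead consumes no characters, so overlapping hits are all found.
    regex = re.compile(f"(?=({pattern}))")
    return [m.start() + 1 for m in regex.finditer(sequence.upper())]


def annotate(sequence):
    """Map each motif to its count and positions; None if below the filter."""
    if len(sequence) < MIN_LENGTH:
        return None
    result = {}
    for motif in MOTIFS:
        positions = motif_positions(sequence, motif)
        result[motif] = {"count": len(positions), "positions": positions}
    return result
```

Calling `annotate` on a genome string returns a dictionary keyed by motif, which could then be aggregated across genomes for population-level frequency analysis.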
Created:
2025-10-27