H5N1 Influenza A Virus Genomes: Codon Usage, Host Adaptation, and Temporal-Geographic Metadata n=139,000
Source: NIAID Data Ecosystem. Indexed: 2026-05-10
Download link:
https://figshare.com/articles/dataset/H5N1_Influenza_A_Virus_Genomes_Codon_Usage_Host_Adaptation_and_Temporal-Geographic_Metadata_n_139_000/30340129
Description:
This dataset contains 139,000 complete H5N1 influenza A virus sequences sourced from NCBI and processed with local Python bioinformatics scripts. Sequences were filtered for completeness, aligned with MAFFT (using --parttree --retree 1 --maxiterate 0 for scalability), and trimmed using custom gap-based methods.
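The gap-based trimming method is not specified in this description. One common approach is to drop alignment columns whose gap fraction exceeds a threshold; the sketch below illustrates that idea only. The function name `trim_gappy_columns` and the 0.5 threshold are assumptions for illustration, not the authors' settings.

```python
def trim_gappy_columns(aligned, max_gap_fraction=0.5):
    """Drop alignment columns whose gap fraction exceeds max_gap_fraction.

    `aligned` is a list of equal-length aligned sequences (strings).
    The threshold is a hypothetical default, not taken from the dataset.
    """
    if not aligned:
        return []
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")
    n = len(aligned)
    # Keep a column only if the share of '-' characters is within the limit.
    keep = [
        i for i in range(length)
        if sum(seq[i] == "-" for seq in aligned) / n <= max_gap_fraction
    ]
    return ["".join(seq[i] for i in keep) for seq in aligned]


# Toy alignment: the fourth column is 2/3 gaps and is removed.
example = ["ATG-CA", "ATGGCA", "A-G-CA"]
trimmed = trim_gappy_columns(example, max_gap_fraction=0.5)
```

For an alignment of this size, a column-wise NumPy implementation would likely be faster; the pure-Python version is kept here for readability.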
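Codon usage metrics were computed per sequence: effective number of codons (ENc), codon adaptation index (CAI, relative to human and avian reference sets), GC content at the third codon position (GC3), relative synonymous codon usage (RSCU), codon pair bias (CPB), parity rule 2 (PR2) bias, and mutation burden.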
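As an illustration of two of these metrics, the sketch below computes GC3 and RSCU for a single coding sequence using the standard genetic code. It assumes an ungapped, in-frame CDS and skips codons containing gaps or ambiguous bases. The exact definitions used for the dataset are not stated here (for example, some GC3 definitions exclude Met, Trp, and stop codons), so this is one reasonable reading rather than the authors' implementation.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons(cds):
    """Yield unambiguous codons from an in-frame coding sequence."""
    cds = cds.upper().replace("U", "T")
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if codon in CODON_TABLE:  # skips gaps and ambiguous bases
            yield codon


def gc3(cds):
    """Fraction of codons with G or C at the third position (all codons)."""
    third = [c[2] for c in codons(cds)]
    return sum(b in "GC" for b in third) / len(third) if third else float("nan")


def rscu(cds):
    """RSCU = observed codon count / mean count across its synonymous family."""
    counts = Counter(codons(cds))
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":  # stop codons are excluded
            families[aa].append(codon)
    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in family:
            values[c] = counts[c] * len(family) / total
    return values


toy_cds = "ATGGCTGCCGCTTAA"  # Met-Ala-Ala-Ala-stop
toy_gc3 = gc3(toy_cds)
toy_rscu = rscu(toy_cds)
```

An RSCU of 1.0 means a codon is used as often as expected under uniform synonymous usage; values above 1.0 indicate a preferred codon.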
Temporal (year) and geographic (country/city) metadata were extracted from FASTA headers. The dataset includes the aligned FASTA, a codon metrics CSV, QC reports, and plots for our upcoming journal article.
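The header layout is not documented here. Assuming the headers carry standard influenza strain names of the form A/host/location/isolate/year (with the host omitted for human isolates), metadata extraction might look like the sketch below. The regular expression, the function name `parse_header`, and the accession in the example are hypothetical.

```python
import re

# Matches strain names such as A/goose/Guangdong/1/1996 or A/Vietnam/1203/2004.
# The host field is optional; by convention it is omitted for human isolates.
STRAIN_RE = re.compile(
    r"A/(?:(?P<host>[^/()]+)/)?"
    r"(?P<location>[^/()]+)/"
    r"(?P<isolate>[^/()]+)/"
    r"(?P<year>\d{4})(?!\d)"
)


def parse_header(header):
    """Return host, location, isolate, and year from a FASTA header, or None."""
    match = STRAIN_RE.search(header)
    if match is None:
        return None
    return {
        "host": match.group("host"),  # None when the host field is absent
        "location": match.group("location"),
        "isolate": match.group("isolate"),
        "year": int(match.group("year")),
    }


# The accession "XX000000.1" is a placeholder, not a real record.
avian = parse_header(">XX000000.1 Influenza A virus (A/goose/Guangdong/1/1996(H5N1))")
human = parse_header(">XX000000.1 Influenza A virus (A/Vietnam/1203/2004(H5N1))")
```

Strain names give a single location field, so separating country from city would need an additional lookup table or richer header fields than this sketch assumes.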
Now being shared via: https://figshare.com/s/036ddf020b4b479db080
Created:
2025-10-12



