Ultrafast and accurate sequence alignment and clustering of viral genomes
收藏DataCite Commons2025-06-01 更新2025-05-07 收录
下载链接:
https://figshare.com/articles/dataset/Ultrafast_and_accurate_sequence_alignment_and_clustering_of_viral_genomes/28294805/1
下载链接
链接失效反馈官方服务:
资源简介:
These files are associated with the publication "Ultrafast and accurate sequence alignment and clustering of viral genomes".phage-genomes.fnaComplete genome sequences of 4,244 bacteriophages in FASTA formatphage-ICTV_taxonomy.csvTaxonomic affiliations of 4,244 bacteriophages according to the ICTV taxonomyphage-genomes_simulated_mutations.csvThe expected (true) total ANI (tANI) values in the 70-100% range, derived from 10,000 pairs of bacteriophage genomes subjected to simulated mutations, including different levels of substitution, insertion, deletion, duplication, inversion, and translocation events. Mutations were introduced using Mutation-Simulator v3.0.2. Column descriptions:sample: sample IDref_id: Reference genome IDalt_id: Altered reference IDtotal_ani: True total ANI [%]ref_length: Reference genome lengthalt_length: Altered reference genome lengthsn: Substitions frequencyde: Deletions frequencyins: Insertions frequencydu: Duplications frequencyinv: Inversions frequencytl: Translocations frequencyn_sn: Substituted nucleotidesn_de: Deleted nucleotidesn_ins: Inserted nucleotidesn_du: Duplicated nucleotidesn_inv: Inverted nucleotidesn_tl: Translocated nucleotidesphage-genomes_simulated_mutations.fnaNucleotide sequences of reference and altered genomes.viruses-sample_contigs.fna94,225 viral metagenomic contigs subsampled from IMG/VR v4.1.viruses-blastn_ani.csv4,361,743 contig pairs obtained in BLASTn satisfying the threshold of ANI ≥ 95% and alignment fraction (query coverage) ≥ 85%
提供机构:
figshare
创建时间:
2025-01-28



