
Complete bacteriophage genomes containing simulated mutations

DataCite Commons | Updated: 2024-06-21 | Indexed: 2024-08-26
Download link:
https://figshare.com/articles/dataset/Complete_bacteriophage_genomes_containing_simulated_mutations/25907815
Description:
These files are associated with the publication "Ultrafast and accurate sequence alignment and clustering of viral genomes".

1. phage-genomes_simulated_mutations.csv: the expected (true) total ANI (tANI) values in the 70-100% range, derived from 10,000 pairs of bacteriophage genomes subjected to simulated mutations, including different levels of substitution, insertion, deletion, duplication, inversion, and translocation events. Mutations were introduced using Mutation-Simulator v3.0.2.

Column descriptions:
- sample: Sample ID
- ref_id: Reference genome ID
- alt_id: Altered reference ID
- total_ani: True total ANI [%]
- ref_length: Reference genome length
- alt_length: Altered reference genome length
- sn: Substitutions frequency
- de: Deletions frequency
- ins: Insertions frequency
- du: Duplications frequency
- inv: Inversions frequency
- tl: Translocations frequency
- n_sn: Substituted nucleotides
- n_de: Deleted nucleotides
- n_ins: Inserted nucleotides
- n_du: Duplicated nucleotides
- n_inv: Inverted nucleotides
- n_tl: Translocated nucleotides

2. phage-genomes_simulated_mutations.fna: nucleotide sequences of reference and altered genomes.
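As a rough illustration of how the CSV columns described above might be consumed, the following Python sketch reads rows with the standard `csv` module and keeps pairs within a chosen tANI window. It assumes a comma-delimited file whose header row uses exactly the listed column names; the helper names `load_pairs` and `filter_by_tani` are hypothetical, and the two sample rows are made up for the example and are not taken from the dataset.

```python
import csv
import io

# Column names as listed in the dataset description.
FLOAT_COLUMNS = ["total_ani", "sn", "de", "ins", "du", "inv", "tl"]
INT_COLUMNS = ["ref_length", "alt_length",
               "n_sn", "n_de", "n_ins", "n_du", "n_inv", "n_tl"]


def load_pairs(handle):
    """Read genome-pair rows from a CSV handle, converting numeric columns."""
    rows = []
    for record in csv.DictReader(handle):
        for name in FLOAT_COLUMNS:
            record[name] = float(record[name])
        for name in INT_COLUMNS:
            # Assumption: counts and lengths are whole numbers; going through
            # float() also tolerates values written as "400.0".
            record[name] = int(float(record[name]))
        rows.append(record)
    return rows


def filter_by_tani(rows, low, high):
    """Keep pairs whose true total ANI lies in [low, high] percent."""
    return [row for row in rows if low <= row["total_ani"] <= high]


# Made-up sample for illustration only; these values are NOT from the dataset.
SAMPLE = (
    "sample,ref_id,alt_id,total_ani,ref_length,alt_length,"
    "sn,de,ins,du,inv,tl,n_sn,n_de,n_ins,n_du,n_inv,n_tl\n"
    "s1,refA,altA,95.5,40000,40100,0.01,0.001,0.001,0,0,0,400,40,140,0,0,0\n"
    "s2,refB,altB,72.0,50000,49000,0.1,0.02,0.01,0,0,0,5000,1500,500,0,0,0\n"
)

pairs = load_pairs(io.StringIO(SAMPLE))
high_identity = filter_by_tani(pairs, 90.0, 100.0)
print([row["sample"] for row in high_identity])
```

To use it on the real file, `load_pairs(open("phage-genomes_simulated_mutations.csv", newline=""))` would be the natural call, provided the delimiter and header assumptions hold.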

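The .fna file holds nucleotide sequences, so a small FASTA reader is enough to pull them into memory. The sketch below is a minimal pure-Python parser, not the authors' tooling. It assumes that the first whitespace-delimited token of each header line is the record ID and that these IDs correspond to the `ref_id` and `alt_id` values in the CSV; neither assumption is confirmed by the description. The function name `read_fasta` and the sample records are hypothetical.

```python
import io


def read_fasta(handle):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    sequences = {}
    current_id = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Store the previous record before starting a new one.
            if current_id is not None:
                sequences[current_id] = "".join(chunks)
            # Assumption: the ID is the first token of the header line.
            fields = line[1:].split()
            current_id = fields[0] if fields else ""
            chunks = []
        elif current_id is not None:
            chunks.append(line)
    if current_id is not None:
        sequences[current_id] = "".join(chunks)
    return sequences


# Made-up records for illustration only; NOT sequences from the dataset.
SAMPLE_FASTA = ">refA example reference\nACGT\nACGT\n>altA\nACGA\n"

genomes = read_fasta(io.StringIO(SAMPLE_FASTA))
print({name: len(seq) for name, seq in genomes.items()})
```

If the ID assumption holds, the sequence lengths obtained this way could be compared against the `ref_length` and `alt_length` columns as a quick consistency check.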
Provider:
figshare
Created:
2024-06-21