Additional data and code for "You can move, but you can't hide: identification of mobile genetic elements with geNomad"
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7697489
下载链接
链接失效反馈官方服务:
资源简介:
benchmark_data: Data used to train and evaluate the classification models.
giant_virus_data: Sequences and metadata of giant viruses identified in public metagenomes.
neural_network_training: Code used to train geNomad's neural network-based classification model.
provirus_data: Data used to train and evaluate the conditional random field model employed by geNomad to identify provirus regions.
reference_sequences: Sequences of chromosomes, plasmids, and viruses that were used to build geNomad's marker dataset and to generate the training data for the classification models.
benchmark_data:用于训练与评估分类模型的数据集。
giant_virus_data:从公共宏基因组(metagenome)中鉴定出的巨型病毒序列及其元数据。
neural_network_training:用于训练geNomad基于神经网络的分类模型的代码。
provirus_data:用于训练与评估geNomad用于鉴定前病毒(provirus)区域的条件随机场(conditional random field)模型的数据集。
reference_sequences:用于构建geNomad标记数据集,并为分类模型生成训练数据的染色体(chromosome)、质粒(plasmid)与病毒序列。
创建时间:
2023-06-17



