COins database
收藏NIAID Data Ecosystem2026-04-30 收录
下载链接:
https://figshare.com/articles/dataset/COins_database/19130465
下载链接
链接失效反馈官方服务:
资源简介:
COins is a database of COI-5P sequences of insects that includes over 532,000 representative sequences of more than 106,000 species specifically formatted for the QIIME2 software platform. It was developed through a combination of automated and manually curated steps, starting from insects COI sequences available in the Barcode of Life Data System selecting sequences that comply to several standards, including a species-level identification.
seq-degapped.qza --> reference sequences
taxonomy.qza --> sequences taxonomy
SklearnClassifier_COins_QIIME2_v2024.5.qza (NEW!) --> naïve Bayes taxonomic classifier trained on COins (QIIME2 version 2024.5)
SklearnClassifier_COins_QIIME2_v2023.5.qza --> naïve Bayes taxonomic classifier trained on COins (QIIME2 version 2023.5)
SklearnClassifier_COins_QIIME2_v2022.2.qza --> naïve Bayes taxonomic classifier trained on COins (QIIME2 version 2022.2)
Sequences_metadata1.tsv --> Identification procedure of voucher specimens from which reference sequences were developed.
Identification procedure is reported for each sequence included in COins (BOLD id reported in BOLDid reference column) and for all identical sequences within haplotypes that were removed at Step 5 of COins curation (those for which BOLD id is not available in BOLDid reference column). The haplotype to which each sequence belongs is reported in Haplotype column (haplotypes of each species are labeled with increasing numbers). Identification procedure information derived from sequences associated metadata provided by BOLD system.
Sequences_metadata2.tsv -->
Identical sequences belonging to different species present within COins.
Each row represents a cluster of identical sequences associated to different species, sequences included in the cluster are labeled with species name and BOLD id.
创建时间:
2022-05-16



