SILVA database - all extracted bacterial and archaeal taxa (unfiltered)
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/12653481
Description:
As a first attempt to target microbially related data in the ODIS graph, we extracted all child clades of the bacterial and archaeal taxa from the SILVA database, that is, all sequence headers from both domains of life. We saved this dataset as a text file and curated the headers as follows:
The species name's epithet was removed, and any duplicate generic names were eliminated.
We removed non-alphabetic characters from the list, such as /*-_ and digits.
Additionally, we excluded names shorter than four characters and those containing more than two consecutive identical letters.
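The curation steps above can be sketched in Python as follows. This is only an illustrative sketch, not the script used to build the dataset: the function name `curate_names`, the input format (one name per string, with the genus as the first whitespace-separated token), the lowercasing, and the order in which the filters are applied are all assumptions.

```python
import re


def curate_names(raw_names):
    """Hypothetical re-implementation of the described curation steps."""
    seen = set()
    curated = []
    for raw in raw_names:
        tokens = raw.split()
        if not tokens:
            continue
        # Drop the species epithet: keep only the first token (assumed genus).
        name = tokens[0]
        # Remove non-alphabetic characters such as / * - _ and digits.
        # Lowercasing is an assumption, based on the lowercase examples given.
        name = re.sub(r"[^A-Za-z]", "", name).lower()
        # Exclude names shorter than four characters.
        if len(name) < 4:
            continue
        # Exclude names with more than two consecutive identical letters.
        if re.search(r"(.)\1\1", name):
            continue
        # Eliminate duplicate generic names, keeping first-seen order.
        if name not in seen:
            seen.add(name)
            curated.append(name)
    return curated


if __name__ == "__main__":
    sample = [
        "Escherichia coli",
        "Escherichia albertii",
        "Bacillus-2 subtilis",
        "SAR11 clade",
        "Aaaab example",
    ]
    print(curate_names(sample))
```

With the made-up sample above, only "escherichia" and "bacillus" survive: the second Escherichia entry is a duplicate, "SAR11" shrinks to three letters, and "Aaaab" contains three identical letters in a row.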
Even after this curation, the dataset still contained some taxonomic names that were not specific to microbes. These names may have been used as abbreviations for specific categories (e.g. "unio" for "uniola" or "unionicola"), as species identifiers such as "urhd", or as part of a species name, such as "blood" or "texas"; some may even have been coined purely for creative purposes by the name-giver, such as "unicorn".
Created:
2024-07-04



