
16SGOSeq: A curated bacterial and archaeal 16S rRNA Gene Oral Sequences database

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/15074672
Description:
Within a given species, genomes and 16S rRNA gene sequences, along with their intragenomic copy numbers, can vary greatly across environments. Gene copy numbers are crucial for technologies that estimate microbial abundances from gene counts, such as polymerase chain reaction and high-throughput sequencing. With these methods, taxa with fewer gene copies may be underestimated, while taxa with more copies may be overestimated. It is therefore essential to have accurate gene copy number databases specific to the niche under study. The 16S rRNA Gene Oral Sequences database (16SGOSeq) records the number of 16S rRNA genes and their variants in the complete genomes of bacterial and archaeal species present in the human oral cavity. It includes 3,192 complete genomes of oral bacteria and 191 complete genomes of oral archaea, from which the 16S rRNA gene sequences were extracted and the sequence variants identified. For ease of use, a Python script is provided that filters sequences by taxonomy and calculates summary statistics, such as the mean number of genes per taxonomic group. The oral-specific database of prokaryotic organisms presented here, and the pipeline followed for its construction, can be applied by clinical microbiologists, bioinformaticians, and microbial ecologists in future microbiome research.
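The kind of taxonomy filtering, per-group averaging, and copy-number correction described above can be sketched as follows. This is a minimal illustration and not the script distributed with 16SGOSeq: the record layout, the field names (`genus`, `species`, `copies`), the function names, and all copy numbers are hypothetical placeholders. The correction step assumes the common approach of dividing each taxon's read count by its 16S copy number and renormalizing.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records. The actual 16SGOSeq file layout is not described here,
# and the copy numbers below are invented purely for illustration.
RECORDS = [
    {"genome": "g1", "genus": "Streptococcus", "species": "Streptococcus mutans", "copies": 5},
    {"genome": "g2", "genus": "Streptococcus", "species": "Streptococcus mutans", "copies": 5},
    {"genome": "g3", "genus": "Streptococcus", "species": "Streptococcus oralis", "copies": 4},
    {"genome": "g4", "genus": "Porphyromonas", "species": "Porphyromonas gingivalis", "copies": 4},
    {"genome": "g5", "genus": "Methanobrevibacter", "species": "Methanobrevibacter oralis", "copies": 2},
]


def filter_by_taxon(records, rank, name):
    """Keep only the records whose value at the given rank equals `name`."""
    return [r for r in records if r[rank] == name]


def mean_copies(records, rank):
    """Return the mean 16S gene copy number for each taxon at the given rank."""
    groups = defaultdict(list)
    for r in records:
        groups[r[rank]].append(r["copies"])
    return {taxon: mean(values) for taxon, values in groups.items()}


def correct_abundances(read_counts, copies_by_taxon):
    """Divide each read count by its copy number, then renormalize to sum to 1."""
    adjusted = {t: n / copies_by_taxon[t] for t, n in read_counts.items()}
    total = sum(adjusted.values())
    return {t: v / total for t, v in adjusted.items()}


if __name__ == "__main__":
    strep = filter_by_taxon(RECORDS, "genus", "Streptococcus")
    print(len(strep), "Streptococcus genomes")
    print(mean_copies(RECORDS, "genus"))
    # Equal read counts no longer imply equal abundance once copies differ.
    print(correct_abundances({"A": 100, "B": 100}, {"A": 5, "B": 2}))
```

In the last call, two taxa with identical read counts end up with different estimated abundances because the taxon carrying more gene copies contributes more reads per cell, which is the bias the database is meant to help correct.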
Created:
2025-03-24