five

Taxonomic classification based on k-mers

收藏
Zenodo2022-09-01 更新2026-06-04 收录
下载链接:
https://zenodo.org/record/7040145
下载链接
链接失效反馈
官方服务:
资源简介:
DNA sequencing provides the possibility to obtain complete genomic DNA from environmental samples without the need for laboratory microbiological cultures. To this end, metagenomics, the direct DNA sequencing from microbial communities, has changed radically the field of microbiology, by unearthing a broad space of the planet’s microbial diversity, much of which remains unknown. Metagenomic approaches have become standard methods for identifying the biodiversity and the gene or metabolomic functionalities of bacterial and archaeal communities, with many applications not only in microbial ecology but also in public health as in clinical diagnostics and detection of pathogens. Yet, the decrease in the cost of high throughput sequencing and the great amount of microbial data produced every day, highlight one of the main biological questions: the taxonomic classification of metagenomic short reads. Which organisms are contained in a sample? Are there any features that can be used to identify them? To this end, many algorithms have been developed that achieve high speed, by counting k-mers, short sequence substrings of fixed-length k. In this way for the provided input sequences, a list of features can be computed that describes each one of them. Subsequently, the question now reforms to how can the produced k-mers be used for the taxonomic classification of the input sequences. Sample processing, sequencing, and core amplicon data analysis were performed by the Earth Microbiome Project (www.earthmicrobiome.org), and all amplicon sequence data and metadata have been made public through the EMP data portal (qiita.microbio.me/emp): Thompson, L. R., Sanders, J. G., McDonald, D., Amir, A., …, Jansson, J. K., Gilbert, J. A., Knight, R., & The Earth Microbiome Project Consortium. (2017). A communal catalogue reveals Earth’s multiscale microbial diversity. Nature, 551:457-463. doi:10.1038/nature24621.
提供机构:
Zenodo
创建时间:
2022-09-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作