five

Analysis of nitrous oxide reductase diversity from wastewater: a SINTAX database

收藏
DataONE2024-04-23 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:5bad177d8b93ec331aea25518ca10387a10253d5545008a08fceeb26d9e2b318
下载链接
链接失效反馈
官方服务:
资源简介:
This study explores the genetic landscape of nitrous oxide (N2O) reduction in wastewater treatment plants (WWTPs) by profiling 1083 high-quality metagenome-assembled genomes (HQ MAGs) derived from 23 Danish full-scale WWTPs. The analysis focuses on the distribution and diversity of nitrous oxide reductase (nosZ) genes, key players in N2O reduction, and their connection to other nitrogen metabolism pathways. A custom pipeline for clade-specific nosZ gene identification outperformed existing methods, revealing the presence of 503 nosZ sequences in 489 MAGs. Notably, 48.7% of the MAGs harboured nosZ genes, with clade II dominating (92.3%). Taxonomic profiling reveals the distribution of nosZ clade I and clade II-containing MAGs, emphasizing the dominance of Bacteroidota and Pseudomonadota. Notably, Chloroflexota exhibits unexpected affiliations with nosZ clade I. The taxonomic diversity of non-denitrifying N2O-reducers is also explored, highlighting the presence of these organisms in Bacte..., High-quality metagenome-assembled genomes (HQ MAGs; 1083 in total) were obtained from 23 Danish full-scale WWTPs (Singleton et al., 2023; https://doi.org/10.1038/s41467-021-22203-2). Initially, these HQ MAGs underwent processing with Prodigal v2.6.2 to predict protein-coding genes, which were then isolated and translated into proteins. The identified nucleotide genes were compared to the NCBI GenBank v234 using the BLASTn algorithm and annotated using KEGG elements through EnrichM v0.5.0 to identify nosZ genes. The translated proteins were aligned to high-quality full-length clade I (n=20) and II (n=46) NosZ protein sequences from the Functional Gene Pipeline and Repository (FUNGENE) database version v9.9.11 using the BLASTp algorithm. Subsequently, the translated proteins were aligned to 3 full-length NosZ HMM files (1 clade I (638aa), 2 clades II (765, 656aa)) obtained from FUNGENE using the hmmsearch algorithm. Identified nosZ genes were manually filtered based on length criteria (10..., , # Analysis of nitrous oxide reductase diversity from wastewater: a SINTAX database [https://doi.org/10.5061/dryad.p5hqbzkwq](https://doi.org/10.5061/dryad.p5hqbzkwq) The data is a SINTAX formatted database containing 443 full-length clade-specific nosZ sequences. The data was sourced from Singleton et al., 2021. ## Description of the data and file structure The data is a SINTAX formatted database (fasta sequences), which contains fasta files and joined taxonomic information. To use this format of database, software such as ONT-AmpSeq could be used to map nosZ amplicon sequencing data from Oxford Nanopore to the database. The processing of identifying and filtering the sequences is described in: Unraveling the genetic potential of nitrous oxide reduction in wastewater treatment: Insights from metagenome-assembled genomes - publication in process. ## Sharing/Access information Software available on GitHub (see link in Related Works).
创建时间:
2024-04-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作