
BAGS.v1.1: BAltic Gene Set gene catalogue

Source: DataCite Commons. Updated 2025-05-12; indexed 2024-07-13.
Download link:
https://figshare.scilifelab.se/articles/dataset/BAGS_v1_BAltic_Gene_Set_gene_catalogue/16677252/2
Description:
The BAltic Gene Set gene catalogue v1.1 encompasses 66,530,673 genes. The 66 million genes are based on metagenomic data from Alneberg et al. (2020), comprising 124 seawater samples that span the salinity and oxygen gradients of the Baltic Sea and capture seasonal dynamics at two locations. To obtain the gene catalogue, we used the mix-assembly approach described in Delgado et al. (2022).

The gene catalogue has been functionally and taxonomically annotated using the Mix-assembly Gene Catalog pipeline (https://github.com/EnvGen/mix_assembly_pipeline). The taxonomy annotation was performed using MMseqs2 [1] (UniRef90 [2]) and CAT [3] (GTDB [4]).

Here you find representative mix-assembly gene and protein sequences, and different types of annotations for the proteins. Also included are contigs for the co-assembly (see Delgado et al. 2022), gene and protein sequences from each individual assembly and from the co-assembly, and a table listing the genes in each of the clusters. See README for details.

When using the BAGSv1.1 gene catalogue, please cite:
1. Delgado LF, Andersson AF. Evaluating metagenomic assembly approaches for biome-specific gene catalogues. Microbiome 10, 72 (2022)
2. Alneberg J, Bennke C, Beier S, Bunse C, Quince C, Ininbergs K, Riemann L, Ekman M, Jürgens K, Labrenz M, Pinhassi J, Andersson AF. Ecosystem-wide metagenomic binning enables prediction of ecological niches from genomes. Commun Biol 3, 119 (2020)

References:
[1] M Mirdita, M Steinegger, F Breitwieser, J Söding, E Levy Karin, Fast and sensitive taxonomic assignment to metagenomic contigs, Bioinformatics, Volume 37, Issue 18, September 2021, Pages 3029–3031, https://doi.org/10.1093/bioinformatics/btab184
[2] Baris E. Suzek, Yuqi Wang, Hongzhan Huang, Peter B. McGarvey, Cathy H. Wu, the UniProt Consortium, UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches, Bioinformatics, Volume 31, Issue 6, March 2015, Pages 926–932, https://doi.org/10.1093/bioinformatics/btu739
[3] von Meijenfeldt, F.A.B., Arkhipova, K., Cambuy, D.D. et al. Robust taxonomic classification of uncharted microbial sequences and bins with CAT and BAT. Genome Biol 20, 217 (2019). https://doi.org/10.1186/s13059-019-1817-x
[4] Donovan H Parks, Maria Chuvochina, Christian Rinke, Aaron J Mussig, Pierre-Alain Chaumeil, Philip Hugenholtz, GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy, Nucleic Acids Research, Volume 50, Issue D1, 7 January 2022, Pages D785–D794, https://doi.org/10.1093/nar/gkab776
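
The gene and protein sequences described above can be given a quick sanity check after download. The following is a minimal sketch only: it assumes the sequence files are in FASTA format (the listing does not state this; the README is the authority), and the helper names `read_fasta`, `open_maybe_gzip`, and `summarize` are hypothetical, not part of the catalogue or its pipeline. The demo uses in-memory data with made-up identifiers, because the actual file names are not given here.

```python
import gzip
import io
from typing import Iterator, TextIO, Tuple


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (identifier, sequence) pairs from a FASTA-formatted text stream."""
    name = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Keep only the first whitespace-delimited token as the identifier.
            tokens = line[1:].split()
            name = tokens[0] if tokens else ""
            chunks = []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def open_maybe_gzip(path: str) -> TextIO:
    """Open a plain or gzip-compressed text file, chosen by file extension."""
    if path.endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


def summarize(handle: TextIO) -> Tuple[int, int]:
    """Return (number of sequences, total residues) for a FASTA stream."""
    count = 0
    total = 0
    for _, seq in read_fasta(handle):
        count += 1
        total += len(seq)
    return count, total


if __name__ == "__main__":
    # In-memory stand-in for a downloaded file; identifiers are invented.
    demo = io.StringIO(">gene_1 example\nATGC\nGG\n>gene_2\nTTAA\n")
    n_seqs, n_residues = summarize(demo)
    print(f"{n_seqs} sequences, {n_residues} residues")
```

For a real file, `summarize(open_maybe_gzip(path))` streams the records one at a time, which matters for a catalogue of this size; for the representative gene file the sequence count could then be compared against the 66,530,673 genes stated in the description.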
Provider:
Swedish Biodiversity Data Infrastructure (SBDI)
Created:
2024-01-09