five

pmoA gene reference database (fasta-formatted sequences and taxonomy)

收藏
DataCite Commons2025-12-13 更新2025-04-15 收录
下载链接:
http://dataservices.gfz.de/panmetaworks/showshort.php?id=escidoc%3A1423157
下载链接
链接失效反馈
官方服务:
资源简介:
This data set is a part of result affiliated to our manuscript about pmoA gene (encoding the alpha subunit of the enzyme of particular methane monooxygenase). The taxonomy database consists of 7809 unaligned pmoA nucleotide sequences in fasta format and a corresponding taxonomy file, according the format specified by the software platforms of Mothur and QIIME. The taxonomy file is a two column text file where the first column is the accession number of the sequence and the second column is a string of taxonomic information separated by semicolons. We created a comprehensive taxonomy database for the pmoA nucleotide sequences which could be probed by the primer set combination of A189f and A682r. Sequences in this database were firstly retrieved from the NCBI database and progressively screened by Biopython or R scripts. The corresponding taxonomy was generally referred to the NCBI taxonomy if the explicit taxonomic ranks from phylum to species are available. For those with ambiguous taxonomies given by the NCBI database, taxonomic classification was improved as possible by referring to the Dumont’s database (Frontier in Microbiology, 2014, 5: 34. doi: 10.3389/fmicb.2014.00034).
提供机构:
GFZ Data Services
创建时间:
2016-04-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作