five

Lemonade Creek, Yellowstone National Park, USA - Microbial Community Analysis - Cyanidiophyceae genome data for HGT analysis

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13651145
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset consists of 12 metagenome samples that were collected from one of three environments in Yellowstone National Park: 4 samples (numbered 1, 2, 3, 4) are from the "CreekBiofilm" environment. 4 samples (1, 2, 3, 4) are from the "Endolithic" environment. 4 samples (1, 2, 3, 4) are from the "Soil" environment. We have found that there are two species of cyanidiophyceae present in these samples: one *Galdieria sulphuraria* (the `*Gsulp*` files) and one *Cyanidioschyzon merolae* (the `*Cmer*` files). For each of these species I extracted their contigs from the metagenome assembly if they had >=10% of their lengths covered by hits with >90% ID to the respective reference genome (i.e., contigs with >10% coverage of hits with >90% ID to a given reference genome). The majority of contigs have >90% hit coverage however, to prevent removal of contigs with novel sequences (arising via HGT or other processes), I used a lenient threshold of 10%. The naming of the files indicate which sample the contigs are from and which of the two cyanidiophyceae species they are putatively from. NOTE: that there are very few predicted proteins in the `YNP_CreekBiofilm_*_Gsulp*` files. This is because this environment is completely dominated by the other algal species and so we recovered very few contigs from this species from these environments.
创建时间:
2024-09-03
二维码
社区交流群
二维码
科研交流群
商业服务