
Rickettsia PhyloFlash and Kraken analysis of arthropod whole genome projects from Sequence Read Archive

Source: DataCite Commons. Updated: 2025-06-01. Indexed: 2024-07-28.
Download link:
https://figshare.com/articles/dataset/Rickettsia_PhyloFlash_and_Kraken_analysis_of_arthropod_whole_genome_projects_from_Sequence_Read_Archive/12801140/2
Description:
Insights into the balance of <i>Rickettsia</i> groups within arthropod symbioses were obtained by searching for <i>Rickettsia</i> in Illumina datasets associated with arthropod whole genome sequence (WGS) projects in the Sequence Read Archive (SRA; 60,409 records as of 20 May 2019). To reduce bias from over-represented laboratory model species (e.g. <i>Drosophila</i> spp., <i>Anopheles</i> spp.), a single dataset per species was examined; where multiple datasets existed for a species, the one with the largest read count was retained.

The retained datasets were screened with phyloFlash, which finds, extracts and identifies <i>16S rRNA</i> sequences. Reconstructed full <i>16S rRNA</i> sequences affiliated to <i>Rickettsia</i> were extracted and compared phylogenetically to sequences derived from the targeted screen (see sections above) to assess group representation within the genus. The microbial composition of every SRA dataset for which phyloFlash did not reconstruct a <i>Rickettsia 16S rRNA</i> sequence was re-evaluated using Kraken2, a k-mer-based taxonomic classifier for short DNA sequences. A cut-off of at least 40,000 reads assigned to <i>Rickettsia</i> taxa was applied for reporting potential infections (theoretical genome coverage of ~1–4X, assuming an average genome size of ~1.5 Mb). Because <i>Rickettsia</i>-infected protists have been reported previously, phyloFlash was also used to identify reads aligned to protists, so that potential positives attributable to protists rather than insects could be accounted for.
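The dataset selection rule and the read cut-off described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The `SraRun` record, the function names, the accessions and the read counts are hypothetical. The description does not state the read lengths; 50 bp and 150 bp are assumptions chosen because they roughly reproduce the stated ~1–4X range for a 1.5 Mb genome.

```python
from typing import Dict, Iterable, NamedTuple


class SraRun(NamedTuple):
    """Hypothetical record for one SRA dataset (not an SRA API type)."""
    accession: str
    species: str
    read_count: int


def largest_run_per_species(runs: Iterable[SraRun]) -> Dict[str, SraRun]:
    """Keep, for each species, the dataset with the largest read count."""
    best: Dict[str, SraRun] = {}
    for run in runs:
        current = best.get(run.species)
        if current is None or run.read_count > current.read_count:
            best[run.species] = run
    return best


def theoretical_coverage(n_reads: int, read_length_bp: int,
                         genome_size_bp: float = 1.5e6) -> float:
    """Coverage = (number of reads x read length) / genome size."""
    return n_reads * read_length_bp / genome_size_bp


def passes_cutoff(rickettsia_reads: int, min_reads: int = 40_000) -> bool:
    """Apply the 40,000-read reporting threshold from the description."""
    return rickettsia_reads >= min_reads


# Made-up example records; accessions and counts are placeholders.
example_runs = [
    SraRun("RUN_A", "Drosophila melanogaster", 12_000_000),
    SraRun("RUN_B", "Drosophila melanogaster", 48_000_000),
    SraRun("RUN_C", "Anopheles gambiae", 30_000_000),
]
selected = largest_run_per_species(example_runs)
print(selected["Drosophila melanogaster"].accession)  # RUN_B

# Coverage at the cut-off for two assumed read lengths.
print(f"{theoretical_coverage(40_000, 50):.2f}")   # 1.33
print(f"{theoretical_coverage(40_000, 150):.2f}")  # 4.00
```

Under these assumptions, 40,000 reads of 50 bp give about 1.3X coverage and 40,000 reads of 150 bp give 4X, which is consistent with the range quoted in the description.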
Provider:
figshare
Created:
2020-10-14