five

Host whole genome sequence data represent an untapped resource for characterising affiliated parasite diversity

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/SRP668088
下载链接
链接失效反馈
官方服务:
资源简介:
Parasites are ubiquitous and exert varied ecological and evolutionary pressures on their hosts. Yet, characterising parasite diversity and distributions can be challenging and costly. Leveraging existing data to identify parasites is thus an attractive alternative. High-throughput sequencing (HTS) can generate whole genome sequence (WGS) data which are increasingly freely available in public repositories and represent an untapped resource for characterising parasites affiliated with hosts. In this study, we examine WGS data generated for the silvereye (Zosterops lateralis), to identify endogenous eukaryotic parasites that were inadvertently captured during host sequencing. We compared detection of parasite genera by this approach with detection via 18S metabarcoding. Mining WGS data for parasite DNA revealed the broadest range of genera. Results were verified by traditional microscopy of blood slides and conducting a targeted multiplex Polymerase Chain Reaction (PCR) for haemosporidian parasites. Detection of haemosporidians was largely consistent across microscopy, multiplex PCR and WGS data while 18S metabarcoding entirely failed to detect this group of parasites. Our results demonstrate that existing WGS datasets can be used to estimate endoparasite diversity and provide greater insights on diversity than metabarcoding whilst also avoiding the costs and challenges of direct sampling. We provide a framework outlining opportunities and constraints to consider when mining WGS data to identify parasite sequences. The framework particularly stresses the influences of sequencing depth, database completeness, and methodological biases. Our findings demonstrate how repurposing existing WGS data can provide a cost-effective and informative means of unravelling complex host-parasite interactions in future disease ecology studies.
创建时间:
2026-01-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作