five

Benchmarking for alignment and variant calling

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://www.omicsdi.org/dataset/ega/EGAS00001007819
下载链接
链接失效反馈
官方服务:
资源简介:
Background: Lung cancer is a heterogeneous disease and the primary cause of cancer-related mortality worldwide. Somatic mutations, including large structural variants, are important biomarkers in lung cancer for selecting targeted therapy. Genomic studies in lung cancer have been conducted using short-read sequencing. Emerging long-read sequencing technologies are a promising alternative to study somatic structural variants, however there is no current consensus on how to process data and call somatic events. In this study, we preformed whole genome sequencing of lung cancer and matched non-tumour samples using long and short read sequencing to comprehensively benchmark three sequence aligners and six structural variant callers comprised of four generic callers (Sniffles2, cuteSV, SVIM and DELLY in generic mode) and two somatic callers (nanomonsv and DELLY in somatic modes). Results: Different combinations of aligners and variant callers influenced somatic structural variant detection. The choice of caller had a significant influence on somatic structural variant detection in terms of variant type, size, sensitivity, and accuracy. The performance of each variant caller was assessed by comparing to somatic structural variants identified by short read sequencing. When compared to somatic structural variants detected with short read sequencing, more events were detected with long read sequencing. The mean recall of somatic variant events identified by short read sequencing was higher for the somatic callers (64%) than generic callers (52%), with nanomonsv (69%) showing the best performance. Conclusion: Long read sequencing can identify somatic structural variants. The longer reads have the potential to improve our understanding of cancer development and inform personalized cancer treatment.EGA study EGAS00001007819
创建时间:
2024-09-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作