five

Discriminating somatic and germline mutations in tumour DNA samples without matching normals.. Virtual Normal Analysis

收藏
NIAID Data Ecosystem2026-03-08 收录
下载链接:
https://www.ncbi.nlm.nih.gov/bioproject/PRJEB9673
下载链接
链接失效反馈
官方服务:
资源简介:
Tumour analyses commonly employ a correction with a matched normal (MN), a sample from healthy tissue of the same individual, in order to distinguish germline mutations from somatic mutations. Since the majority of variants found in an individual are thought to be common within the population, we constructed a set of 931 samples from healthy, unrelated individuals, originating from two different sequencing platforms, to serve as a virtual normal (VN) in the absence of such an associated normal sample. Our approach removed over 96% of the germline variants also removed by the matched normal sample, and a large number (2-8%) of additional variants not corrected for by the associated normal. The combination of the VN with the matched normal improved the correction for polymorphisms significantly with up to ~30% as compared to matched normal and ~15% as compared to VN only. We determined the number unrelated genomes needed in order to correct at least as efficiently as the matched normal is ~200 for SVs and ~400 for small variants. In addition, we propose that the removal of common variants with purely position-based methods is inaccurate and incurs additional false positive variants. Our VN correction can be used on any list of variants, regardless of sequencing platform of origin, in which a confidence score is included with the list of somatic variants by considering the ratio of reference calls vs no-calls at the locus across the CG-sequenced VN samples. This VN methodology is available for use on our public Galaxy server (http://galaxy-demo.ctmm-trait.nl).
创建时间:
2015-07-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作