Discriminating somatic and germline mutations in tumour DNA samples without matching normals.. Virtual Normal Analysis
收藏NIAID Data Ecosystem2026-03-08 收录
下载链接:
https://www.ncbi.nlm.nih.gov/bioproject/PRJEB9673
下载链接
链接失效反馈官方服务:
资源简介:
Tumour analyses commonly employ a correction with a matched normal (MN), a sample from healthy tissue of the same individual, in order to distinguish germline mutations from somatic mutations. Since the majority of variants found in an individual are thought to be common within the population, we constructed a set of 931 samples from healthy, unrelated individuals, originating from two different sequencing platforms, to serve as a virtual normal (VN) in the absence of such an associated normal sample. Our approach removed over 96% of the germline variants also removed by the matched normal sample, and a large number (2-8%) of additional variants not corrected for by the associated normal. The combination of the VN with the matched normal improved the correction for polymorphisms significantly with up to ~30% as compared to matched normal and ~15% as compared to VN only. We determined the number unrelated genomes needed in order to correct at least as efficiently as the matched normal is ~200 for SVs and ~400 for small variants. In addition, we propose that the removal of common variants with purely position-based methods is inaccurate and incurs additional false positive variants. Our VN correction can be used on any list of variants, regardless of sequencing platform of origin, in which a confidence score is included with the list of somatic variants by considering the ratio of reference calls vs no-calls at the locus across the CG-sequenced VN samples. This VN methodology is available for use on our public Galaxy server (http://galaxy-demo.ctmm-trait.nl).
创建时间:
2015-07-11



