five

VerySNP: VCF feature based SVM to reduce false positive rate in SNP predictor output

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/ERP005900
下载链接
链接失效反馈
官方服务:
资源简介:
Several open-source tools have been recently developed to identify Single Nucleotide Polymorphisms (SNPs) in whole-genome data, the most popular being SAMtools and GATK. Commonly, SNP predictors provide a VCF file as output, which contains a list of candidate SNPs and additional information such as SNP call quality and depth of coverage. Still, the SNP list presents an unsatisfactory accuracy due to high false positive polymorphism prediction. VCF parameters have been used to train a Support Vector Machine (SVM) that classifies the VCF SNP list in true and false positive SNPs, cleaning the SNP predictor output from the most likely false positive results. We implemented the SVM approach in a new software, called VerySNP, and applied it to model and non-model organisms proving, in both cases, this machine learning method efficiently recognizes true positive from false positive SNPs.
创建时间:
2021-02-04
二维码
社区交流群
二维码
科研交流群
商业服务