
VerySNP: VCF feature based SVM to reduce false positive rate in SNP predictor output.

Source: NIAID Data Ecosystem (indexed 2026-03-08)
Download link:
https://www.ncbi.nlm.nih.gov/bioproject/PRJEB6378
Description:
Several open-source tools have recently been developed to identify Single Nucleotide Polymorphisms (SNPs) in whole-genome data, the most popular being SAMtools and GATK. SNP predictors commonly produce a VCF file as output, which contains a list of candidate SNPs and additional information such as SNP call quality and depth of coverage. Still, the accuracy of this SNP list is unsatisfactory because of the high rate of false positive polymorphism predictions. VCF parameters have been used to train a Support Vector Machine (SVM) that classifies the VCF SNP list into true and false positive SNPs, cleaning the SNP predictor output of the most likely false positive results. We implemented the SVM approach in a new software tool, called VerySNP, and applied it to model and non-model organisms, showing that in both cases this machine learning method efficiently distinguishes true positive from false positive SNPs.
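To make the idea of "VCF parameters" as SVM input concrete, the following is a minimal sketch of how call quality and depth of coverage could be pulled out of VCF records as a numeric feature vector. It is not VerySNP's implementation: the function names `parse_info` and `vcf_line_to_features`, the choice of just two features (QUAL and the INFO key `DP`), and the example records are all assumptions made for illustration.

```python
from typing import Dict, List, Optional


def parse_info(info: str) -> Dict[str, str]:
    """Split a VCF INFO column such as 'DP=14;MQ=60' into a dict.

    Flag-style entries without '=' map to an empty string.
    """
    fields: Dict[str, str] = {}
    for item in info.split(";"):
        if not item or item == ".":
            continue
        key, _, value = item.partition("=")
        fields[key] = value
    return fields


def vcf_line_to_features(line: str) -> Optional[List[float]]:
    """Return [QUAL, DP] for one VCF data line.

    Returns None for header lines, blank lines, and records that
    lack a QUAL value or a DP entry (hypothetical policy).
    """
    if line.startswith("#") or not line.strip():
        return None
    cols = line.rstrip("\n").split("\t")
    if len(cols) < 8:  # CHROM POS ID REF ALT QUAL FILTER INFO
        return None
    qual, info = cols[5], parse_info(cols[7])
    if qual == "." or "DP" not in info:
        return None
    return [float(qual), float(info["DP"])]


# Made-up records, for illustration only.
example_lines = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "chr1\t1042\t.\tA\tG\t221.5\t.\tDP=38;MQ=60",
    "chr1\t2310\t.\tC\tT\t12.3\t.\tDP=4;MQ=21",
]
features = [f for f in map(vcf_line_to_features, example_lines) if f is not None]
print(features)  # [[221.5, 38.0], [12.3, 4.0]]
```

Feature vectors of this shape, paired with labels for known true and false positive SNPs, are the kind of input an SVM library would be trained on. Which VCF fields VerySNP actually uses is not stated in this description.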
Created:
2014-07-22