five

Supporting data for "Evaluation of computational genotyping of Structural Variations for clinical diagnoses"

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/100641
下载链接
链接失效反馈
官方服务:
资源简介:
In recent years, Structural Variation (SV) has been identified as having a pivotal role in causing genetic disease. The discovery of SVs based on short DNA sequence reads from next-generation DNA sequence methods is error-prone, suffering from low sensitivity and high false discovery. These shortcomings can be partially overcome with extensive orthogonal validation methods, or use of long reads, but currently the cost of either precludes their application for routine clinical diagnostics. In contrast, SV genotyping of known sites of SV occurrence is relatively robust. Structural Variant genotyping therefore offers a cost-effective clinical diagnostic tool, with potentially few false positives and low occurrence of false negatives, even when applied to short-read DNA sequence data. We assess five state- of-the- art SV genotyping software methods, applied to short read sequence data. The methods are characterized based on their ability to genotype different SV types, spanning different size ranges. Furthermore, we analyze their ability to parse different VCF file sub-formats and assess their reliance on specific metadata. We compare the SV genotyping methods across a range of simulated and real data including SVs that were not found with Illumina data alone. We assess sensitivity and the ability to filter initial false discovery calls. Our results indicate that, although SV genotyping software methods have superior performance to SV callers, there are limitations that suggest the need for further innovation.
提供机构:
GigaScience Database
创建时间:
2019-08-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作