five

Supporting data for "SimFFPE and FilterFFPE: improving structural variant calling in FFPE samples"

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/100924
下载链接
链接失效反馈
官方服务:
资源简介:
Artifact chimeric reads are enriched in next-generation sequencing data generated from formalin-fixed paraffin-embedded (FFPE) samples. Previous work indicated that these reads are characterized by erroneous split-read support that is interpreted as evidence of structural variants. Thus, a large number of false positive structural variants is detected. To our knowledge, no tool is currently available to specifically call or filter structural variants in FFPE samples. To overcome this current gap, we developed two R packages: SimFFPE and FilterFFPE. <br>SimFFPE is a read simulator, specifically designed for next-generation sequencing data from FFPE samples. A mixture of characteristic artifact chimeric reads as well as normal reads is generated. FilterFFPE is a filtration algorithm, removing artifact chimeric reads from sequencing data, while keeping real chimeric reads. To evaluate the performance of FilterFFPE, we performed structural variant calling with three common tools (Delly, Lumpy and Manta) with and without prior filtration with FilterFFPE. After applying FilterFFPE, the average positive predictive value improved from 0.27 to 0.48 in simulated samples, and from 0.11 to 0.27 in real samples, while sensitivity remained basically unchanged or even slightly increased. <br>FilterFFPE improves the performance of SV calling in FFPE samples. It was validated by analysis of simulated and real data.
提供机构:
GigaScience Database
创建时间:
2021-08-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作