Detecting Fusion Genes in Long-Read Transcriptome Sequencing Data with FUGAREC
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1030953
Description:
Fusion genes are important targets and biomarkers for cancer therapy, and clinical practice needs methods that detect them accurately. RNA-seq is widely used to detect actively expressed fusion genes. Because long-read RNA-seq can sequence full-length mRNA, it is expected to detect fusion genes that short-read RNA-seq cannot. However, long-read RNA-seq has a high base-calling error rate, and gap sequences (portions of a long read that are not aligned to the genome) may occur near breakpoints. When gap sequences occur, existing methods cannot identify the correct fusion gene or breakpoint. To address these challenges, we introduce a novel algorithm, FUGAREC (fusion detection with gap re-alignment and breakpoint clustering). FUGAREC uniquely combines gap sequence re-alignment with breakpoint clustering. This approach not only enables the detection of previously undetectable fusion genes but also significantly reduces false positives. We demonstrate that FUGAREC achieves high fusion gene detection performance on both simulated data and sequencing data from a breast cancer cell line.
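The description names breakpoint clustering but does not say how it is done. The following is only a minimal sketch of the general idea, not FUGAREC's actual implementation: per-read breakpoint calls for the same gene pair are sorted and greedily merged when consecutive calls lie within a tolerance, and clusters with too few supporting reads are dropped. The function name, the tuple layout, and the `tolerance` and `min_support` parameters are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import median_low


def cluster_breakpoints(calls, tolerance=10, min_support=2):
    """Greedily cluster per-read breakpoint calls (illustrative sketch only).

    calls: iterable of (gene_a, gene_b, pos_a, pos_b), one tuple per read.
    tolerance: hypothetical max distance (bp) between consecutive sorted
        calls, on both genes, for them to share a cluster.
    min_support: hypothetical minimum number of reads per reported cluster.
    Returns a list of (gene_a, gene_b, pos_a, pos_b, support) tuples, where
    the positions are the low medians of the clustered calls.
    """
    # Group calls by gene pair so unrelated fusions never merge.
    by_pair = defaultdict(list)
    for gene_a, gene_b, pos_a, pos_b in calls:
        by_pair[(gene_a, gene_b)].append((pos_a, pos_b))

    results = []
    for (gene_a, gene_b), points in by_pair.items():
        points.sort()
        clusters = [[points[0]]]
        for point in points[1:]:
            prev = clusters[-1][-1]
            close = (abs(point[0] - prev[0]) <= tolerance
                     and abs(point[1] - prev[1]) <= tolerance)
            if close:
                clusters[-1].append(point)   # extend the current cluster
            else:
                clusters.append([point])     # start a new cluster
        for cluster in clusters:
            if len(cluster) >= min_support:  # drop weakly supported calls
                results.append((
                    gene_a,
                    gene_b,
                    median_low(p[0] for p in cluster),
                    median_low(p[1] for p in cluster),
                    len(cluster),
                ))
    return results


# Made-up calls with placeholder gene names, for illustration only.
example_calls = [
    ("GENE_A", "GENE_B", 1000, 5000),
    ("GENE_A", "GENE_B", 1004, 5003),
    ("GENE_A", "GENE_B", 998, 4999),
    ("GENE_C", "GENE_D", 200, 900),
]
example_clusters = cluster_breakpoints(example_calls)
```

In this sketch the single-read GENE_C/GENE_D call is filtered out by `min_support`, which shows one way that requiring agreement across reads can suppress spurious calls caused by base-calling errors.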
Created:
2023-10-22



