five

A semiparametric kernel independence test with application to mutational signatures

收藏
DataCite Commons2024-02-23 更新2024-07-28 收录
下载链接:
https://tandf.figshare.com/articles/dataset/A_semiparametric_kernel_independence_test_with_application_to_mutational_signatures/13527394/1
下载链接
链接失效反馈
官方服务:
资源简介:
Cancers arise owing to somatic mutations, and the characteristic combinations of somatic mutations form mutational signatures. Despite many mutational signatures being identified, mutational processes underlying a number of mutational signatures remain unknown, which hinders the identification of interventions that may reduce somatic mutation burdens and prevent the development of cancer. We demonstrate that the unknown cause of a mutational signature can be inferred by the associated signatures with known etiology. However, existing association tests are not statistically powerful due to excess zeros in mutational signatures data. To address this limitation, we propose a semiparametric kernel independence test (SKIT). The SKIT statistic is defined as the integrated squared distance between mixed probability distributions and is decomposed into four disjoint components to pinpoint the source of dependency. We derive the asymptotic null distribution and prove the asymptotic convergence of power. Due to slow convergence to the asymptotic null distribution, a bootstrap method is employed to compute <i>p</i>-values. Simulation studies demonstrate that when zeros are prevalent, SKIT is more resilient to power loss than existing tests and robust to random errors. We applied SKIT to The Cancer Genome Atlas (TCGA) mutational signatures data for over 9,000 tumors across 32 cancer types, and identified a novel association between signature 17 curated in the Catalogue Of Somatic Mutations In Cancer (COSMIC) and apolipoprotein B mRNA editing enzyme (APOBEC) signatures in gastrointestinal cancers. It indicates that APOBEC activity is likely associated with the unknown cause of signature 17.
提供机构:
Taylor & Francis
创建时间:
2021-01-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作