Learning sparse log-ratios for high-throughput sequencing data

Name: Learning sparse log-ratios for high-throughput sequencing data
Creator: La Trobe University
License: 暂无描述

Research Data Australia2024-12-14 收录

下载链接：

https://researchdata.edu.au/learning-sparse-log-sequencing-data/1871211

下载链接

链接失效反馈

官方服务：

资源简介：

In the context of high-throughput sequencing (HTS) data, and compositional data (CoDa) more generally, an important class of biomarkers are the log-ratios between the input variables. However, identifying predictive log-ratio biomarkers from HTS data is a combinatorial optimization problem, which is computationally challenging. Existing methods are slow to run and scale poorly with the dimension of the input, which has limited their application to low- and moderate-dimensional metagenomic datasets. Building on recent advances from the field of deep learning, we develop CoDaCoRe, a novel learning algorithm that identifies sparse, interpretable, and predictive log-ratio biomarkers. Our algorithm exploits a continuous relaxation to approximate the underlying combinatorial optimization problem. This relaxation can then be optimized efficiently using the modern ML toolbox, in particular, gradient descent. As a result, CoDaCoRe runs several orders of magnitude faster than competing methods, all while achieving state-of-the-art performance in terms of predictive accuracy and sparsity. survey: https://www.surveymonkey.com/r/5H6CYPW

提供机构：

La Trobe University

5,000+

优质数据集

54 个

任务类型

进入经典数据集