Signal recovery in single cell batch integration
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE280883
下载链接
链接失效反馈官方服务:
资源简介:
Data integration to align cells across batches has become a cornerstone of single cell data analysis, critically affecting downstream results. Yet, how much biological signal is erased during integration? Currently, there are no guidelines for when the biological differences between samples are separable from batch effects, and thus, data integration involves a lot of guesswork: Cells across batches should be aligned to be “appropriately” mixed, while preserving “main cell type clusters”. We show evidence that current paradigms for single cell data integration are unnecessarily aggressive, removing biologically meaningful variation and introducing distortion. To remedy this, we present a novel statistical model and computationally scalable algorithm, CellANOVA, that harnesses experimental design to explicitly recover biological signals that are erased during single cell data integration. CellANOVA utilizes a “pool-of-controls” design concept, applicable across diverse settings, to separate unwanted variation from biological variation of interest. When applied with existing integration methods, CellANOVA allows the recovery of subtle biological signals and corrects, to a large extent, the data distortion introduced by integration. Further, CellANOVA explicitly estimates cell- and gene-specific batch effect terms which can be used to identify the cell types and pathways exhibiting the largest batch variations, providing clarity as to which biological signals can be recovered. These concepts are illustrated on studies of diverse designs, where the biological signals that are recovered by CellANOVA are validated by orthogonal assays. In particular, we show that CellANOVA is effective in the challenging case of single-cell and single-nuclei data integration, where it recovered subtle biological signals are can be validated and replicated by external data. To determine the effects of standard radiation on small intestine injury and regeneration, we performed time-course scRNA-sequencing. Wildtype C57BL/6 mice were treated with sham or 14Gy single-fraction whole-abdomen standard irradiation. Small intestines were harvested at various timepoints post-treatment, separated into the epithelial and lamina propria fractions, and subsequently sequenced.
创建时间:
2024-11-08



