five

The effect of background noise and its removal on the analysis of single-cell expression data

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/SRP410182
下载链接
链接失效反馈
官方服务:
资源简介:
BACKGROUND: In droplet-based single-cell and single-nucleus RNA-seq experiments, not all reads associated with one cell barcode originate from the encapsulated cell. Such background noise is attributed to spillage from cell-free ambient RNA or barcode swappingevents. Here, we characterize this background noise exemplified by three single-cell RNA-seq(scRNA-seq) and two single-nucleus RNA-seq (snRNA-seq) replicates of mouse kidney cells. For each experiment, kidney cells from two mouse subspecies were pooled, allowing to identify cross-genotype contaminating molecules and estimate the levels of background noise. RESULTS: We find that background noise is highly variable across replicates and individual cells, making up on average 3-35% of the total counts (UMIs) per cell and show that this has a considerable impact on the specificity and detectability of marker genes. In search of the source of the background noise, we find that expression profiles of cell-free droplets are very similar to expression profiles of cross-genotype contamination profiles and hence that the majority of background molecules originates from ambient RNA. Finally, we use our genotype-based estimates to evaluate the performance of three methods (CellBender, DecontX, SoupX) that are designed to quantify and remove background noise. We find that CellBender provides the most precise estimates of background noise levels and also yields the highest improvement for marker gene detection. By contrast, clustering and classification of cells are fairly robust towards background noise and only small improvements can be achieved by background removal that may come at the cost of distortions in fine structure. CONCLUSION: Our findings help to better understand the extent, sources and impact of background noise in single-cell experiments and provide guidance on how to deal with it. Overall design: Three scRNA-seq (rep1-3) and two snRNA-seq (nuc2, nuc3) replicates were generated from mouse kidney cells. In each experiment, cells from dissociated kidneys of three mice from different strains (C57BL/6, CAST/EiJ and 129S1/SvImJ) were pooled. The replicates rep2 & nuc2 and rep3 & nuc3 were generated from the same set of mice each.
创建时间:
2023-09-09
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作