five

ProjecTILs murine reference atlas of virus-specific CD8 T cells, version 2

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://figshare.com/articles/dataset/ProjecTILs_murine_reference_atlas_of_virus-specific_CD8_T_cells_version_2/23764572
下载链接
链接失效反馈
官方服务:
资源简介:
We have developed ProjecTILs, a computational approach to project  new data sets into a reference map of T cells, enabling their direct  comparison in a stable, annotated system of coordinates. Because new  cells are embedded in the same space of the reference, ProjecTILs  enables the classification of query cells into annotated, discrete  states, but also over a continuous space of intermediate states. By  comparing multiple samples over the same map, and across alternative  embeddings, the method allows exploring the effect of cellular  perturbations (e.g. as the result of therapy or genetic engineering) and  identifying genetic programs significantly altered in the query  compared to a control set or to the reference map. We  illustrate the projection of several data sets from recent publications  over two cross-study murine T cell reference atlases: the first  describing tumor-infiltrating T lymphocytes (TILs), the second  characterizing acute and chronic viral infection. Single-cell  data to build the virus-specific CD8 T cell reference map were  downloaded from GEO under the following entries: GSE131535, GSE134139  and GSE119943, selecting only samples in wild type conditions. Data for  the Ptpn2-KO, Tox-KO and CD4-depletion projections were obtained from  entries GSE134139, GSE119943, and GSE137007 and were not included in the  construction of the reference map. To  construct the LCMV reference map, we split the dataset into five batches  that displayed strong batch effects, and applied STACAS  (https://github.com/carmonalab/STACAS) to mitigate its confounding  effects. We computed 800 variable genes per batch, excluding cell  cycling genes, ribosomal and mitochondrial genes, and computed pairwise  anchors using 200 integration genes, and otherwise default STACAS  parameters. Anchors were filtered at the default threshold 0.8  percentile, and integration was performed with the IntegrateData Seurat3  function with the guide tree suggested by STACAS. Next,  we performed unsupervised clustering of the integrated cell embeddings  using the Shared Nearest Neighbor (SNN) clustering method implemented in  Seurat 3 with parameters {resolution=0.4, reduction=”pca”, k.param=20}.  We then manually annotated individual clusters (merging clusters when  necessary) based on several criteria: i) average expression of key  marker genes in individual clusters; ii) gradients of gene expression  over the UMAP representation of the reference map; iii) gene-set  enrichment analysis to determine over- and under- expressed genes per  cluster using MAST. In order to have access to predictive methods for  UMAP, we recomputed PCA and UMAP embeddings independently of Seurat3  using respectively the prcomp function from basic R package “stats”, and  the “umap” R package (https://github.com/tkonopka/umap).
创建时间:
2023-07-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作