five

utility: Collection of Tumor-Infiltrating Lymphocyte Single-Cell Experiments with TCR

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/4995298
下载链接
链接失效反馈
官方服务:
资源简介:
Introduction The original intent of assembling a data set of publicly-available tumor-infiltrating T cells (TILs) with paired TCR sequencing was to expand and improve the scRepertoire R package. However, after some discussion, we decided to release the data set for everyone, a complete summary of the sequencing runs and the sample information can be found in the meta data of the Seurat object. This repository is the 4th version of the data, with addition of cells and changes to the workflow.  Methods Single-Cell Data Processing The filtered gene matrices output from Cell Ranger align function from individual sequencing runs (10x Genomics, Pleasanton, CA) loaded into the R global environment. For each sequencing run cell barcodes were appended to contain a unique prefix to prevent issues with duplicate barcodes. The results were then ported into individual Seurat objects (citation), where the cells with > 10% mitochondrial genes and/or 2.5x natural log distribution of counts were excluded for quality control purposes. At the individual sequencing run level, doublets were estimated using the scDblFinder (v1.4.0) R package. Annotation of Cells Automatic annotation was performed using the singler (v1.4.1) R package (citation) with the HPCA (citation) and Monaco (citation) data sets as references and the fine label discriminators. Individual sequencing runs were subsetted to run through the singleR algorithm in order to reduce memory demands. The output of all the singleR analyses were collated and appended to the meta data of the seurat object. Likewise, the ProjecTILs (v0.4.1) R Package (citation) was used for automatic annotation as a partially orthogonal approach.  Addition of TCR data The filtered contig annotation T cell receptor (TCR) data for available sequencing runs were loaded into the R global environment. Individual contigs were combined using the combineTCR() function of scRepertoire (v1.3.5) R Package (citation). Clonotypes were assigned to barcodes and were multiple duplicate chains for individual cells were filtered to select for the top expressing contig by read count. The clonotype data was then added to the Seurat Object with proportion across individual patients being used to calculate frequency. Citations As of right now, there is no citation associated with the assembled data set. However if using the data, please find the corresponding manuscript for each data set in the meta.data of the single-cell object. In addition, if using the processed data, feel free to modify the language in the methods section (above) and please cite the appropriate manuscripts of the software or references that were used. Itemized List of the Software Used Seurat v4.0.3 - citation harmony v1.0 - citation singler v1.4.1 - citation ProjecTILs v2.0.3 - citation UCell v1.0.0 - citation scRepertoire v1.3.5 - citation Itemized List of Reference Data Used Human Primary Cell Atlas (HPCA) - citation Monaco Data Set - citation Future Directions Data Hosting for Interactive Analysis Easy Submission Portal for Researchers to Add Data Using the Data to Build a Reference Atlas There are areas in which we are actively hoping to develop to further facilitate the usage of the data set - if you have other suggestions, please reach out using the contact information below. Contact Questions, comments, and suggestions, please feel free to contact Nick Borcherding via this repository, email, or using twitter.
创建时间:
2022-04-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作