isoMiGA: Isoform and gene-level counts and TPM in short-read human microglia
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/8291210
Resource description:
Repository: https://github.com/RajLabMSSM/isoMiGA
Isoform quantification was performed with Salmon on four datasets from human microglia.
Files are in the naming format {cohort}_{quantification}_{reference}.tsv.gz: 4 cohorts × 4 quantifications × 2 references = 32 files.
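The naming scheme can be illustrated with a minimal sketch that enumerates the expected file names and splits a name back into its parts. The identifiers are taken from the lists in this description; their exact spelling and case in the real file names is an assumption, and `expected_filenames` and `parse_filename` are hypothetical helper names, not part of the dataset.

```python
from itertools import product

# Identifiers as listed in this description. Their exact spelling and
# case in the real file names is an assumption.
COHORTS = ["Raj", "Roussos", "Gaffney", "ipsc"]
QUANTIFICATIONS = ["transcript_tpm", "transcript_counts", "gene_tpm", "gene_counts"]
REFERENCES = ["gencode", "union"]
SUFFIX = ".tsv.gz"


def expected_filenames():
    """Build every {cohort}_{quantification}_{reference}.tsv.gz combination."""
    return [
        f"{cohort}_{quant}_{ref}{SUFFIX}"
        for cohort, quant, ref in product(COHORTS, QUANTIFICATIONS, REFERENCES)
    ]


def parse_filename(name):
    """Split a file name into (cohort, quantification, reference).

    The quantification itself contains an underscore, so the cohort is
    taken from the left and the reference from the right.
    """
    if not name.endswith(SUFFIX):
        raise ValueError(f"unexpected file name: {name}")
    stem = name[: -len(SUFFIX)]
    cohort, rest = stem.split("_", 1)
    quantification, reference = rest.rsplit("_", 1)
    return cohort, quantification, reference


names = expected_filenames()
print(len(names))
print(parse_filename(names[0]))
```

Comparing this list against the files actually downloaded is a quick way to spot a missing or misnamed file.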
Cohorts:
1. Raj
350 samples from multiple brain regions. Publication PMID: 34992268
Raw data: https://dss.niagads.org/datasets/ng00105/
2. Roussos
192 samples.
Publication PMID: 35931864.
Raw data: https://doi.org/10.7303/syn26207321 https://www.synapse.org/#!Synapse:syn52052829
3. Gaffney
118 samples.
Raw data: https://ega-archive.org/datasets/EGAD00001005736
4. ipsc
iPSC-derived human microglia-like cells (iMGLs)
27 samples from 3 stimulation conditions.
Citation: Navarro et al., LRRK2 G2019S variant triggers transcriptional changes in Parkinson’s disease human myeloid cells under pro-inflammatory environment, in preparation
Raw data: Gene Expression Omnibus GSE240907
Quantification types:
transcript_tpm: transcript-level quantification, TPM normalization
transcript_counts: transcript-level quantification, estimated counts
gene_tpm: gene-level quantification, TPM normalization
gene_counts: gene-level quantification, estimated counts
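To make the relationship between estimated counts and TPM concrete, here is a minimal sketch of the standard TPM definition: each count is divided by the feature's effective length, and the resulting rates are rescaled to sum to one million within a sample. This shows the general formula only. It is not a statement of how the files in this record were produced, and `tpm_from_counts` is a hypothetical helper name.

```python
def tpm_from_counts(counts, effective_lengths):
    """Convert one sample's estimated counts to TPM.

    counts            -- estimated read counts, one per feature
    effective_lengths -- effective length of each feature, same order
    """
    # Reads per base of effective length for every feature.
    rates = [count / length for count, length in zip(counts, effective_lengths)]
    total = sum(rates)
    if total == 0:
        # No signal in this sample: avoid dividing by zero.
        return [0.0 for _ in rates]
    # Rescale so the values sum to one million.
    return [rate / total * 1e6 for rate in rates]


# Toy values for illustration only, not taken from the dataset.
example = tpm_from_counts([10, 20, 30], [1000, 2000, 3000])
print(example)
```

Because TPM values sum to the same total in every sample, they suit within-sample comparisons of relative abundance, whereas the estimated counts are what count-based differential expression tools generally expect as input.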
References:
See https://zenodo.org/record/8290956 for details
gencode: all transcripts in GENCODE v38: https://www.gencodegenes.org/human/release_38.html
union: combination of all GENCODE v38 transcripts with 35,879 novel transcripts found in PacBio long-read RNA-seq
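As a rough sketch of how one of the `.tsv.gz` files might be read with only the Python standard library: the layout assumed here (a header row whose first field labels the feature ID column, followed by one column per sample) is not documented in this description and should be checked against the actual files. `read_matrix` is a hypothetical helper name.

```python
import csv
import gzip


def read_matrix(path):
    """Read a gzipped, tab-separated matrix into (sample_names, values).

    ASSUMED layout (not confirmed by the dataset description):
      - the first row is a header; its first field labels the ID column
      - every later row is a feature ID followed by one number per sample
    """
    values = {}
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t")
        header = next(reader)
        samples = header[1:]
        for row in reader:
            if not row:
                continue  # skip blank lines
            values[row[0]] = [float(x) for x in row[1:]]
    return samples, values
```

If the header turns out to have one field fewer than the data rows (as with some R exports), `samples` should be taken as the whole header rather than `header[1:]`. Under the same layout assumption, comparing the feature IDs of a `gencode` file and the matching `union` file for one cohort would show which features come from the novel long-read transcripts.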
Created: 2025-04-12



