five

Bonn Dataset 1 of meta-analysis on AML classification, Affymetrix HG-U133 A

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE122505
下载链接
链接失效反馈
官方服务:
资源简介:
The present dataset ("dataset 1") is a subset of a large metastudy on AML classfication. In total, three datasets were generated, each containing data of a different platforms: dataset 1 (Affymetrix HG-U133 A microarrays), dataset 2 (Affymetrix HG-U133 2.0 microarrays) and dataset 3 (RNA-seq). Dataset 1 was generated using the following strategy: All data sets published in the National Center for Biotechnology Information Gene Expression Omnibus (GEO) on 20 September 2017 were reviewed for inclusion in the present study. Basic criteria for inclusion were the cell type under study (human peripheral blood mononuclear cells (PMBCs) and/or bone marrow samples) as well as the species (Homo sapiens). Furthermore, GEO SuperSeries were excluded to avoid duplicated samples. We filtered the datasets for data generated with the Affymetrix Human Genome U133 A Array (GLP96) and excluded studies with small sample sizes (< 50 samples). We then applied a disease-specific search, in which we filtered for acute myeloid leukemia, other leukemia and healthy or non-leukemia-related samples. The results of this search strategy were then internally reviewed and data were excluded based on the following criteria: (i) exclusion of duplicated samples, (ii) exclusion of studies that sorted single cell types (e.g. T cells or B cells) prior to gene expression profiling, (iii) exclusion of studies with inaccessible data. Other than that, no studies were excluded from our analysis. In total, the datasets contained samples from the following GSE Series: GSE10255, GSE1159, GSE12417, GSE12995, GSE13425, GSE14471, GSE14895, GSE16129, GSE25571, GSE26281, GSE33315, GSE34860, GSE37642, GSE43176, GSE4698, GSE51082, GSE6269, GSE67684, GSE83449, GSE8879, GSE9006, GSE9476. All CEL-files were downloaded from GEO and imported into R. Robist Multichip Average (RMA) expression measures were calculated using the R package affy. [Dataset_1_matrix.txt] RMA-normalized expression table of 22283 probes and 2379 samples
创建时间:
2020-01-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作