five

STRING PPI network edges dataset.

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/STRING_PPI_network_edges_dataset_/30045193
下载链接
链接失效反馈
官方服务:
资源简介:
This study was designed to identify immune-related biomarkers associated with allergic rhinitis (AR) and construct a robust a diagnostic model. Two datasets (GSE5010 and GSE50223) were downloaded from the NCBI GEO database, containing 38 and 84 blood CD4 + T cell samples, respectively. To eliminate batch effects, the surrogate variable analysis (sva) R package (version 3.38.0) was employed, enabling the integration of data for subsequent analysis. Immune cell infiltration profiles were assessed using the Gene Set Variation Analysis (GSVA) R package (version 1.36.3). A gene co-expression network was constructed via the Weighted Gene Co-Expression Network Analysis (WGCNA) algorithm to identify disease-related modules. Differentially expressed genes (DEGs) were identified using the linear models for microarray data (limma) R package (version 3.34.7), followed by functional enrichment analysis using DAVID. Protein-protein interaction (PPI) networks were constructed based on the STRING database to highlight key genes. A diagnostic model was subsequently developed utilizing the Least Absolute Shrinkage and Selection Operator (LASSO) regression algorithm and Support Vector Machine (SVM) method, with its discriminative capacity assessed via Receiver Operating Characteristic (ROC) curves. A total of twenty-eight immune cell types were analyzed, revealing significant differences in eight types between the AR and control groups. Through WGCNA, three disease-related modules comprising 4278 candidate genes were identified. Differential expression analysis identified 326 significant DEGs, of which 257 overlapped with WGCNA-selected genes. These genes exhibited significant enrichment in immune-related pathways, including “cytokine-cytokine receptor interaction” and “chemokine signaling pathway.” Gene Set Enrichment Analysis (GSEA) further uncovered 12 KEGG pathways significantly associated with disease risk scores. Drug screening identified 24 small molecule drugs related to key genes. A diagnostic model incorporating five genes (RFC4, LYN, IL3, TNFRSF1B, and RBBP7) was constructed, demonstrating diagnostic efficiencies of 0.843 and 0.739 in the training and validation sets, respectively. An AR mouse model was successfully established, and the expression levels of relevant genes were validated through RT-qPCR experiments. The five-gene diagnostic model established in this study exhibits strong predictive ability in distinguishing AR patients from healthy controls, with potential clinical applications in diagnosing AR and advancing novel diagnostic and therapeutic strategies.
创建时间:
2025-09-03
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作