five

Statistics of the GEO Datasets.

收藏
Figshare2026-03-09 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/_p_Statistics_of_the_GEO_Datasets_p_/31589796
下载链接
链接失效反馈
官方服务:
资源简介:
Osteoarthritis (OA) is a chronic joint disorder characterized by pain, reduced mobility, and structural degeneration. Despite its complex etiology and multi-tissue involvement, the molecular mechanisms underlying OA remain poorly understood. This study aimed to identify tissue-specific diagnostic biomarkers using an integrative framework combining multiple machine learning (ML) algorithms and SHapley Additive exPlanations (SHAP). Gene expression profiles from cartilage, synovium, and peripheral blood were retrieved from the GEO database. DEGs were identified across tissues, followed by feature selection using Least Absolute Shrinkage and Selection Operator(LASSO), Support Vector Machine Recursive Feature Elimination (SVM-RFE), and Random Forest(RF). Functional enrichment, gene set variation analysis (GSVA), and immune infiltration analyses were conducted. 10 ML models were constructed to evaluate diagnostic performance. A total of 8, 28, and 61 DEGs were identified in cartilage, synovium, and blood, respectively. Enrichment analysis revealed the key roles in inflammatory signaling, metabolism, and immune pathways. Biomarkers identified included CSN1S1, ABCA6, RARRES1, NPTX2 (cartilage); SCRG1, CXCL2, PTGDS, CCL19, BGN, KLF9 (synovium); and GNL3L, C6orf111, NT5C3, ZNF148 (blood). Immune analysis indicated shifts in mast cells and CD8 + T cells in cartilage and dendritic cells in synovium, while no significant immune alterations were found in blood. Diagnostic models demonstrated strong performance, with AUCs of 0.839 (cartilage), 0.934 (synovium), and 0.892 (blood). SHAP analysis was employed to interpret each model by quantifying the contribution of individual genes to predict outcomes. In the optimal cartilage model, CSN1S1 and ABCA6 were the most influential features, with mean absolute SHAP values of 0.146 and 0.122, respectively. For synovium, SCRG1 (0.111) and CXCL2 (0.097) were top contributors, while in blood, GNL3L (0.148) and C6orf111 (0.143) showed the highest predictive importance. These results underscore the interpretability of the models and validate the functional relevance of selected biomarkers. Collectively, this study provides a robust ML-based framework for identifying and interpreting reliable OA biomarkers across multiple tissues, offering valuable insights into disease mechanisms and supporting the development of diagnostic tools.
创建时间:
2026-03-09
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作