Data set on Employing Machine Learning Approaches to Detect Lung Cancer via Analysis of Clinical and Genetic Data
收藏DataCite Commons2026-02-24 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=721cd32b3eb84cb8929976999b98bd03
下载链接
链接失效反馈官方服务:
资源简介:
Biopsy is still the standard method for genetically characterising a lung cancer tumour, it is offensive & aching for the patient. The growth of personalised medicine has led to a shift in therapeutic strategies from traditional chemotherapy and radiation to targeted genetic modification therapies. Nodule image characteristics collected from CT scans consume been utilised by ML algorithms to forecast the absence or presence of gene alterations in a straightforward, quick, and non-invasive way. Newer research suggests, however, that radiometric features from a wider region beyond the tumour may be much more useful in determining if lung cancer patients have a mutant. Consequently, these traits may greatly improve survival rates for people with lung cancer. To determine if there is a connection between picture phenotypes and the EGF receptor mutation status, this study used radiomic features acquired after the lung region around the node. Lung cancer is most often caused by mutations in this gene, and several therapies are focused on targeting it specifically. Several feature selection methods, a change of linear and nonlinear, & ensemble projecting models for classification were utilised to categorize the binary result of wild-type and distorted EGFR alteration position. As shown by the findings, a comprehensive strategy that encompassed the lung with the nodule inside an ROI may better gather pertinent data and accurately forecast the status of the EGFR mutation than local nodule analysis. The best-performing classifiers were determined using the Principal Component Evaluation feature selection method. These classifiers ranged from Linear Support Vector Machines, flexible the Internet, and Logistic Regression, and their Area Under the Curve (AUC) values ranged from 0.726 to 0.738. Applying a holistic approach, this technique demonstrates that further radio genomic studies should include data from broader lung regions, including the nodule, to provide a more complete characterisation of lung cancer.
提供机构:
Science Data Bank
创建时间:
2026-02-24



