Predictive Models on the 2013 NCDB Colon Cancer Data
Source: Mendeley Data (indexed 2026-04-18)
Download link:
https://elsevier.digitalcommonsdata.com/datasets/jg44fgspzk
Description:
The attached file contains R code that covers the full workflow: loading and cleaning the data, selecting variables, imputing missing values, creating training and test sets, and building and evaluating models. It also includes the code used to produce the graphs and tables for data and model evaluation.
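The imputation and train/test steps can be illustrated with a minimal base R sketch. It does not reproduce the attached code: the data are synthetic, the column names (`age`, `tumor_size`, `outcome`) are hypothetical, and median imputation with a 70/30 split is an assumption, since the description does not say which methods were used. The sketch splits before imputing so that the imputation statistic comes from the training rows only; the attached code may order these steps differently.

```r
# Synthetic stand-in for a patient-level table; NOT the NCDB data.
set.seed(42)
n <- 200
patients <- data.frame(
  age        = round(rnorm(n, mean = 65, sd = 10)),
  tumor_size = round(rlnorm(n, meanlog = 3.5, sdlog = 0.4)),
  outcome    = rbinom(n, size = 1, prob = 0.2)
)

# Blank out 20 values to mimic missing entries.
patients$tumor_size[sample(n, 20)] <- NA

# 70/30 split (assumed proportion).
train_idx <- sample(n, size = round(0.7 * n))
train <- patients[train_idx, ]
test  <- patients[-train_idx, ]

# Median imputation (assumed method): the training-set median fills
# the gaps in both sets, so no test-set information is used.
size_median <- median(train$tumor_size, na.rm = TRUE)
train$tumor_size[is.na(train$tumor_size)] <- size_median
test$tumor_size[is.na(test$tumor_size)]   <- size_median
```

After this runs, `train` and `test` have no missing `tumor_size` values and together contain every row of `patients` exactly once.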
The goal was to build a logistic regression model to predict outcomes after surgery for colon cancer and to compare its performance with machine learning algorithms. An XGBoost model, a Random Forest model, and an XGBoost model trained on data oversampled with SMOTE were built and compared with logistic regression. Overall, the machine learning algorithms achieved a higher AUC than logistic regression.
Created:
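As a rough illustration of the AUC comparison, the base R sketch below fits a logistic regression and scores it on a held-out set. It is not the attached code: the data are simulated, the predictors `age` and `stage` and the coefficients used to generate `event` are hypothetical, and `auc` is a helper written for this sketch using the rank-based (Mann-Whitney) formula. Predicted probabilities from an XGBoost or Random Forest model could be passed to the same helper to compare models on equal terms.

```r
# Simulated data with a known signal; NOT the NCDB data.
set.seed(1)
n <- 1000
dat <- data.frame(
  age   = rnorm(n, mean = 65, sd = 10),
  stage = sample(1:4, n, replace = TRUE)
)
dat$event <- rbinom(n, 1, plogis(-6 + 0.05 * dat$age + 0.6 * dat$stage))

# Hold out 30% of rows for evaluation.
train_idx <- sample(n, size = 700)
train <- dat[train_idx, ]
test  <- dat[-train_idx, ]

# Rank-based AUC: the probability that a randomly chosen positive case
# scores higher than a randomly chosen negative case (ties count half).
auc <- function(scores, labels) {
  r <- rank(scores)                      # average ranks for ties
  n_pos <- sum(labels == 1)
  n_neg <- sum(labels == 0)
  (sum(r[labels == 1]) - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
}

# Logistic regression baseline.
fit <- glm(event ~ age + stage, data = train, family = binomial)
p_logit <- predict(fit, newdata = test, type = "response")
auc_logit <- auc(p_logit, test$event)
print(auc_logit)
```

Scoring every model with one shared function on the same test set keeps the comparison fair; for a model trained on SMOTE-oversampled data, the test set should stay untouched by oversampling so that the AUC reflects the original class balance.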
2021-05-04



