Novel Aggregate Deletion/Substitution/Addition Learning Algorithms for Recursive Partitioning
收藏DataCite Commons2020-09-02 更新2024-07-25 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Novel_aggregate_deletion_substitution_addition_learning_algorithms_for_recursive_partitioning/4892000
下载链接
链接失效反馈官方服务:
资源简介:
Many complex diseases are caused by a variety of both genetic and environmental factors acting in conjunction. To help understand these relationships, nonparametric methods that use aggregate learning have been developed such as <i>random forests</i> and <i>conditional forests</i>. Molinaro et al. (2010) described a powerful, single model approach called <i>partDSA</i> that has the advantage of producing interpretable models. We propose two extensions to the <i>partDSA</i> algorithm called <i>bagged partDSA</i> and <i>boosted partDSA</i>. These algorithms achieve higher prediction accuracies than individual <i>partDSA</i> objects through aggregating over a set of <i>partDSA</i> objects. Further, by using <i>partDSA</i> objects in the ensemble, each base learner creates decision rules using both “and” and “or” statements, which allows for natural logical constructs. We also provide four variable ranking techniques that aid in identifying the most important individual factors in the models. In the regression context, we compared bagged partDSA and boosted partDSA to random forests and conditional forests. Using simulated and real data, we found that bagged partDSA had lower prediction error than the other methods if the data were generated by a simple logic model, and that it performed similarly for other generating mechanisms. We also found that <i>boosted partDSA</i> was effective for a particularly complex case. Taken together these results suggest that the new methods are useful additions to the ensemble learning toolbox. We implement these algorithms as part of the partDSA R package. Supplementary materials for this article are available online.
提供机构:
Taylor & Francis
创建时间:
2017-04-19



