five

Dataset of the paper “Variable selection for linear regression in large databases: exact methods” Applied Intelligence, 51(6), 3736-3756

收藏
DataCite Commons2025-05-09 更新2025-05-18 收录
下载链接:
http://hdl.handle.net/10259/9825
下载链接
链接失效反馈
官方服务:
资源简介:
The variable selection problem in the context of Linear Regression for large databases is analysed. The problem consists in selecting a small subset of independent variables that can perform the prediction task optimally. This problem has a wide range of applications. One important type of application is the design of composite indicators in various areas (sociology and economics, for example). Other important applications of variable selection in linear regression can be found in fields such as chemometrics, genetics, and climate prediction, among many others. For this problem, we propose a Branch & Bound method. This is an exact method and therefore guarantees optimal solutions. We also provide strategies that enable this method to be applied in very large databases (with hundreds of thousands of cases) in a moderate computation time. A series of computational experiments shows that our method performs well compared with well-known methods in the literature and with commercial software.
提供机构:
Universidad de Burgos
创建时间:
2025-04-04
二维码
社区交流群
二维码
科研交流群
商业服务