Data after collection, but prior to pre-processing in R code

Name: Data after collection, but prior to pre-processing in R code
Creator: Rojen Erik Sürek
License: 暂无描述

IEEE2026-04-17 收录

下载链接：

https://ieee-dataport.org/documents/data-after-collection-prior-pre-processing-r-code

下载链接

链接失效反馈

官方服务：

资源简介：

This data was used for examining various machine learning methods for predicting S&P 500 returns using monthly data from November 1987 to February 2022, utilizing 11 predictors after principal component analysis. The research addresses the literature gap of applying state-of-the-art machine learning algorithms, specifically Diverse Representation Canonical Interval Forest (DrCIF), ARSENAL, and HIVE-COTE 2.0 (HC2), which have demonstrated competitive predictability in various domains but have not been explored for financial forecasting. Additionally, this study includes long-term geometric cumulative total returns in the evaluation of machine learning strategies, filling another gap in the literature. The strategies were evaluated based on out-of-sample test simulation in terms of predictability, risk-adjusted excess returns, and terminal portfolio value. The results indicated that DrCIF and Long Short-Term Memory (LSTM) provide greatest added economic value compared to passive index-following strategies, traditional logistic regression, and other common machine learning algorithms. The practical implication is that industry practitioners can increase their cumulative and risk-adjusted excess returns relative to the S&P 500 by implementing defensive market timing adjustments of their equity weighting, guided by these predictive machine learning models.

提供机构：

Rojen Erik Sürek

5,000+

优质数据集

54 个

任务类型

进入经典数据集