five

Reuters & Bloomberg (R&B) dataset

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/WenchenLi/news-title-stock-prediction-pytorch
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了552,909篇新闻文章,总计覆盖了2,605天,其中有1,794个交易日,用于根据新闻文章预测股票价格变动。在创建样本的交易日集合时,该数据集剔除了模糊不清的样本,最终包含了32,204个清晰样本。数据集被随机划分为60%的训练集、20%的验证集以及20%的测试集。其规模涉及552,909篇文章和2,605天,任务是对股票价格变动进行分类。

This dataset includes 552,909 news articles spanning 2,605 days in total, among which 1,794 are trading days, and is utilized for stock price movement prediction based on news articles. When building the trading day sample set, ambiguous samples were removed, leaving 32,204 unambiguous valid samples in the final dataset. The dataset is randomly divided into three subsets: 60% for training, 20% for validation, and 20% for testing. With a scale involving 552,909 articles across 2,605 days, the core task of this dataset is to classify stock price movements.
提供机构:
Wenchen Li
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作