Open-Web-Math-Pro
收藏arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/gair-prox/open-web-math-pro
下载链接
链接失效反馈官方服务:
资源简介:
该数据集主要用于在持续预训练过程中进行重放,以防止模型退化。它专注于数学相关内容,旨在提升模型在数学领域的处理能力。具体任务是为持续预训练提供重放数据。
This dataset is primarily used for replay during continual pre-training to prevent model degradation. It focuses on mathematical content, aiming to enhance the model's mathematical processing capabilities. Its specific task is to provide replay data for continual pre-training.
提供机构:
Hugging Face



