BAAI/OpenSeek-Synthetic-Reasoning-Data-Examples
Source: Hugging Face. Updated: 2025-03-11. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/BAAI/OpenSeek-Synthetic-Reasoning-Data-Examples
Overview:
The OpenSeek-Reasoning-Data dataset collects reasoning data from the math, code, and general-knowledge domains. It consists of reasoning processes synthesized from massive raw corpora and is intended to activate the reasoning ability of large language models (LLMs) during the pre-training stage. The dataset is in English, is licensed under cc-by-sa-4.0, and falls within the 10K<n<100K size category.
Provided by:
BAAI
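
For readers who want to fetch the data programmatically instead of through the download link, the following is a minimal sketch, not an official recipe. It assumes the Hugging Face `datasets` library is installed and that the mirror host honours the standard `HF_ENDPOINT` override. The helper name `load_examples` is hypothetical, and the `"train"` split is an assumption: the listing does not state the dataset's splits or configurations, so a configuration name may also be required.

```python
import os

# Assumption: route Hub traffic through the mirror host from the download link.
# HF_ENDPOINT must be set before the Hugging Face libraries are imported.
os.environ.setdefault("HF_ENDPOINT", "https://hf-mirror.com")

# Repository id taken from the download link in this listing.
REPO_ID = "BAAI/OpenSeek-Synthetic-Reasoning-Data-Examples"


def load_examples(split="train"):
    """Hypothetical helper: download one split of the dataset.

    The split name is an assumption; check the dataset card for the
    actual splits and configuration names.
    """
    # Imported lazily so the endpoint override above is already in place.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)
```

Calling `load_examples()` would then return a `datasets.Dataset` whose rows can be inspected with ordinary indexing, provided the assumed split exists.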



