
mssfj/openmathinstruct-2_formatted

Source: Hugging Face. Updated 2025-12-12. Indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/mssfj/openmathinstruct-2_formatted
Description:
OpenMathInstruct-2 (formatted) is an instruction-tuning math corpus derived from OpenMathInstruct-2 and converted into a simple question–answer format with explicit chain-of-thought traces. The dataset contains 13,972,791 examples (about 23 GB on disk) covering grade-school to advanced math reasoning, algebra, geometry, and word problems. Each entry has a natural-language `question`, an `answer` that starts with a `<think>...</think>` reasoning section followed by `Final Answer:<value>`, and a categorical `category` tag. The dataset is packaged as a Parquet shard and a JSONL dump for efficient streaming.
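
The record layout described above can be illustrated with a minimal Python sketch that splits an `answer` string into its reasoning trace and final value. It assumes each JSONL line is a JSON object with exactly the `question`, `answer`, and `category` fields named in the description. The helper names (`split_answer`, `parse_jsonl_line`) and the sample record are hypothetical and are not taken from the dataset.

```python
import json
import re

# Assumed layout, taken from the description: the answer starts with a
# <think>...</think> block, followed by "Final Answer:<value>".
ANSWER_RE = re.compile(
    r"^\s*<think>(?P<reasoning>.*?)</think>\s*Final Answer:\s*(?P<final>.*?)\s*$",
    re.DOTALL,
)


def split_answer(answer: str) -> tuple[str, str]:
    """Return (reasoning, final_answer) from one formatted answer string."""
    match = ANSWER_RE.match(answer)
    if match is None:
        raise ValueError("answer does not follow the assumed <think>/Final Answer layout")
    return match.group("reasoning").strip(), match.group("final")


def parse_jsonl_line(line: str) -> dict:
    """Parse one line of the JSONL dump into a flat record (hypothetical helper)."""
    record = json.loads(line)
    reasoning, final_answer = split_answer(record["answer"])
    return {
        "question": record["question"],
        "category": record["category"],
        "reasoning": reasoning,
        "final_answer": final_answer,
    }


# Made-up record for illustration only; it is not an actual dataset entry.
sample_line = json.dumps(
    {
        "question": "What is 2 + 2?",
        "answer": "<think>2 + 2 = 4.</think>Final Answer:4",
        "category": "arithmetic",
    }
)

parsed = parse_jsonl_line(sample_line)
print(parsed["final_answer"])
```

Raising on a non-matching answer, rather than returning empty strings, makes malformed rows visible when scanning a large dump; a tolerant variant could skip such rows instead.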
Provider:
mssfj