five

库帕思高质量教育思维链(Chain-of-Thought)数据集-数学篇(上篇)

收藏
国家数据集管理服务平台2026-04-28 更新2026-04-29 收录
下载链接:
https://www.ndsms.cn/dataRetrieval/datasetDetail/?id=2299c25f2ea309cfdc6f354dacfc3bc7
下载链接
链接失效反馈
官方服务:
资源简介:
数学上篇聚焦离散数学与高等数学核心知识模块。包含选择、填空、判断、解答等题型。为AI教育工具提供逻辑推理与连续运算的思维样本,助力模型掌握多样化解题思路,提升复杂数学问题的辅导精度;同时为模型训练注入离散与连续数学的推理范式,优化跨学科问题解决能力。 在数据质量方面,所有数据均通过严格的清洗、校验与标注流程,确保数据的准确性与规范性,并统一数据格式,为模型训练与教育应用提供高可靠性支撑。 与传统数据集不同,我们不仅提供标准答案,更为每个问题配备了由先进大语言模型(LLM)多次独立生成的“采样答案”及其详尽的“思考链”(reasoning_content)。所有采样结果都经过了自动化评估流水线检验,尽量使得最终产出的数据在正确性、逻辑性和一致性上都达到高标准。

Part I of Mathematics focuses on core knowledge modules of discrete mathematics and advanced mathematics, including multiple-choice, fill-in-the-blank, true-false, and free-response question types. It provides thinking samples for AI educational tools, covering logical reasoning and continuous mathematical operations, helping models master diverse problem-solving strategies and improve the accuracy of tutoring for complex mathematical problems; additionally, it injects reasoning paradigms of discrete and continuous mathematics into model training, optimizing cross-disciplinary problem-solving capabilities. Regarding data quality, all datasets have undergone strict cleaning, verification and annotation procedures to ensure their accuracy and standardization, with a unified data format, providing highly reliable support for model training and educational applications. Unlike traditional datasets, this collection not only provides standard answer keys, but also equips each question with "sampled answers" and their detailed "thinking chains" (reasoning_content) independently generated multiple times by state-of-the-art Large Language Models (LLMs). All sampled results have been inspected via automated evaluation pipelines, striving to ensure that the final produced data meets high standards in terms of correctness, logical rigor and consistency.
提供机构:
上海库帕思科技有限公司
创建时间:
2026-04-27
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集聚焦离散数学与高等数学核心知识模块,包含多种题型,旨在为AI教育工具提供逻辑推理与连续运算的思维样本,助力模型掌握多样化解题思路。数据经过严格清洗与标注确保高质量,其特色在于不仅提供标准答案,还为每个问题配备了由大语言模型生成的采样答案及详细思考链,并经过自动化评估。
以上内容由遇见数据集搜集并总结生成
二维码
社区交流群
二维码
科研交流群
商业服务