库帕思高质量教育思维链(Chain-of-Thought)数据集-物理(上篇)
收藏国家数据集管理服务平台2026-04-28 更新2026-04-29 收录
下载链接:
https://www.ndsms.cn/dataRetrieval/datasetDetail/?id=1c127aab55d417c4f43801924a769226
下载链接
链接失效反馈官方服务:
资源简介:
物理上篇聚焦物理学科的客观题,如选择、填空、判断等。为教育智能化提供物理客观题的标准化数据支撑,可助力智能题库精准出题,帮助模型强化对物理基础知识点的辨析与判断能力,提升其在客观题场景下的答题准确性。
在数据质量方面,所有数据均通过严格的清洗、校验与标注流程,确保数据的准确性与规范性,并统一数据格式,为模型训练与教育应用提供高可靠性支撑。
与传统数据集不同,我们不仅提供标准答案,更为每个问题配备了由先进大语言模型(LLM)多次独立生成的“采样答案”及其详尽的“思考链”(reasoning_content)。所有采样结果都经过了自动化评估流水线检验,尽量使得最终产出的数据在正确性、逻辑性和一致性上都达到高标准。
Part I of this dataset focuses on objective physics questions, including multiple-choice, fill-in-the-blank, and true-false questions. It provides standardized data support for physics objective questions to underpin educational intelligence applications, enabling intelligent question banks to generate questions precisely, helping models enhance their ability to distinguish and judge basic physics knowledge points, and improving their answer accuracy in objective question scenarios. In terms of data quality, all data has undergone strict cleaning, verification, and annotation processes to ensure its accuracy and standardization, with a unified data format, providing highly reliable support for model training and educational applications. Unlike traditional datasets, this resource not only provides standard answers but also equips each question with "sampled answers" and their detailed reasoning chains (reasoning_content) independently generated multiple times by advanced large language models (LLMs). All sampled results have been verified through an automated evaluation pipeline, striving to ensure that the final generated data meets high standards in terms of correctness, logic, and consistency.
提供机构:
上海库帕思科技有限公司
创建时间:
2026-04-27
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集专注于物理学科的客观题型,旨在为教育智能化提供标准化数据支持,以提升AI模型在物理基础知识点辨析和客观题答题方面的准确性。数据经过严格的质量控制流程,确保规范可靠。其特色在于除标准答案外,还提供了由大语言模型生成的多组采样答案及详细的思考链内容,这些内容均通过了自动化评估,以增强数据的逻辑性和一致性。
以上内容由遇见数据集搜集并总结生成



