TAUR-dev/add_until_it_works
收藏Hugging Face2025-03-27 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/add_until_it_works
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了解题过程中的问题、解决方案、思考轨迹、尝试次数以及评分等信息。具体包括问题(question)、解决方案(solution)、cot类型(cot_type)、数据来源类型(source_type)、元数据(metadata)、两种方法的思考轨迹(gemini_thinking_trajectory和deepseek_thinking_trajectory)、两种方法的尝试次数(gemini_attempt和deepseek_attempt)、两种方法的评分(gemini_grade和deepseek_grade)以及评分理由(gemini_grade_reason和deepseek_grade_reason)。此外,还包含了不同比例的cot的准确度(acc_by_fraction_of_cot)。
The dataset contains information about the problem-solving process, including questions, solutions, thinking trajectories, number of attempts, and ratings. Specifically, it includes fields for question (question), solution (solution), cot type (cot_type), source type (source_type), metadata (metadata), thinking trajectories for two methods (gemini_thinking_trajectory and deepseek_thinking_trajectory), number of attempts for two methods (gemini_attempt and deepseek_attempt), ratings for two methods (gemini_grade and deepseek_grade), and reasons for the ratings (gemini_grade_reason and deepseek_grade_reason). Additionally, it contains the accuracy of cot at different ratios (acc_by_fraction_of_cot).
提供机构:
TAUR-dev



