violetxi/PRM-ak-prm-full-sft-MATH-500_L3_beam_N128_B5_D40_T0.0001_0-105
收藏Hugging Face2024-12-20 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/violetxi/PRM-ak-prm-full-sft-MATH-500_L3_beam_N128_B5_D40_T0.0001_0-105
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,主要涉及问题(problem)、解决方案(solution)、搜索轨迹(search_trace_with_values)、搜索方法(search_method)、真实答案(ground_truth)以及相关的输入输出令牌数量(search_input_tokens, search_output_tokens, solution_input_tokens, solution_output_tokens)。这些字段表明数据集可能用于训练或评估与问题解决和搜索相关的模型,特别是在需要追踪搜索过程和评估解决方案有效性的场景中。
This dataset includes multiple fields related to problems, solutions, search traces, search methods, ground truths, and associated input and output token counts. These fields suggest that the dataset is likely used for training or evaluating models related to problem-solving and searching, particularly in scenarios where tracking the search process and evaluating the effectiveness of solutions are necessary.
提供机构:
violetxi



