sibasmarakp/Qwen2.5-14B-Instruct-best_of_n-completions
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/sibasmarakp/Qwen2.5-14B-Instruct-best_of_n-completions
下载链接
链接失效反馈官方服务:
资源简介:
提供的README内容描述了一个名为HuggingFaceH4_MATH-500的数据集,该数据集似乎与数学问题及其解决方案相关。数据集包含多个配置,每个配置具有不同的参数,如温度(T)、top_p、完成次数(n)、种子值和聚合策略(agg_strategy)。每个配置包含诸如problem(问题)、solution(解决方案)、answer(答案)、subject(主题)、level(级别)、unique_id(唯一标识符)、completions(完成情况)、scores(分数)、pred(预测)以及各种与预测相关的字段。数据集被划分为训练集,并指定了字节数和示例数。配置还包括评估指标(evals),具有n、acc_naive、acc_weighted和acc_maj等特征。该数据集似乎旨在评估模型在不同采样和聚合策略下解决数学问题的性能。
The provided README content describes a dataset named HuggingFaceH4_MATH-500 which appears to be related to mathematical problems and their solutions. The dataset includes multiple configurations with different parameters such as temperature (T), top_p, number of completions (n), seed values, and aggregation strategies (agg_strategy). Each configuration contains features like problem, solution, answer, subject, level, unique_id, completions, scores, pred, and various prediction-related fields. The dataset is split into training sets with specified numbers of bytes and examples. The configurations also include evaluation metrics (evals) with features like n, acc_naive, acc_weighted, and acc_maj. The dataset seems to be designed for evaluating model performance on mathematical problem-solving tasks with different sampling and aggregation strategies.
提供机构:
sibasmarakp



