Ayush-Singh/reward-bench-Qwen2.5-Math-7B-yes-no
Source: Hugging Face | Updated: 2025-02-06 | Indexed: 2025-02-15
Download link:
https://hf-mirror.com/datasets/Ayush-Singh/reward-bench-Qwen2.5-Math-7B-yes-no
Description:
This dataset contains math problems. Each record includes the question (prompt), the chosen answer (chosen), the model associated with the chosen answer (chosen_model), the rejected answer (rejected), the model associated with the rejected answer (rejected_model), the data subset (subset), a unique identifier (id), and four probability fields (chosen_yes_prob, chosen_no_prob, rejected_yes_prob, rejected_no_prob). The dataset consists of a single math_prm section with 447 examples.
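The listing does not say how the four probability fields are meant to be used. One plausible reading, offered here only as a sketch, is that each answer was judged with a yes/no question and that the chosen answer should score higher than the rejected one. The helper names (`yes_score`, `prefers_chosen`, `pairwise_accuracy`) and the sample rows below are hypothetical and are not taken from the dataset.

```python
from typing import Iterable, Mapping


def yes_score(yes_prob: float, no_prob: float) -> float:
    """Normalize the 'yes' probability against the 'no' probability.

    Assumption: the two values are probabilities of a 'yes' and a 'no'
    verdict and need not sum to 1. Returns 0.5 when both are zero.
    """
    total = yes_prob + no_prob
    return yes_prob / total if total > 0 else 0.5


def prefers_chosen(row: Mapping[str, float]) -> bool:
    """True when the chosen answer gets a higher normalized 'yes' score."""
    chosen = yes_score(row["chosen_yes_prob"], row["chosen_no_prob"])
    rejected = yes_score(row["rejected_yes_prob"], row["rejected_no_prob"])
    return chosen > rejected


def pairwise_accuracy(rows: Iterable[Mapping[str, float]]) -> float:
    """Fraction of rows in which the chosen answer outscores the rejected one."""
    rows = list(rows)
    if not rows:
        raise ValueError("no rows to score")
    return sum(prefers_chosen(r) for r in rows) / len(rows)


# Made-up rows for illustration only; these are not real dataset values.
sample_rows = [
    {"chosen_yes_prob": 0.8, "chosen_no_prob": 0.1,
     "rejected_yes_prob": 0.3, "rejected_no_prob": 0.6},
    {"chosen_yes_prob": 0.4, "chosen_no_prob": 0.5,
     "rejected_yes_prob": 0.6, "rejected_no_prob": 0.3},
]

print(pairwise_accuracy(sample_rows))  # 0.5 for the made-up rows above
```

With the Hugging Face `datasets` library installed, real records could presumably be passed to `pairwise_accuracy` after loading them with `load_dataset("Ayush-Singh/reward-bench-Qwen2.5-Math-7B-yes-no", split="math_prm")`; treating `math_prm` as the split name is an assumption based on the description above.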
Provided by:
Ayush-Singh



