ttc-research/DeepSeek-R1-Distill-Qwen-1.5B-PRM-prm800k-Llama-3.2-3B-Instruct-best_of_n-completions
收藏Hugging Face2025-03-13 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/ttc-research/DeepSeek-R1-Distill-Qwen-1.5B-PRM-prm800k-Llama-3.2-3B-Instruct-best_of_n-completions
下载链接
链接失效反馈官方服务:
资源简介:
MATH数据集,包含不同种子值的多个配置版本,每个版本包含四个特征:问题数量(n),朴素准确率(acc_naive),加权准确率(acc_weighted)和多数投票准确率(acc_maj)。每个配置的训练集包含9个示例。
The MATH dataset consists of multiple configuration versions with different seed values. Each version includes four features: the number of problems (n), naive accuracy (acc_naive), weighted accuracy (acc_weighted), and majority vote accuracy (acc_maj). Each configurations training set contains 9 examples.
提供机构:
ttc-research



