haoranli-ml/AIME2025_Qwen3-4B-Instruct_16384_128
收藏Hugging Face2025-10-24 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/haoranli-ml/AIME2025_Qwen3-4B-Instruct_16384_128
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个问题回答或对话数据集,包含问题、答案、分词后的提示、响应列表、奖励列表、最大奖励和平均奖励等字段。数据集被分割为AIME2025部分,共有30个示例。
This dataset is a question answering or dialogue dataset, including fields such as problem, answer, tokenized prompt, list of responses, list of rewards, maximum reward, and average reward. The dataset is split into the AIME2025 section, containing a total of 30 examples.
提供机构:
haoranli-ml



