haoranli-ml/AIME2025_Qwen3-4B-Instruct-raw-sft_rl_4096budget_16rollouts_0.8temp
Hugging Face. Updated 2025-11-08; indexed 2025-11-15.
Download link:
https://hf-mirror.com/datasets/haoranli-ml/AIME2025_Qwen3-4B-Instruct-raw-sft_rl_4096budget_16rollouts_0.8temp
Description:
The dataset contains the following fields: problem, answer, tokenized prompt, list of responses, list of rewards, maximum reward, and average reward. It has a single split, AIME2025, with 30 examples. The total dataset size is 6010169 bytes, and the download size is 2391114 bytes.
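The maximum and average reward fields are presumably derived from each example's list of rewards. The sketch below shows that relationship on a made-up record. The key name `rewards`, the helper `summarize_rewards`, and the sample values are illustrative assumptions, not the dataset's confirmed schema or contents.

```python
from statistics import mean


def summarize_rewards(rewards):
    """Return (max_reward, mean_reward) for one example's list of rewards."""
    if not rewards:
        raise ValueError("rewards must be a non-empty list")
    return max(rewards), mean(rewards)


# Hypothetical record: the key names and values are assumptions for illustration.
record = {
    "problem": "placeholder problem text",
    "answer": "placeholder answer",
    "rewards": [1.0, 0.0, 0.0, 1.0],
}

max_reward, mean_reward = summarize_rewards(record["rewards"])
print(max_reward, mean_reward)
```

If the stored per-example summaries follow this definition, recomputing them from the reward list is a quick consistency check after download.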
Provider:
haoranli-ml



