RyanYr/DAPO-Math-17k_gemini3-flash_qwen3-4B-Base-n4_1e-5_sft_matheval
收藏Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/RyanYr/DAPO-Math-17k_gemini3-flash_qwen3-4B-Base-n4_1e-5_sft_matheval
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个自然语言处理数据集,包含多个特征字段:data_source(数据来源)、problem(问题描述)、solution(解决方案)、answer(答案)、prompt(提示信息,由角色和内容组成)、reward_model(奖励模型信息,包括ground_truth和style)和responses(响应列表)。数据集分为mixed和hard两种类型,每种类型有不同比例的分割(如100%、95%、90%、85%),其中mixed分割包含约1447个示例,hard分割包含100个示例。总下载大小约为62.5MB,数据集大小约为66.6MB。该数据集可能用于训练或评估语言模型在问题解答和奖励模型方面的性能。
This dataset is a natural language processing dataset containing multiple feature fields: data_source (data source), problem (problem description), solution (solution), answer (answer), prompt (prompt information, consisting of role and content), reward_model (reward model information, including ground_truth and style), and responses (list of responses). The dataset is divided into mixed and hard types, each with different proportion splits (e.g., 100%, 95%, 90%, 85%), where mixed splits contain approximately 1447 examples and hard splits contain 100 examples. The total download size is approximately 62.5MB, and the dataset size is approximately 66.6MB. This dataset may be used for training or evaluating language models in problem-solving and reward model performance.
提供机构:
RyanYr



