feedbackagent/train_reflection_eval1_with_rewards
Source: Hugging Face. Updated: 2024-12-10. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/feedbackagent/train_reflection_eval1_with_rewards
Description:
The dataset has nine features: gt (ground truth), idx (index), prompt, completions, problem, response, reflection, rewards, and preds (predictions). Feature types include strings, integers, and sequences of boolean values. There is a single training split of 49998 examples occupying 1069622536 bytes, which is also the total dataset size. The download size is 344998480 bytes.
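To put the byte counts in the description into more familiar units, the short Python sketch below converts them to MiB and derives two rough figures: the ratio of dataset size to download size, and the average stored size per example. The three constants are copied from the description; the helper name `to_mib` and the variable names are illustrative only and are not part of the dataset or any library.

```python
# Figures quoted in the dataset description.
DATASET_SIZE_BYTES = 1_069_622_536   # training split size (also the total dataset size)
DOWNLOAD_SIZE_BYTES = 344_998_480    # size of the files that are downloaded
NUM_EXAMPLES = 49_998                # examples in the training split

MIB = 1024 ** 2  # bytes per mebibyte


def to_mib(n_bytes: int) -> float:
    """Convert a byte count to mebibytes (hypothetical helper for this sketch)."""
    return n_bytes / MIB


dataset_mib = to_mib(DATASET_SIZE_BYTES)
download_mib = to_mib(DOWNLOAD_SIZE_BYTES)

# How many times larger the dataset is than the download.
size_ratio = DATASET_SIZE_BYTES / DOWNLOAD_SIZE_BYTES

# Mean size per example, averaged over all nine features.
avg_bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

print(f"dataset size:    {dataset_mib:.2f} MiB")
print(f"download size:   {download_mib:.2f} MiB")
print(f"size ratio:      {size_ratio:.2f}x")
print(f"avg per example: {avg_bytes_per_example:.0f} bytes")
```

By this arithmetic the dataset is roughly 1 GiB, the download is about a third of that, and each example averages on the order of 20 KB, which suggests fairly long text fields.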
Provider:
feedbackagent



