feedbackagent/train3ep_test_with_reflection_and_completion_and_rewards
Source: Hugging Face. Updated 2024-12-11; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/feedbackagent/train3ep_test_with_reflection_and_completion_and_rewards
Description:
The dataset has nine feature fields: gt (string), idx (integer), prompt (string), completions (sequence of strings), problem (string), response (string), reflection (string), rewards (sequence of booleans), and preds (sequence of strings). It contains a single training split with 7936 examples, totaling 211512550 bytes (about 211.5 MB).
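The field list above can be written down as a typed record. The following is a minimal sketch, not code from the dataset authors: the `Record` type, the `pass_rate` helper, and the sample values are hypothetical, and it assumes that each entry in `rewards` scores the completion at the same position, which the listing does not state.

```python
from typing import List, TypedDict


class Record(TypedDict):
    """One row, using the field names and types given in the description."""
    gt: str
    idx: int
    prompt: str
    completions: List[str]
    problem: str
    response: str
    reflection: str
    rewards: List[bool]
    preds: List[str]


def pass_rate(record: Record) -> float:
    """Fraction of True entries in `rewards`; 0.0 when the list is empty."""
    rewards = record["rewards"]
    return sum(rewards) / len(rewards) if rewards else 0.0


# Made-up row for illustration only; the values are not taken from the dataset.
example: Record = {
    "gt": "4",
    "idx": 0,
    "prompt": "Solve: 2 + 2",
    "completions": ["2 + 2 = 4", "2 + 2 = 5"],
    "problem": "2 + 2",
    "response": "The answer is 5.",
    "reflection": "The addition was wrong; 2 + 2 is 4.",
    "rewards": [True, False],  # assumed to align with `completions` by position
    "preds": ["4", "5"],
}

print(pass_rate(example))
```

With the Hugging Face `datasets` library installed, rows of this shape could presumably be obtained with `load_dataset("feedbackagent/train3ep_test_with_reflection_and_completion_and_rewards", split="train")`, though that call is not part of the listing and needs network access.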
Provider:
feedbackagent



