feedbackagent/train_reflection_eval4_with_rewards
收藏Hugging Face2024-12-10 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/feedbackagent/train_reflection_eval4_with_rewards
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如gt(字符串类型)、idx(int64类型)、prompt(字符串类型)、completions(字符串序列)、problem(字符串类型)、response(字符串类型)、reflection(字符串类型)、rewards(布尔序列)和preds(字符串序列)。数据集分为一个训练集,包含124304个样本,总大小为3281174028字节,下载大小为1065772357字节。数据集的配置文件名为default,数据文件路径为data/train-*。
The dataset contains multiple features, including gt (string type), idx (int64 type), prompt (string type), completions (sequence of strings), problem (string type), response (string type), reflection (string type), rewards (sequence of booleans), and preds (sequence of strings). The dataset is divided into a training set containing 124304 samples, with a total size of 3281174028 bytes and a download size of 1065772357 bytes. The configuration file for the dataset is named default, and the data file path is data/train-*.
提供机构:
feedbackagent



