xqx-chloe1/rollouts_po2_train_2_cleaned
收藏Hugging Face2025-10-11 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/xqx-chloe1/rollouts_po2_train_2_cleaned
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如当前选择(curr_choice),距离目标的距离(distance_to_goal),被拒绝的内容(rejected)及其角色,选择的内容(chosen)及其角色,目标(target),上一次的选择(last_choice),epsilon值(eps),步骤(step),以及提示(prompt)的内容和类型。数据集分为训练集(train),包含38504个示例,总大小为81124625字节。
The dataset includes multiple fields such as current choice (curr_choice), distance to goal (distance_to_goal), rejected content and its role, chosen content and its role, target (target), last choice (last_choice), epsilon value (eps), step (step), and prompt content and type. The dataset is split into a training set (train) containing 38504 examples with a total size of 81124625 bytes.
提供机构:
xqx-chloe1



