RyanYr/reflect_qwen3Bb_postSft_Om2G8kOm2AgG8k40k_traj_it1_dpo
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/RyanYr/reflect_qwen3Bb_postSft_Om2G8kOm2AgG8k40k_traj_it1_dpo
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:提示(prompt)、选中(chosen)、拒绝(rejected)和评论(comment)。它适用于训练可能涉及选择和评论任务的模型。训练集包含27818个示例,文件大小为276,316,966字节。
The dataset includes four fields: prompt, chosen, rejected, and comment. It is suitable for training models that may involve selection and commenting tasks. The training set contains 27,818 examples with a file size of 276,316,966 bytes.
提供机构:
RyanYr



