sailor2/sea-ultrafeedback-onpolicy
收藏Hugging Face2025-02-16 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/sailor2/sea-ultrafeedback-onpolicy
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了语言、提示、选择的内容和角色以及拒绝的内容和角色等信息。它被设计用于训练模型,能够处理和识别文本中的提示以及相关联的内容和角色。数据集分为训练集,共有38327个示例。
The dataset includes language, prompt, chosen content and role, as well as rejected content and role. It is designed for training models to process and recognize prompts and associated content and roles in text. The dataset is split into a training set with a total of 38,327 examples.
提供机构:
sailor2



