flammenai/Date-DPO-v3
收藏Hugging Face2024-08-01 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/flammenai/Date-DPO-v3
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
license: apache-2.0
---
# Date-DPO-v3
DPO dataset aiming to reduce output verbosity and "GPT-speak."
This volume adds roleplay elements and encourages chosen answers to respond as if the assistant were a human with personal preferences.
## Method
100 questions were generated by flammen24-mistral-7B.
ChatGPT 3.5's one-shot answers were selected as the `rejected` responses.
flammen22X-mistral-7B AND flammen24-mistral-7B were used to generate `chosen` prompts. Many of these responses were manually edited.
提供机构:
flammenai
原始信息汇总
数据集概述
数据集名称
Date-DPO-v3
数据集目的
旨在减少输出冗长和“GPT-speak”,并增加角色扮演元素,鼓励选定答案以人类助手的个人偏好进行响应。
数据集方法
- 问题生成:100个问题由flammen24-mistral-7B生成。
- 拒绝响应:ChatGPT 3.5的一次性答案被选为
拒绝响应。 - 选定提示:flammen22X-mistral-7B和flammen24-mistral-7B用于生成
选定提示,其中许多响应经过手动编辑。



