lemon07r/VellumK2-Unfettered-DPO-01

Name: lemon07r/VellumK2-Unfettered-DPO-01
Creator: lemon07r
Published: 2025-11-07 07:55:15
License: 暂无描述

Hugging Face2025-11-07 更新2025-11-15 收录

下载链接：

https://hf-mirror.com/datasets/lemon07r/VellumK2-Unfettered-DPO-01

下载链接

链接失效反馈

官方服务：

资源简介：

这是一个高质量的DPO数据集，通过智能拒绝过滤确保所有“选择”的响应符合请求且无审查。该数据集结合了VellumForge2的合成生成和先进的拒绝检测，为处理成熟内容的模型创建训练数据。每个条目包含三个字段：提示、选择的响应和拒绝的响应。该数据集适用于训练语言模型以直接优化偏好，处理成熟内容而不必进行不必要的拒绝，为创意写作请求提供合规、有帮助的响应，减少过度审查，并理解未经审查内容在适当情况下的合适性。

A high-quality DPO (Direct Preference Optimization) dataset with intelligent refusal filtering to ensure all chosen responses comply with requests without censorship. The dataset is generated using VellumForge2 and filtered using a three-stage hybrid refusal detection system. It is designed for training models to handle mature content appropriately without unnecessary refusal behaviors. Each entry includes three fields: prompt, chosen response, and rejected response. The dataset is suitable for training language models to directly optimize preferences, handle mature content without unnecessary refusals, provide compliant and helpful responses to creative writing requests, reduce over-censorship, and understand the appropriateness of uncensored content in context.

提供机构：

lemon07r

5,000+

优质数据集

54 个

任务类型

进入经典数据集