hassanjbara/LONG-DPO

Name: hassanjbara/LONG-DPO
Creator: hassanjbara
Published: 2024-07-21 13:34:22
License: 暂无描述

Hugging Face2024-07-21 更新2024-06-29 收录

下载链接：

https://hf-mirror.com/datasets/hassanjbara/LONG-DPO

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是[hassanjbara/LONG](https://huggingface.co/datasets/hassanjbara/LONG)数据集的DPO（Direct Preference Optimization）版本，用于训练模型以“击败”[Hello-SimpleAI/chatgpt-detector-roberta](https://huggingface.co/Hello-SimpleAI/chatgpt-detector-roberta)检测器。数据集包含查询、被拒绝的响应和选择的响应三个特征，主要用于文本生成任务。

This dataset is a DPO version of the [hassanjbara/LONG](https://huggingface.co/datasets/hassanjbara/LONG) dataset, intended for training a model to beat the [Hello-SimpleAI/chatgpt-detector-roberta](https://huggingface.co/Hello-SimpleAI/chatgpt-detector-roberta) detector. It includes features such as query, rejected response, and chosen response, primarily used for text generation tasks.

提供机构：

hassanjbara

原始信息汇总

数据集概述

数据集信息

特征:
- query: 类型为字符串
- rejected_response: 类型为字符串
- choosen_response: 类型为字符串
分割:
- train: 包含9857个样本，占用36074663字节
下载大小: 20267392字节
数据集大小: 36074663字节
配置:
- default: 包含训练数据文件路径为data/train-*
许可证: MIT
任务类别: 文本生成
语言: 英语
标签: DPO
大小类别: 1K<n<10K

5,000+

优质数据集

54 个

任务类型

进入经典数据集