Sivaganesh07/preference_dataset

Name: Sivaganesh07/preference_dataset
Creator: Sivaganesh07
Published: 2025-03-13 20:12:20
License: No description provided

Source: Hugging Face. Updated 2025-03-13. Indexed 2025-04-12.

Download link:

https://hf-mirror.com/datasets/Sivaganesh07/preference_dataset

Description:

The TL;DR dataset is a processed version of Reddit posts, curated for training models with the TRL library on preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It relies on the common Reddit practice of appending a TL;DR ("Too Long; Didn't Read") summary to a lengthy post, which yields paired post and summary text for training models to understand and generate concise summaries.
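The pairing described above can be illustrated with a minimal sketch. It is not the processing code used to build this dataset: the helper names `split_post` and `make_preference_record` are hypothetical, and the `prompt` / `chosen` / `rejected` field names are an assumption based on the preference format commonly used with TRL, since the listing does not state the actual schema.

```python
import re
from typing import Dict, Optional, Tuple

# Matches common spellings of the marker: "TL;DR", "tl;dr:", "TLDR", etc.
TLDR_MARKER = re.compile(r"\btl;?dr\b[:\s]*", re.IGNORECASE)


def split_post(text: str) -> Optional[Tuple[str, str]]:
    """Split a post into (body, summary) at the last TL;DR marker.

    Returns None when there is no marker or when either side is empty.
    Hypothetical helper, for illustration only.
    """
    matches = list(TLDR_MARKER.finditer(text))
    if not matches:
        return None
    last = matches[-1]
    body = text[: last.start()].strip()
    summary = text[last.end():].strip()
    if not body or not summary:
        return None
    return body, summary


def make_preference_record(body: str, preferred: str, dispreferred: str) -> Dict[str, str]:
    """Build one preference example.

    Field names are assumed (prompt / chosen / rejected); check the
    dataset itself for its real columns before relying on them.
    """
    return {
        "prompt": f"{body}\n\nTL;DR:",
        "chosen": " " + preferred,
        "rejected": " " + dispreferred,
    }


post = (
    "I spent three weeks debugging a flaky test. It was a timezone bug.\n\n"
    "TL;DR: flaky test was a timezone bug."
)
parts = split_post(post)
if parts is not None:
    body, summary = parts
    # The rejected summary here is invented purely for the example.
    record = make_preference_record(body, summary, "I debugged something.")
    print(record["prompt"])
    print(record["chosen"])
```

In a preference dataset the rejected text would normally come from a second candidate summary judged worse by an annotator, not from a hand-written placeholder as in this sketch.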

Provider:

Sivaganesh07
