trl-lib/tldr-preference
收藏Hugging Face2025-01-08 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/trl-lib/tldr-preference
下载链接
链接失效反馈官方服务:
资源简介:
TL;DR数据集是一个经过处理的Reddit帖子版本,专门用于偏好学习和从人类反馈中强化学习任务的模型训练。该数据集利用了Reddit用户常见的做法,即在长帖子后附加TL;DR(Too Long; Didnt Read)摘要,为训练模型理解并生成简洁摘要提供了丰富的成对文本数据。
The TL;DR Dataset is a processed version of Reddit posts, specifically curated for training models on preference learning and Reinforcement Learning from Human Feedback (RLHF) tasks. It utilizes the common practice on Reddit where users append TL;DR (Too Long; Didnt Read) summaries to lengthy posts, providing a rich source of paired text data for training models to understand and generate concise summaries.
提供机构:
trl-lib



