seangogo/processed_tldr_comparison_dataset_20251102_065554

Name: seangogo/processed_tldr_comparison_dataset_20251102_065554
Creator: seangogo
Published: 2025-11-02 06:57:17
License: No description provided

Source: Hugging Face | Updated: 2025-11-02 | Indexed: 2025-11-15

Download link:

https://hf-mirror.com/datasets/seangogo/processed_tldr_comparison_dataset_20251102_065554

Description:

This is a comparison dataset for training a reward model. Each example contains a query (a post to be summarized) and two responses, one chosen and one rejected, where the chosen response is the one preferred in human feedback.
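As a rough illustration of how such comparison examples are typically consumed, the sketch below scores a chosen and a rejected response and computes the standard pairwise (Bradley-Terry style) loss often used for reward models. The field names `query`, `chosen`, and `rejected` are assumptions based on the description, not confirmed column names, and `toy_reward` is a hypothetical stand-in for a learned model. Nothing here is confirmed as the dataset creator's own training method.

```python
import math
from dataclasses import dataclass


@dataclass
class Comparison:
    # Field names are assumed for illustration; check the dataset's actual columns.
    query: str      # the post to be summarized
    chosen: str     # the response preferred by human feedback
    rejected: str   # the less preferred response


def pairwise_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Return -log(sigmoid(reward_chosen - reward_rejected)), computed stably."""
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch to avoid overflow in exp.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def toy_reward(query: str, response: str) -> float:
    # Hypothetical placeholder for a learned reward model: favors shorter responses.
    return -len(response.split()) / 10.0


example = Comparison(
    query="Long post text ...",
    chosen="Short summary.",
    rejected="A much longer and rambling summary of the post.",
)

loss = pairwise_loss(
    toy_reward(example.query, example.chosen),
    toy_reward(example.query, example.rejected),
)
print(f"pairwise loss: {loss:.4f}")
```

The loss falls below log(2) when the model scores the chosen response higher than the rejected one, and grows roughly linearly with the margin when the ranking is reversed.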

Provider:

seangogo
