SAA-Lab/LitBench-Test-Enhanced

Name: SAA-Lab/LitBench-Test-Enhanced
Creator: SAA-Lab
Published: 2025-06-17 20:30:33
License: No description available

Source: Hugging Face. Updated: 2025-06-17. Indexed: 2025-07-05.

Download link:

https://hf-mirror.com/datasets/SAA-Lab/LitBench-Test-Enhanced

Description:

This dataset contains story preference pairs collected from user comments on Reddit. Each record holds a story prompt (prompt), the chosen story (chosen_story), and the rejected story (rejected_story). For both the chosen and the rejected side it also records the comment ID (chosen_comment_id, rejected_comment_id), the comment score (chosen_comment_score, rejected_comment_score), the username (chosen_username, rejected_username), the timestamp (chosen_timestamp, rejected_timestamp), and the Reddit post ID (chosen_reddit_post_id, rejected_reddit_post_id). The dataset provides a training split (train), which is 15,857,748 bytes in size and contains 2,480 samples.
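The field list above can be turned into a small schema check. The following is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` package is installed and that the repository is publicly readable under the ID shown on this page. The helper names `missing_fields`, `score_margin`, and `load_train_split` are hypothetical and introduced only for illustration.

```python
from typing import Any, Dict, List

# Field names exactly as listed in the dataset description.
EXPECTED_FIELDS: List[str] = [
    "prompt",
    "chosen_story",
    "rejected_story",
    "chosen_comment_id",
    "rejected_comment_id",
    "chosen_comment_score",
    "rejected_comment_score",
    "chosen_username",
    "rejected_username",
    "chosen_timestamp",
    "rejected_timestamp",
    "chosen_reddit_post_id",
    "rejected_reddit_post_id",
]


def missing_fields(record: Dict[str, Any]) -> List[str]:
    """Return the expected field names that are absent from a record."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def score_margin(record: Dict[str, Any]) -> float:
    """Hypothetical helper: chosen comment score minus rejected comment score."""
    return float(record["chosen_comment_score"]) - float(
        record["rejected_comment_score"]
    )


def load_train_split():
    """Load the train split from the Hugging Face Hub (needs network access).

    Assumption: the repository ID below matches the one on this page and
    the split is named "train", as the description states.
    """
    # Imported lazily so the helpers above work without the package installed.
    from datasets import load_dataset

    return load_dataset("SAA-Lab/LitBench-Test-Enhanced", split="train")
```

With network access, `ds = load_train_split()` followed by `missing_fields(ds[0])` should show whether the first record carries all thirteen listed fields, and `len(ds)` can be compared against the 2,480 samples reported in the description.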

Provider:

SAA-Lab
