five

SAA-Lab/LitBench-Test-Enhanced

收藏
Hugging Face2025-06-17 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/SAA-Lab/LitBench-Test-Enhanced
下载链接
链接失效反馈
官方服务:
资源简介:
这是一个包含用户在Reddit上对故事选择的评论数据集。数据集中的字段包括故事提示(prompt)、被选中的故事(chosen_story)、被拒绝的故事(rejected_story)、选中评论的ID(chosen_comment_id)、拒绝评论的ID(rejected_comment_id)、选中评论的分数(chosen_comment_score)、拒绝评论的分数(rejected_comment_score)、选中评论的用户名(chosen_username)、拒绝评论的用户名(rejected_username)、选中评论的时间戳(chosen_timestamp)、拒绝评论的时间戳(rejected_timestamp)、选中故事在Reddit的帖子ID(chosen_reddit_post_id)和拒绝故事在Reddit的帖子ID(rejected_reddit_post_id)。数据集分为训练集(train),大小为15857748字节,共有2480个样本。

This is a dataset containing user comments on story selection on Reddit. The dataset includes fields such as story prompt (prompt), chosen story (chosen_story), rejected story (rejected_story), chosen comment ID (chosen_comment_id), rejected comment ID (rejected_comment_id), chosen comment score (chosen_comment_score), rejected comment score (rejected_comment_score), chosen comment username (chosen_username), rejected comment username (rejected_username), chosen comment timestamp (chosen_timestamp), rejected comment timestamp (rejected_timestamp), chosen story Reddit post ID (chosen_reddit_post_id), and rejected story Reddit post ID (rejected_reddit_post_id). The dataset is split into a training set (train), which is 15857748 bytes in size and contains 2480 samples.
提供机构:
SAA-Lab
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作