five

stevied67/autotrain-data-pegasus-subreddit-comments-summarizer

收藏
Hugging Face2023-03-31 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/stevied67/autotrain-data-pegasus-subreddit-comments-summarizer
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集由AutoTrain自动处理,用于项目pegasus-subreddit-comments-summarizer。数据集的语言为英语。数据集包含文本和目标字段,其中文本是原始评论,目标是评论的摘要。数据集被分割为训练集和验证集,训练集包含7177个样本,验证集包含1796个样本。

该数据集由AutoTrain自动处理,用于项目pegasus-subreddit-comments-summarizer。数据集的语言为英语。数据集包含文本和目标字段,其中文本是原始评论,目标是评论的摘要。数据集被分割为训练集和验证集,训练集包含7177个样本,验证集包含1796个样本。
提供机构:
stevied67
原始信息汇总

AutoTrain Dataset for project: pegasus-subreddit-comments-summarizer

数据集描述

该数据集由AutoTrain自动处理,用于项目pegasus-subreddit-comments-summarizer。

语言

数据集的语言代码为BCP-47标准的en。

数据集结构

数据实例

数据集中的样本示例如下:

json [ { "text": "I go through this every single year. We have an Ironman competition that is 2 miles from my hotel, and I sell out for that weekend almost a year in advance. Without fail I will have some nitwit who will come up on their checkout day and ask to extend, when I tell them I cant they lose their mind at me. Its their room, they paid for it, theyre already in there how can I just give it away. People do not understand how reservations work.", "target": "The commenter experiences this every year - they sell out their hotel almost a year in advance for an Ironman competition nearby. Despite this, some customers still ask to extend their stay at checkout and get angry when told its not possible because they dont understand how reservations work." }, { "text": "Can i just say .. thanks for going back to make sure you hadnt overreacted. Im sure that made things so much easier on all the staff, with it being their first days back, being understaffed, Im sure, and trying to get back into the swing of things. I think you handled that really well :)", "target": "The commenter appreciates the posters effort in going back to verify if they had overreacted. The commenter believes this action might have made things easier for the understaffed team during their first days back. The commenter commends the poster for handling the situation well." } ]

数据集字段

数据集包含以下字段:

json { "text": "Value(dtype=string, id=None)", "target": "Value(dtype=string, id=None)" }

数据集分割

数据集被分割为训练集和验证集,分割大小如下:

分割名称 样本数量
训练集 7177
验证集 1796
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作