
skashyap96/autotrain-data-led-samsum-dialogsum

Source: Hugging Face. Updated: 2022-11-09. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/skashyap96/autotrain-data-led-samsum-dialogsum
Resource description:
---
task_categories:
- conditional-text-generation
---

# AutoTrain Dataset for project: led-samsum-dialogsum

## Dataset Description

This dataset has been automatically processed by AutoTrain for project led-samsum-dialogsum.

### Languages

The BCP-47 code for the dataset's language is unk.

## Dataset Structure

### Data Instances

A sample from this dataset looks as follows:

```json
[
  {
    "feat_Unnamed: 0": 0,
    "feat_id": 0,
    "text": "Amanda: I baked cookies. Do you want some?\nJerry: Sure!\nAmanda: I'll bring you tomorrow :-)",
    "target": "Amanda baked cookies and will bring Jerry some tomorrow."
  },
  {
    "feat_Unnamed: 0": 1,
    "feat_id": 1,
    "text": "Olivia: Who are you voting for in this election? \nOliver: Liberals as always.\nOlivia: Me too!!\nOliver: Great",
    "target": "Olivia and Olivier are voting for liberals in this election. "
  }
]
```

### Dataset Fields

The dataset has the following fields (also called "features"):

```json
{
  "feat_Unnamed: 0": "Value(dtype='int64', id=None)",
  "feat_id": "Value(dtype='int64', id=None)",
  "text": "Value(dtype='string', id=None)",
  "target": "Value(dtype='string', id=None)"
}
```

### Dataset Splits

This dataset is split into a train and validation split. The split sizes are as follows:

| Split name | Num samples |
| ------------ | ------------------- |
| train | 27191 |
| valid | 1318 |
Provider:
skashyap96
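Summary of original information

Dataset overview

Task category

  • Conditional text generation

Dataset description

  • The dataset was automatically processed by AutoTrain for the project led-samsum-dialogsum.
  • The language code is unk.

Dataset structure

Data instances

  • Each sample contains a dialogue text and a target summary.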
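
As a minimal sketch of how one sample could be handled, the snippet below splits a dialogue string into (speaker, utterance) pairs. It assumes every turn sits on its own line in the form "Speaker: utterance", as in the published sample; `split_turns` is a hypothetical helper name, not part of the dataset or of AutoTrain.

```python
from typing import List, Tuple


def split_turns(text: str) -> List[Tuple[str, str]]:
    """Split a dialogue string into (speaker, utterance) pairs.

    Assumption: each turn is one line shaped like "Speaker: utterance".
    Lines without a colon are skipped.
    """
    turns = []
    for line in text.split("\n"):
        # Split on the first colon only, so colons inside the utterance survive.
        speaker, sep, utterance = line.partition(":")
        if sep:
            turns.append((speaker.strip(), utterance.strip()))
    return turns


# First sample record from the dataset card.
sample = {
    "text": "Amanda: I baked cookies. Do you want some?\nJerry: Sure!\nAmanda: I'll bring you tomorrow :-)",
    "target": "Amanda baked cookies and will bring Jerry some tomorrow.",
}

turns = split_turns(sample["text"])
speakers = sorted({speaker for speaker, _ in turns})
```

Splitting on the first colon only keeps emoticons such as ":-)" inside the utterance rather than treating them as a speaker separator.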

Dataset fields

  • feat_Unnamed: 0: integer
  • feat_id: integer
  • text: string, contains the dialogue
  • target: string, contains the dialogue summary
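
The field list above can be turned into a simple record check. The following is a sketch under stated assumptions: `EXPECTED_FIELDS` and `validate_record` are hypothetical names introduced for illustration, the int64 and string dtypes are mapped to Python `int` and `str`, and the second record is made up to show what a failure looks like.

```python
from typing import Any, Dict, List

# Field names from the dataset card, mapped to Python types (an assumption:
# int64 -> int, string -> str).
EXPECTED_FIELDS = {
    "feat_Unnamed: 0": int,
    "feat_id": int,
    "text": str,
    "target": str,
}


def validate_record(record: Dict[str, Any]) -> List[str]:
    """Return a list of problems for one record; empty if it conforms."""
    problems = []
    for name, expected_type in EXPECTED_FIELDS.items():
        if name not in record:
            problems.append(f"missing field: {name}")
            continue
        value = record[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected_type):
            problems.append(
                f"wrong type for {name}: expected {expected_type.__name__}, "
                f"got {type(value).__name__}"
            )
    return problems


# Record copied from the dataset card.
good = {
    "feat_Unnamed: 0": 0,
    "feat_id": 0,
    "text": "Amanda: I baked cookies. Do you want some?\nJerry: Sure!\nAmanda: I'll bring you tomorrow :-)",
    "target": "Amanda baked cookies and will bring Jerry some tomorrow.",
}
# Hypothetical malformed record, for illustration only.
bad = {"feat_Unnamed: 0": 0, "feat_id": "0", "text": "Amanda: hi"}

good_problems = validate_record(good)
bad_problems = validate_record(bad)
```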

Dataset splits

  • Training set samples: 27191
  • Validation set samples: 1318
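
To make the split sizes easier to compare, this short sketch computes each split's share of the total. The counts come from the listing; the variable names are illustrative.

```python
# Split sizes as reported for this dataset.
split_sizes = {"train": 27191, "valid": 1318}

total = sum(split_sizes.values())
fractions = {name: count / total for name, count in split_sizes.items()}

for name, fraction in fractions.items():
    print(f"{name}: {split_sizes[name]} samples ({fraction:.1%})")
```

With these counts the total is 28509 samples, so the training split holds roughly 95.4% of the data and the validation split roughly 4.6%.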