skashyap96/autotrain-data-led-samsum-dialogsum
收藏Hugging Face2022-11-09 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/skashyap96/autotrain-data-led-samsum-dialogsum
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- conditional-text-generation
---
# AutoTrain Dataset for project: led-samsum-dialogsum
## Dataset Description
This dataset has been automatically processed by AutoTrain for project led-samsum-dialogsum.
### Languages
The BCP-47 code for the dataset's language is unk.
## Dataset Structure
### Data Instances
A sample from this dataset looks as follows:
```json
[
{
"feat_Unnamed: 0": 0,
"feat_id": 0,
"text": "Amanda: I baked cookies. Do you want some?\nJerry: Sure!\nAmanda: I'll bring you tomorrow :-)",
"target": "Amanda baked cookies and will bring Jerry some tomorrow."
},
{
"feat_Unnamed: 0": 1,
"feat_id": 1,
"text": "Olivia: Who are you voting for in this election? \nOliver: Liberals as always.\nOlivia: Me too!!\nOliver: Great",
"target": "Olivia and Olivier are voting for liberals in this election. "
}
]
```
### Dataset Fields
The dataset has the following fields (also called "features"):
```json
{
"feat_Unnamed: 0": "Value(dtype='int64', id=None)",
"feat_id": "Value(dtype='int64', id=None)",
"text": "Value(dtype='string', id=None)",
"target": "Value(dtype='string', id=None)"
}
```
### Dataset Splits
This dataset is split into a train and validation split. The split sizes are as follow:
| Split name | Num samples |
| ------------ | ------------------- |
| train | 27191 |
| valid | 1318 |
提供机构:
skashyap96
原始信息汇总
数据集概述
任务类别
- 条件文本生成
数据集描述
- 数据集为AutoTrain自动处理,用于项目led-samsum-dialogsum。
- 语言代码为unk。
数据集结构
数据实例
- 示例数据包含对话文本和目标摘要。
数据集字段
feat_Unnamed: 0: 整数类型feat_id: 整数类型text: 字符串类型,包含对话内容target: 字符串类型,包含对话摘要
数据集分割
- 训练集样本数: 27191
- 验证集样本数: 1318



