
neil-code/autotrain-data-test-summarization

Source: Hugging Face. Updated: 2023-08-24. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/neil-code/autotrain-data-test-summarization
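Summary:
This dataset was automatically processed by AutoTrain for a test summarization project. The data is in English and consists of dialogue texts paired with their summaries. Its fields are feat_id, text, target, and feat_topic. It is split into a training set of 1599 samples and a validation set of 400 samples.
Provider: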
neil-code
Summary of the original dataset card

AutoTrain Dataset for project: test-summarization

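Dataset Description

This dataset has been automatically processed by AutoTrain for the project test-summarization.

Languages

The BCP-47 code for the dataset's language is en.

Dataset Structure

Data Instances

A sample from this dataset looks as follows: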

```json
[
  {
    "feat_id": "train_1087",
    "text": "#Person1#: Hello sir, how can I help you? #Person2#: Yes, I need this prescription please. #Person1#: Lets see. Okay, so 50 mg of Prozac, would you prefer this in capsule or tablet? #Person2#: Capsules are fine. #Person1#: Okay, you should take 1 capsule 3 times a day. Be sure not to take it on an empty stomach, and also, dont ever mix it with alcohol! #Person2#: Yes, I know. Its not the first time Im taking this! Dont worry, I wont overdose! #Person1#: Okay, anything else I can get you? #Person2#: Oh, yes, I almost forgot! Can I also get some eye drops and um, some condoms? #Person1#: Sure. Darn condoms arent registered in our system. #Person2#: Oh, well thats okay, Ill get some later, thanks. . . Really its no problem. #Person1#: Just hang on there a sec. Can I get a price check on Fun Times Ribbed Condoms please!",
    "target": "#Person1# gives #Person2# #Person2#s prescription. #Person2# also wants some eye drops and some condoms but is told that darn condoms are registered in their system.",
    "feat_topic": "the pharmacy"
  },
  {
    "feat_id": "train_1193",
    "text": "#Person1#: Dont be too sad. If you really think that you have no feeling with him, then, in my opinion, getting divorced maybe is the best way to solve the problem. #Person2#: I know clearly at the bottom of my heart. I just cant set my mind at rest because of the child. Shes little. She cannot understand us and accept such truth. #Person1#: Yeah, child is the matter. Dont tell Jenny the truth, only tell her the white lie. When she grows up, you find the suitable opportunity to tell her. #Person2#: I see. OK. ",
    "target": "#Person1# suggests #Person2# get divorced if #Person2# has no feeling with a man and tell their daughter the white lie.",
    "feat_topic": "divorce"
  }
]
```
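Each `text` value in the sample packs a whole dialogue into one string, with turns introduced by tags such as `#Person1#:`. The following is a minimal sketch of splitting such a string into turns. The tag pattern is inferred from the two records shown here, and `split_turns` is a hypothetical helper, not part of AutoTrain or the dataset.

```python
import re

# Matches speaker tags such as "#Person1#:" (pattern inferred from the sample records).
SPEAKER_TAG = re.compile(r"(#Person\d+#):")


def split_turns(text: str) -> list[tuple[str, str]]:
    """Split a dialogue string into (speaker, utterance) pairs."""
    # re.split with one capturing group yields
    # [prefix, speaker, utterance, speaker, utterance, ...]
    parts = SPEAKER_TAG.split(text)
    speakers = parts[1::2]
    utterances = parts[2::2]
    return [(s, u.strip()) for s, u in zip(speakers, utterances)]


# First two turns of record "train_1087".
sample = (
    "#Person1#: Hello sir, how can I help you? "
    "#Person2#: Yes, I need this prescription please."
)
turns = split_turns(sample)
for speaker, utterance in turns:
    print(speaker, "->", utterance)
```

Tags that appear without a trailing colon, as in the `target` summaries, are left untouched by this pattern.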

Dataset Fields

The dataset has the following fields (also called "features"):

```json
{
  "feat_id": "Value(dtype=string, id=None)",
  "text": "Value(dtype=string, id=None)",
  "target": "Value(dtype=string, id=None)",
  "feat_topic": "Value(dtype=string, id=None)"
}
```
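Since all four fields are declared as strings, a record can be checked against the schema with a few lines of plain Python. This is only a sketch: `EXPECTED_FIELDS` and `validate_record` are hypothetical names introduced for illustration, and the `text` value in the example is abbreviated.

```python
# Field names taken from the feature listing; all are declared as strings.
EXPECTED_FIELDS = ("feat_id", "text", "target", "feat_topic")


def validate_record(record: dict) -> list[str]:
    """Return a list of problems found in one record (empty if it is valid)."""
    problems = []
    for name in EXPECTED_FIELDS:
        if name not in record:
            problems.append(f"missing field: {name}")
        elif not isinstance(record[name], str):
            problems.append(f"field {name} is not a string")
    return problems


record = {
    "feat_id": "train_1193",
    # Abbreviated for the example; the full dialogue is in the sample record.
    "text": "#Person1#: Dont be too sad.",
    "target": (
        "#Person1# suggests #Person2# get divorced if #Person2# has no "
        "feeling with a man and tell their daughter the white lie."
    ),
    "feat_topic": "divorce",
}
print(validate_record(record))
```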

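Dataset Splits

This dataset is split into a training set and a validation set. The split sizes are as follows:

| Split name | Number of samples |
| ---------- | ----------------- |
| train      | 1599              |
| valid      | 400               |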
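The split sizes give a quick sanity check after downloading. The sketch below assumes the repository can be loaded directly with `load_dataset` from the Hugging Face `datasets` library and exposes splits named `train` and `valid`; neither point is confirmed by this page. `size_mismatches` and `load_split_sizes` are hypothetical helpers.

```python
DATASET_ID = "neil-code/autotrain-data-test-summarization"
EXPECTED_SIZES = {"train": 1599, "valid": 400}  # from the split table


def size_mismatches(actual):
    """Map split name -> (expected, actual) for every split whose size differs."""
    return {
        name: (expected, actual.get(name))
        for name, expected in EXPECTED_SIZES.items()
        if actual.get(name) != expected
    }


def load_split_sizes():
    """Download the dataset and return its split sizes (needs network access)."""
    # Imported lazily so the rest of this sketch runs without the library installed.
    from datasets import load_dataset

    dataset = load_dataset(DATASET_ID)  # assumption: the repo loads without extra arguments
    return {name: split.num_rows for name, split in dataset.items()}


# Share of samples in the training split: 1599 of 1999, roughly 80%.
train_share = EXPECTED_SIZES["train"] / sum(EXPECTED_SIZES.values())
print(f"train share: {train_share:.4f}")
```

With network access, `size_mismatches(load_split_sizes())` returns an empty dict when the downloaded splits match the table.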