neil-code/autotrain-data-test-summarization
收藏AutoTrain Dataset for project: test-summarization
数据集描述
该数据集由AutoTrain自动处理,用于项目test-summarization。
语言
数据集的语言BCP-47代码为en。
数据集结构
数据实例
数据集的一个样本如下所示:
json [ { "feat_id": "train_1087", "text": "#Person1#: Hello sir, how can I help you? #Person2#: Yes, I need this prescription please. #Person1#: Lets see. Okay, so 50 mg of Prozac, would you prefer this in capsule or tablet? #Person2#: Capsules are fine. #Person1#: Okay, you should take 1 capsule 3 times a day. Be sure not to take it on an empty stomach, and also, dont ever mix it with alcohol! #Person2#: Yes, I know. Its not the first time Im taking this! Dont worry, I wont overdose! #Person1#: Okay, anything else I can get you? #Person2#: Oh, yes, I almost forgot! Can I also get some eye drops and um, some condoms? #Person1#: Sure. Darn condoms arent registered in our system. #Person2#: Oh, well thats okay, Ill get some later, thanks. . . Really its no problem. #Person1#: Just hang on there a sec. Can I get a price check on Fun Times Ribbed Condoms please!", "target": "#Person1# gives #Person2# #Person2#s prescription. #Person2# also wants some eye drops and some condoms but is told that darn condoms are registered in their system.", "feat_topic": "the pharmacy" }, { "feat_id": "train_1193", "text": "#Person1#: Dont be too sad. If you really think that you have no feeling with him, then, in my opinion, getting divorced maybe is the best way to solve the problem. #Person2#: I know clearly at the bottom of my heart. I just cant set my mind at rest because of the child. Shes little. She cannot understand us and accept such truth. #Person1#: Yeah, child is the matter. Dont tell Jenny the truth, only tell her the white lie. When she grows up, you find the suitable opportunity to tell her. #Person2#: I see. OK. ", "target": "#Person1# suggests #Person2# get divorced if #Person2# has no feeling with a man and tell their daughter the white lie.", "feat_topic": "divorce" } ]
数据集字段
数据集包含以下字段(也称为“特征”):
json { "feat_id": "Value(dtype=string, id=None)", "text": "Value(dtype=string, id=None)", "target": "Value(dtype=string, id=None)", "feat_topic": "Value(dtype=string, id=None)" }
数据集分割
该数据集被分割为训练集和验证集。分割大小如下:
| 分割名称 | 样本数量 |
|---|---|
| train | 1599 |
| valid | 400 |



