five

j23349/autotrain-data-test

收藏
Hugging Face2023-10-28 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/j23349/autotrain-data-test
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是通过AutoTrain为项目test自动处理的,语言为英语(en),任务类别为摘要生成(summarization)。数据实例展示了对话文本及其对应的摘要,数据集字段包括feat_id、text和target。数据集被分割为训练集和验证集,分别包含11785和2947个样本。
提供机构:
j23349
原始信息汇总

数据集描述

语言

数据集的语言代码为 en。

数据集结构

数据实例

数据集中的样本示例如下:

json [ { "feat_id": "13829542", "text": "Kasia: When are u coming back? Matt: Back where? Kasia: Oh come on Kasia: you know what i mean Matt: I really dont Kasia: When are you coming back to Warsaw Matt: I have no idea Matt: maybe around easter Kasia: will you let me know Matt: sure if I know something then I will let you know asap Kasia: ok Matt: are you mad? Kasia: a bit Matt: oh come on Matt: this is not my fault Matt: there is no way that I can answer that question Matt: not now Kasia: Fine", "target": "Matt doesnt know when hes coming back to Warsaw. He might come around Easter. When he knows more, he will let Kasia know. Kasia is a bit upset." }, { "feat_id": "13862523", "text": "Oliver: Have you beaten the game yet? Tom: Not yet Oliver: Ok... what mission are you playing? Tom: The one before the final one, its pretty hard Oliver: I didnt find it particularly hard Tom: I mean, combat is easy at this point in the game but the puzzles are difficult Oliver: Ok, I got it Tom: Its fun how most horror action games let you get really powerful by the end of the story Oliver: Well, you know, being a pussy from start to finish aint my idea of fun even in a horror game Tom: I know... but do you remember Bioshock? You turned into some sort of superhero and the final boss was pretty much a joke Oliver: Dont you dare talk like that about my favorite game Tom: I know, I love it too, but it had its flaws Oliver: No it didnt XD Tom: Lol Oliver: Well, keep playing Tom: Ill let you know when I finish this one and my overall impressions Oliver: Ok", "target": "Tom will contact Oliver after finishing new horror action game." } ]

数据集字段

数据集包含以下字段:

json { "feat_id": "Value(dtype=string, id=None)", "text": "Value(dtype=string, id=None)", "target": "Value(dtype=string, id=None)" }

数据集分割

数据集分为训练集和验证集,分割大小如下:

分割名称 样本数量
train 11785
valid 2947
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作