j23349/autotrain-data-test
收藏数据集描述
语言
数据集的语言代码为 en。
数据集结构
数据实例
数据集中的样本示例如下:
json [ { "feat_id": "13829542", "text": "Kasia: When are u coming back? Matt: Back where? Kasia: Oh come on Kasia: you know what i mean Matt: I really dont Kasia: When are you coming back to Warsaw Matt: I have no idea Matt: maybe around easter Kasia: will you let me know Matt: sure if I know something then I will let you know asap Kasia: ok Matt: are you mad? Kasia: a bit Matt: oh come on Matt: this is not my fault Matt: there is no way that I can answer that question Matt: not now Kasia: Fine", "target": "Matt doesnt know when hes coming back to Warsaw. He might come around Easter. When he knows more, he will let Kasia know. Kasia is a bit upset." }, { "feat_id": "13862523", "text": "Oliver: Have you beaten the game yet? Tom: Not yet Oliver: Ok... what mission are you playing? Tom: The one before the final one, its pretty hard Oliver: I didnt find it particularly hard Tom: I mean, combat is easy at this point in the game but the puzzles are difficult Oliver: Ok, I got it Tom: Its fun how most horror action games let you get really powerful by the end of the story Oliver: Well, you know, being a pussy from start to finish aint my idea of fun even in a horror game Tom: I know... but do you remember Bioshock? You turned into some sort of superhero and the final boss was pretty much a joke Oliver: Dont you dare talk like that about my favorite game Tom: I know, I love it too, but it had its flaws Oliver: No it didnt XD Tom: Lol Oliver: Well, keep playing Tom: Ill let you know when I finish this one and my overall impressions Oliver: Ok", "target": "Tom will contact Oliver after finishing new horror action game." } ]
数据集字段
数据集包含以下字段:
json { "feat_id": "Value(dtype=string, id=None)", "text": "Value(dtype=string, id=None)", "target": "Value(dtype=string, id=None)" }
数据集分割
数据集分为训练集和验证集,分割大小如下:
| 分割名称 | 样本数量 |
|---|---|
| train | 11785 |
| valid | 2947 |



