breadlicker45/autotrain-data-yahoo-answer-small
收藏AutoTrain Dataset for project: yahoo-answer-small
数据集描述
该数据集为AutoTrain自动处理的项目“yahoo-answer-small”所用。
语言
数据集的语言BCP-47代码为unk。
数据集结构
数据实例
数据集样本示例如下:
json [ { "text": "how do you get a girl to like you? and how can you make her your girlfriend?", "target": "Be yourself. Its the oldest and best advice. She may still not like you, but thats the risk, and you keep your manliness and dignity. Never, never forget this." }, { "text": "how long is a bacteriums life?", "target": "It depends on the bacterium. For E. coli (common lab bacteria) 20-30 minutes is an average doubling time, but different strains vary.\n\nI heard something somewhere about a weird form of bacteria that lives miles underground in granite formations and only divides once every ten thousand years, or something crazy like that. I cant give you a source, its just a freaky thing off the top of my head that I havent gone to the trouble to confirm. My contention is that if reincarnation is true, that would be the worst thing to come back as. So mind your karma.\n\nPart of the reason it would be the worst is that bacteria reproduce by one cell dividing into two, so as long as there are any of that strain still alive, it hasnt really died.\n\nThey can die though. In a liquid culture you can tell because of a lot of turbidity (cloudiness) some of which is from cells and some from debris from dead cells. Or, you could have agar plates that get really nasty and dried up, and most of those bacteria are probably dead. Or you could tell because they look really crappy under a microscope. If you try to streak it and grow it on a plate and it doesnt grow, its probably dead." } ]
数据集字段
数据集包含以下字段(特征):
json { "text": "Value(dtype=string, id=None)", "target": "Value(dtype=string, id=None)" }
数据集分割
数据集分为训练集和验证集,分割大小如下:
| 分割名称 | 样本数量 |
|---|---|
| 训练集 | 2399 |
| 验证集 | 600 |



