ridenight/autotrain-data-vape-chat
AutoTrain Dataset for project: vape-chat
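Dataset Description
This dataset has been automatically processed by AutoTrain for the vape-chat project.
Languages
The BCP-47 code for the dataset's language is en.
Dataset Structure
Data Instances
A sample from this dataset looks as follows: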
```json
[
  {
    "context": "\u2020###Human: generate a help prompt for Whats in vape juice?\n###Assistant: Vape juice, or e-liquid, typically contains propylene glycol (PG), vegetable glycerin (VG), nicotine, and flavorings. However, there are nicotine-free versions available.",
    "question": "Whats in vape juice?",
    "answers.text": [
      "Vape juice, or e-liquid, typically contains propylene glycol (PG), vegetable glycerin (VG), nicotine, and flavorings. However, there are nicotine-free versions available."
    ],
    "answers.answer_start": [
      75
    ]
  },
  {
    "context": "\u2020###Human: generate a help prompt for Whats in vape juice? (version 6)\n###Assistant: Vape juice, or e-liquid, typically contains propylene glycol (PG), vegetable glycerin (VG), nicotine, and flavorings. However, there are nicotine-free versions available. (version 6)",
    "question": "Whats in vape juice? (version 6)",
    "answers.text": [
      "Vape juice, or e-liquid, typically contains propylene glycol (PG), vegetable glycerin (VG), nicotine, and flavorings. However, there are nicotine-free versions available. (version 6)"
    ],
    "answers.answer_start": [
      87
    ]
  }
]
```
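The `answers.answer_start` values in the sample can be checked against the context with a short script. This is a minimal sketch, not part of the AutoTrain tooling: the helper name `answer_offsets` is hypothetical, and it assumes the garbled `u2020` prefix is the escape `\u2020` (the dagger character) and that the two turns are separated by a single `\n`.

```python
from typing import Tuple


def answer_offsets(context: str, answer: str) -> Tuple[int, int]:
    """Return (character offset, UTF-8 byte offset) of `answer` inside `context`."""
    char_idx = context.find(answer)
    if char_idx < 0:
        raise ValueError("answer not found in context")
    # Bytes needed to encode everything that precedes the answer.
    byte_idx = len(context[:char_idx].encode("utf-8"))
    return char_idx, byte_idx


answer = (
    "Vape juice, or e-liquid, typically contains propylene glycol (PG), "
    "vegetable glycerin (VG), nicotine, and flavorings. "
    "However, there are nicotine-free versions available."
)
# Assumption: the context starts with U+2020 and uses one "\n" between turns.
context = (
    "\u2020###Human: generate a help prompt for Whats in vape juice?\n"
    "###Assistant: " + answer
)

char_idx, byte_idx = answer_offsets(context, answer)
print(char_idx, byte_idx)
```

Under these assumptions the character offset is 73 and the UTF-8 byte offset is 75. The listed `answer_start` of 75 therefore appears to count bytes rather than characters, since the dagger takes three bytes in UTF-8. The card does not confirm this, so treat it as an observation.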
Dataset Fields
The dataset has the following fields (also called "features"):
```json
{
  "context": "Value(dtype=string, id=None)",
  "question": "Value(dtype=string, id=None)",
  "answers.text": "Sequence(feature=Value(dtype=string, id=None), length=-1, id=None)",
  "answers.answer_start": "Sequence(feature=Value(dtype=int32, id=None), length=-1, id=None)"
}
```
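To make the field types concrete, the following sketch checks a record against the listing above using only the standard library. The function `validate_record` is a hypothetical helper written for illustration. It treats `Sequence(..., length=-1)` as a list of any length and `int32` as an integer in the signed 32-bit range.

```python
from typing import Any, Dict, List


def validate_record(record: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in `record`; an empty list means it conforms."""
    errors: List[str] = []

    # "context" and "question" are plain strings.
    for key in ("context", "question"):
        if not isinstance(record.get(key), str):
            errors.append(f"{key} must be a string")

    # "answers.text" is a variable-length sequence of strings.
    texts = record.get("answers.text")
    if not (isinstance(texts, list) and all(isinstance(t, str) for t in texts)):
        errors.append("answers.text must be a list of strings")

    # "answers.answer_start" is a variable-length sequence of int32 values.
    starts = record.get("answers.answer_start")
    starts_ok = isinstance(starts, list) and all(
        isinstance(s, int) and not isinstance(s, bool) and -2**31 <= s < 2**31
        for s in starts
    )
    if not starts_ok:
        errors.append("answers.answer_start must be a list of int32 values")

    return errors


good = {
    "context": "###Human: hi\n###Assistant: hello",
    "question": "hi",
    "answers.text": ["hello"],
    "answers.answer_start": [27],
}
bad = dict(good, **{"answers.answer_start": ["27"]})

print(validate_record(good))
print(validate_record(bad))
```

The `good` and `bad` records here are made-up inputs for the check, not rows from the dataset.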
Dataset Splits
This dataset is split into a train and a validation split. The split sizes are as follows:
| Split name | Number of samples |
|---|---|
| train | 64 |
| valid | 16 |
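As a quick arithmetic check on the table, the snippet below derives the total sample count and each split's share. The variable names are illustrative only.

```python
# Split sizes copied from the table above.
splits = {"train": 64, "valid": 16}

total = sum(splits.values())
fractions = {name: count / total for name, count in splits.items()}

print(total)      # 80
print(fractions)  # {'train': 0.8, 'valid': 0.2}
```

The 80 samples are divided 80/20 between training and validation.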



