NagasriramKochetti/TAX_ISSUES
AutoTrain Dataset for project: tax_issues
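Dataset Description
This is a text classification dataset that has been automatically processed for the project tax_issues.
Languages
The BCP-47 code for the dataset's language is unk.
Dataset Structure
Data Instances
A sample from this dataset looks as follows: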
```json
[
  {
    "text": "How is Inheritance Tax calculated?",
    "target": 10,
    "feat_Unnamed: 2": null,
    "feat_Unnamed: 3": null,
    "feat_Unnamed: 4": null
  },
  {
    "text": "What happens if I work part-time or have multiple jobs as an international student in the UK?",
    "target": 13,
    "feat_Unnamed: 2": null,
    "feat_Unnamed: 3": null,
    "feat_Unnamed: 4": null
  }
]
```
Dataset Fields
The dataset has the following fields:
```json
{
  "text": "Value(dtype=string, id=None)",
  "target": "ClassLabel(names=[Question1, Question10, Question11, Question12, Question13, Question14, Question15, Question16, Question17, Question18, Question19, Question2, Question20, Question21, Question22, Question23, Question24, Question25, Question26, Question27, Question28, Question29, Question3, Question30, Question31, Question32, Question33, Question34, Question35, Question36, Question37, Question38, Question39, Question4, Question40, Question41, Question42, Question43, Question44, Question45, Question46, Question47, Question49, Question5, Question50, Question6, Question7, Question8, Question9, question48], id=None)",
  "feat_Unnamed: 2": "Value(dtype=float64, id=None)",
  "feat_Unnamed: 3": "Value(dtype=float64, id=None)",
  "feat_Unnamed: 4": "Value(dtype=float64, id=None)"
}
```
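Because the `ClassLabel` names are sorted as strings rather than numerically, an integer `target` is an index into that list and not a question number. The following is a minimal sketch of the lookup, assuming the usual `ClassLabel` convention that the integer is the position in `names`; `LABEL_NAMES` and `label_name` are illustrative names introduced here, not part of the dataset.

```python
# Label names copied from the "target" ClassLabel definition, in the same order.
# The order is lexicographic on the strings, and the lowercase "question48"
# sorts after every capitalised "Question..." name.
LABEL_NAMES = [
    "Question1", "Question10", "Question11", "Question12", "Question13",
    "Question14", "Question15", "Question16", "Question17", "Question18",
    "Question19", "Question2", "Question20", "Question21", "Question22",
    "Question23", "Question24", "Question25", "Question26", "Question27",
    "Question28", "Question29", "Question3", "Question30", "Question31",
    "Question32", "Question33", "Question34", "Question35", "Question36",
    "Question37", "Question38", "Question39", "Question4", "Question40",
    "Question41", "Question42", "Question43", "Question44", "Question45",
    "Question46", "Question47", "Question49", "Question5", "Question50",
    "Question6", "Question7", "Question8", "Question9", "question48",
]


def label_name(target: int) -> str:
    """Return the class name for an integer target (hypothetical helper)."""
    return LABEL_NAMES[target]


# The two sample rows use targets 10 and 13.
print(label_name(10))  # Question19
print(label_name(13))  # Question21
```

Under that assumption, the sample question about Inheritance Tax (`target` 10) belongs to the class `Question19`, not `Question10`.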
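Dataset Splits
The dataset is split into a training set and a validation set. The split sizes are as follows:
| Split name | Number of samples |
|---|---|
| Training | 2377 |
| Validation | 622 |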
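
As a quick check on the split sizes, the sketch below computes the total and the share held by each split. The dictionary keys are illustrative labels chosen here and are not necessarily the split identifiers used in the dataset files.

```python
# Split sizes taken from the table; the keys are illustrative labels only.
SPLIT_SIZES = {"training": 2377, "validation": 622}

total = sum(SPLIT_SIZES.values())
fractions = {name: count / total for name, count in SPLIT_SIZES.items()}

print(total)  # 2999
for name, share in fractions.items():
    print(f"{name}: {share:.1%}")
```

This works out to roughly a 79% / 21% split between training and validation.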



