Stojke42/multilang_sst
收藏Hugging Face2023-12-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Stojke42/multilang_sst
下载链接
链接失效反馈官方服务:
资源简介:
global_label={0:'Negative', 1:'Positive',2:'Neutral'}
label2id_custom={0: 2, 1: 0, 2: 1}
train_label = [label2id_custom[x] for x in train_label]
dev_label = [label2id_custom[x] for x in dev_label]
本数据集的标签映射与转换流程如下:
1. 定义全局标签字典(global_label):`{0:'Negative(负面标签)', 1:'Positive(正面标签)',2:'Neutral(中性标签)'}`
2. 构建自定义标签ID映射字典(label2id_custom):`{0: 2, 1: 0, 2: 1}`
3. 采用列表推导式对训练集标签进行批量转换:`train_label = [label2id_custom[x] for x in train_label]`,即遍历原训练集标签的每个元素,通过自定义映射字典替换为对应的新标签ID
4. 采用完全相同的列表推导式对开发集标签进行批量转换:`dev_label = [label2id_custom[x] for x in dev_label]`
提供机构:
Stojke42
原始信息汇总
数据集标签映射
全局标签定义
- 0: Negative
- 1: Positive
- 2: Neutral
自定义标签映射
- 0: 2
- 1: 0
- 2: 1
训练和验证标签转换
- 训练标签(train_label)和验证标签(dev_label)根据自定义标签映射进行转换。



