balzanilo/dialogs-mtl-dataset
Dialogue Emotions and Triggers Dataset
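Dataset Overview
The Dialogue Emotions and Triggers Dataset is a collection of dialogue exchanges annotated with emotions and triggers. Each instance contains the utterances, the speaker-prefixed utterances, and the corresponding emotion and trigger labels.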
Dataset Information
- Language: English
- License: MIT
- Size category: 1K<n<10K
- Task category: text classification
- Pretty name: Emotions/Triggers in multi-speaker dialogues
Features
- speakers_utterances: sequence of string
- emotions: sequence of int64
- triggers: sequence of float64
Splits
- train: 1956813 bytes, 3187 instances
- test: 270767 bytes, 419 instances
- eval: 239586 bytes, 394 instances
Size
- Download size: 644361 bytes
- Dataset size: 2467166 bytes
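The split and size figures listed above can be cross-checked with a few lines of Python. This is a minimal sketch that only re-uses the numbers from this card; the variable names are illustrative.

```python
# Split statistics copied from the card: (bytes, number of instances).
splits = {
    "train": (1956813, 3187),
    "test": (270767, 419),
    "eval": (239586, 394),
}

total_bytes = sum(num_bytes for num_bytes, _ in splits.values())
total_examples = sum(count for _, count in splits.values())

# Fraction of all instances held by each split.
shares = {name: count / total_examples for name, (_, count) in splits.items()}

print(total_bytes)     # 2467166, equal to the reported dataset size
print(total_examples)  # 4000, inside the 1K<n<10K size category
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

The per-split byte counts add up to the reported dataset size, and the split is roughly 80/10/10 across train, test, and eval.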
Configuration
- config_name: default
- data_files:
  - train: data/train-*
  - test: data/test-*
  - eval: data/eval-*
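Given the single default configuration and its three splits, loading the data might look like the sketch below. It assumes the repository is hosted on the Hugging Face Hub under the ID shown at the top of this card and that the `datasets` library is installed; `load_dialogs` and the two constants are hypothetical helper names, not part of the dataset.

```python
REPO_ID = "balzanilo/dialogs-mtl-dataset"
SPLITS = ("train", "test", "eval")


def load_dialogs(split=None):
    """Load one split, or every split when `split` is None."""
    if split is not None and split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # Assumption: the repository is reachable on the Hugging Face Hub.
    return load_dataset(REPO_ID, split=split)
```

Calling `load_dialogs("train")` would return the training split, while `load_dialogs()` would return all three splits keyed by name. Note that the held-out split is named `eval` here rather than the more common `validation`, which the helper rejects explicitly.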
Tags
- emotion_classification
- trigger_detection
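Column Information
- Utterances: the text of each utterance in the dialogue.
- Speaker Utterances: each utterance prefixed with the speaker's name.
- Emotions: one integer per utterance giving its emotion, encoded as follows:
  - 0: neutral
  - 1: surprise
  - 2: fear
  - 3: sadness
  - 4: joy
  - 5: disgust
  - 6: anger
- Triggers: indicates whether the corresponding utterance triggers an emotion change; 1 means trigger, 0 means no trigger.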
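The integer encoding above can be turned into readable labels with a small lookup table. The following is a minimal sketch; `EMOTION_LABELS`, `decode_emotions`, and `trigger_flags` are illustrative names, and the English label strings are simply the names from the list above.

```python
# Emotion codes as listed in the column description.
EMOTION_LABELS = {
    0: "neutral",
    1: "surprise",
    2: "fear",
    3: "sadness",
    4: "joy",
    5: "disgust",
    6: "anger",
}


def decode_emotions(ids):
    """Map a sequence of emotion codes to their label names."""
    return [EMOTION_LABELS[i] for i in ids]


def trigger_flags(triggers):
    """Convert 0/1 trigger values (stored as floats) to booleans."""
    return [t == 1 for t in triggers]


print(decode_emotions([0, 4, 1]))     # ['neutral', 'joy', 'surprise']
print(trigger_flags([0.0, 1.0, 0.0]))  # [False, True, False]
```

An unknown code raises `KeyError`, which is usually preferable to silently mapping it to a default label.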
Example Instance
Utterances
[
  "Hey.",
  "Hey!",
  "So how was Joan?",
  "I broke up with her.",
  "Dont tell me, because of the big nostril thing?",
  "They were huge. When she sneezed, bats flew out of them.",
  "Come on, they were not that huge.",
  "Im tellin you, she leaned back; I could see her brain.",
  "How many perfectly fine women are you gonna reject over the most superficial insignificant things?"
]
Speaker Utterances
[
  "Chandler: Hey.",
  "All: Hey!",
  "Monica: So how was Joan?",
  "Chandler: I broke up with her.",
  "Ross: Dont tell me, because of the big nostril thing?",
  "Chandler: They were huge. When she sneezed, bats flew out of them.",
  "Rachel: Come on, they were not that huge.",
  "Chandler: Im tellin you, she leaned back; I could see her brain.",
  "Monica: How many perfectly fine women are you gonna reject over the most superficial insignificant things?"
]
Emotions
[0, 4, 0, 0, 1, 5, 0, 5, 1]
Triggers
[0, 0, 0, 0, 0, 0, 0, 1, 0]
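The parallel lists in this instance line up position by position, so they can be zipped into one row per utterance. The sketch below uses the example values verbatim and assumes that the speaker name is separated from the text by the first ": ", as it is in every line of this example; `split_speaker` is a hypothetical helper.

```python
speaker_utterances = [
    "Chandler: Hey.",
    "All: Hey!",
    "Monica: So how was Joan?",
    "Chandler: I broke up with her.",
    "Ross: Dont tell me, because of the big nostril thing?",
    "Chandler: They were huge. When she sneezed, bats flew out of them.",
    "Rachel: Come on, they were not that huge.",
    "Chandler: Im tellin you, she leaned back; I could see her brain.",
    "Monica: How many perfectly fine women are you gonna reject over "
    "the most superficial insignificant things?",
]
emotions = [0, 4, 0, 0, 1, 5, 0, 5, 1]
triggers = [0, 0, 0, 0, 0, 0, 0, 1, 0]


def split_speaker(line):
    """Split 'Speaker: text' at the first ': ' (assumed separator)."""
    speaker, _, text = line.partition(": ")
    return speaker, text


# One (speaker, text, emotion code, is_trigger) tuple per utterance.
rows = [
    (*split_speaker(line), emotion, bool(trigger))
    for line, emotion, trigger in zip(speaker_utterances, emotions, triggers)
]

# Keep only the utterances flagged as triggers.
trigger_rows = [row for row in rows if row[3]]
print(trigger_rows)
```

In this instance the only flagged utterance is Chandler's eighth line, which carries emotion code 5 (disgust).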



