five

Hieubeo23/UIT-VSMEC

收藏
Hugging Face2026-04-19 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Hieubeo23/UIT-VSMEC
下载链接
链接失效反馈
官方服务:
资源简介:
--- task_categories: - text-classification language: - vi size_categories: - 1K<n<10K --- # Introduction Emotion recognition is a higher approach or special case of sentiment analysis. In this task, the result is not produced in terms of either polarity: positive or negative or in the form of rating (from 1 to 5) but of a more detailed level of sentiment analysis in which the result are depicted in more expressions like sadness, enjoyment, anger, disgust, fear and surprise. Emotion recognition plays a critical role in measuring brand value of a product by recognizing specific emotions of customers’ comments. In this study, we have achieved two targets. First and foremost, we built a standard Vietnamese Social Media Emotion Corpus (UIT-VSMEC) with about 6,927 human-annotated sentences with six emotion labels, contributing to emotion recognition research in Vietnamese which is a low-resource language in Natural Language Processing (NLP). Secondly, we assessed and measured machine learning and deep neural network models on our UIT-VSMEC. As a result, Convolutional Neural Network (CNN) model achieved the highest performance with 57.61% of F1-score.

--- 任务类别: - 文本分类(text-classification) 语言: - 越南语 样本量范围: - 1000 < 样本数 < 10000 --- # 引言 情感识别(Emotion Recognition)属于情感分析(Sentiment Analysis)的高阶研究路径或特殊子类。在该任务中,模型输出结果并非仅基于情感极性(积极或消极)或评分(1至5分)的形式,而是属于更细粒度的情感分析范畴,其结果以诸如悲伤、愉悦、愤怒、厌恶、恐惧与惊讶等多种情感表达形式呈现。通过识别用户评论中的特定情感,情感识别在衡量产品品牌价值方面具有关键作用。 本研究达成了两项核心目标:其一,构建了标准越南语社交媒体情感语料库(Vietnamese Social Media Emotion Corpus, UIT-VSMEC),该语料库包含约6927条经人工标注的句子,涵盖6种情感标签,可为属于低资源自然语言处理(Natural Language Processing, NLP)范畴的越南语情感识别研究提供支撑;其二,在UIT-VSMEC语料库上对机器学习与深度神经网络模型开展了性能评估与测试,最终卷积神经网络(Convolutional Neural Network, CNN)以57.61%的F1分数取得了最优性能。
提供机构:
Hieubeo23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作