Bengali & Banglish: A Monolingual Dataset for Emotion Detection in Linguistically Diverse Contexts
收藏Mendeley Data2024-05-27 更新2024-06-26 收录
下载链接:
https://data.mendeley.com/datasets/4dnrwbxt8n
下载链接
链接失效反馈官方服务:
资源简介:
This dataset, positioned at the intersection of Bengali and Banglish (an English-character variant of Bengali), is a valuable resource for emotion detection. It encompasses a total of 80,098 data entries, comprising both languages. The dataset is organized into six distinct emotional categories: anger (15,179), disgust (13,098), fear (7,565), joy (17,836), sadness (16,309), and surprise (10,107), aligning with Ekman's six basic emotions framework. Sourced from platforms such as EmoNoBa, UBMEC, MONOVAB, and comments from YouTube and Twitter posts, it offers a diverse and rich dataset for research and analysis. Moreover, given its bilingual nature, this data also holds relevance for neural machine translation tasks.
创建时间:
2024-04-27



