Sentiment Lexicons for 81 Languages 情感词典,支持81种语言
收藏阿里云天池2026-06-09 更新2024-03-07 收录
下载链接:
https://tianchi.aliyun.com/dataset/89881
下载链接
链接失效反馈官方服务:
资源简介:
情感分析是自动检测一段文字是肯定还是否定的任务,通常依赖于手写的带有积极情绪(好,好,很棒)和消极情绪(坏,严重,糟糕)的单词列表。该数据集包含用于81种语言的正面和负面情感词典。该数据集中的情感词典是通过基于知识图谱的图传播而生成的——知识图谱是现实世界实体及其之间链接的图形表示。通常的直觉是,在知识图谱上紧密链接的单词可能具有相似的情感极性。在这个项目中,情感是基于英语情感词汇生成的。
Sentiment analysis is the task of automatically detecting whether a given text conveys positive or negative sentiment. It typically relies on hand-curated word lists containing positive emotional terms (e.g., 'good', 'great', 'awesome') and negative emotional terms (e.g., 'bad', 'severe', 'terrible'). This dataset provides positive and negative sentiment lexicons for 81 languages. The sentiment lexicons within this dataset are generated through graph propagation based on knowledge graphs—where a knowledge graph is a graphical representation of real-world entities and the interconnections between them. A core intuition guiding this approach is that words tightly linked on a knowledge graph tend to exhibit similar sentiment polarities. In this project, the sentiment lexicons are constructed using English sentiment vocabulary.
提供机构:
阿里云天池
创建时间:
2021-02-01
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集提供了覆盖81种语言的正面与负面情感词典。这些情感词汇是通过基于知识图谱的图传播方法生成的,其原理是知识图谱中关联紧密的词语往往具有相似的情感极性。整个生成过程以英语情感词汇为基础。
以上内容由遇见数据集搜集并总结生成



