TUKE-KEMT/senti-sk
收藏Hugging Face2025-03-07 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/TUKE-KEMT/senti-sk
下载链接
链接失效反馈官方服务:
资源简介:
SentiSK是一个来自斯洛伐克社交媒体的情感分析数据集,包含来自Facebook的34,006条手工标注的评论。数据集分为负面、中性、正面三种情感标签,分别有20,668条、9,581条和3,779条评论。这些评论由科希策技术大学的博士生和助理教授进行标注,标注者的母语为斯洛伐克语。每个评论只被标注一次,标注基于标注者的个人意见。数据集共有34,028个句子,平均句子长度为10.45个单词。
SentiSK is a sentiment analysis dataset from Slovak social media, containing 34,006 manually annotated comments from Facebook. The dataset is divided into three sentiment labels: negative, neutral, and positive, with 20,668, 9,581, and 3,779 comments respectively. The comments were annotated by PhD students and assistant professors from the Technical University of Košice, whose native language is Slovak. Each comment was annotated only once, and the labeling is based on the personal opinion of the annotator. The dataset consists of 34,028 sentences with an average sentence length of 10.45 words.
提供机构:
TUKE-KEMT



