NLPCC Weibo Dataset

Name: NLPCC Weibo Dataset
Creator: Science Data Bank
Published: 2025-04-27 13:09:06
License: 暂无描述

DataCite Commons2025-04-27 更新2025-04-16 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=7f8d2c5d0ad84f5aa99fc8a611b62f1f

下载链接

链接失效反馈

官方服务：

资源简介：

The fine-grained NLPCC Weibo dataset includes the NLPCC2013 Task 2 Weibo Sentence Emotion Dataset and the NLPCC2014 Task 1 Emotion Dataset. The NLPCC2013 Task 2 Weibo Sentence Emotion Dataset consists of 8 types of sentiment labels: None, Sadness, Like, Anger, Happiness, Distinct, Fear, Surprise. It includes a total of 4000 training sets and 10000 testing sets. During the experiment, this article merged the two datasets into one dataset and re divided them according to the ratio of 8:2 between the training and testing sets. The NLPCC2014 Task 1 sentiment dataset consists of 8 types of sentiment labels: None, Sadness, Like, Anger, Happiness, Distust, Wear, Surprise, and includes a total of 14000 training sets and 6000 testing sets.

细粒度NLPCC微博数据集包含NLPCC2013任务2微博语句情感数据集与NLPCC2014任务1情感数据集。其中，NLPCC2013任务2微博语句情感数据集共涵盖8类情感标签：无情感（None）、悲伤（Sadness）、喜爱（Like）、愤怒（Anger）、愉悦（Happiness）、明确（Distinct）、恐惧（Fear）与惊讶（Surprise），总计包含4000条训练样本与10000条测试样本。本研究在实验过程中将两个数据集合并为统一数据集，并按照8:2的比例重新划分训练集与测试集。NLPCC2014任务1情感数据集共涵盖8类情感标签：无情感（None）、悲伤（Sadness）、喜爱（Like）、愤怒（Anger）、愉悦（Happiness）、不信任（Distust）、疲惫（Wear）与惊讶（Surprise），总计包含14000条训练样本与6000条测试样本。

提供机构：

Science Data Bank

创建时间：

2023-06-27

搜集汇总

数据集介绍