five

LABR

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/mohamedadaly/labr
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集名为LABR,包含了63,257条针对2,131本图书的评论,这些评论来自16,486名用户,评分范围在1到5之间。其中,评分为4和5的评论被视为积极情感,而评分为1和2的评论被视为消极情感。评分为3的评论被认为是中性的,因此被从数据集中剔除。此外,该数据集存在不平衡性,积极评论有42,832条,消极评论有8,224条。在进行实验时,有40,844条评论用于训练,10,212条用于测试。该数据集规模较大,适用于情感分析任务。

This dataset is named LABR. It contains 63,257 reviews about 2,131 books, submitted by 16,486 unique users, with ratings ranging from 1 to 5. Reviews with ratings of 4 and 5 are categorized as positive sentiment, while those with ratings of 1 and 2 are regarded as negative sentiment. Reviews with a rating of 3 are deemed neutral and thus excluded from the dataset. Furthermore, this dataset suffers from class imbalance, with 42,832 positive reviews and 8,224 negative reviews. For experimental purposes, 40,844 reviews are used for training, and 10,212 reviews are reserved for testing. Owing to its considerable scale, this dataset is suitable for sentiment analysis tasks.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作