AlexSham/amazon_cells_filtered

Name: AlexSham/amazon_cells_filtered
Creator: AlexSham
Published: 2024-03-24 18:49:26
License: 暂无描述

Hugging Face2024-03-24 更新2024-06-11 收录

下载链接：

https://hf-mirror.com/datasets/AlexSham/amazon_cells_filtered

下载链接

链接失效反馈

官方服务：

资源简介：

https://archive.ics.uci.edu/dataset/331/sentiment+labelled+sentences This dataset was created for the Paper 'From Group to Individual Labels using Deep Features', Kotzias et. al,. KDD 2015 Please cite the paper if you want to use it :) It contains sentences labelled with positive or negative sentiment, extracted from reviews of products, movies, and restaurants ======= Format: ======= sentence \t score \n ======= Details: ======= Score is either 1 (for positive) or 0 (for negative) The sentences come from three different websites/fields: imdb.com amazon.com yelp.com For each website, there exist 500 positive and 500 negative sentences. Those were selected randomly for larger datasets of reviews. We attempted to select sentences that have a clearly positive or negative connotaton, the goal was for no neutral sentences to be selected. For the full datasets look: imdb: Maas et. al., 2011 'Learning word vectors for sentiment analysis' amazon: McAuley et. al., 2013 'Hidden factors and hidden topics: Understanding rating dimensions with review text' yelp: Yelp dataset challenge http://www.yelp.com/dataset_challenge

提供机构：

AlexSham

原始信息汇总

数据集概述

数据集来源

该数据集是为论文 From Group to Individual Labels using Deep Features, Kotzias et. al,. KDD 2015 创建的。

数据内容

包含从产品、电影和餐厅评论中提取的带有情感标签的句子。
情感标签分为正面（1）和负面（0）。

数据格式

数据格式为：sentence score

数据详情

句子来源于以下三个网站：
- imdb.com
- amazon.com
- yelp.com
每个网站各有500条正面和500条负面句子，这些句子是从更大的评论数据集中随机选取的。
选取的句子旨在具有明确的正面或负面含义，避免包含中性句子。

引用信息

若使用此数据集，请引用上述论文。

5,000+

优质数据集

54 个

任务类型

进入经典数据集