SST-2 Dataset

Name: SST-2 Dataset
Creator: paperswithcode.com
License: No description available

Indexed from paperswithcode.com on 2025-03-25

Download link:

https://paperswithcode.com/dataset/sst-2

Resource overview:

The Stanford Sentiment Treebank is a corpus with fully labeled parse trees that allows for a complete analysis of the compositional effects of sentiment in language. The corpus is based on the dataset introduced by Pang and Lee (2005) and consists of 11,855 single sentences extracted from movie reviews. It was parsed with the Stanford parser and includes a total of 215,154 unique phrases from those parse trees, each annotated by 3 human judges. Binary classification experiments on full sentences (negative or somewhat negative vs. somewhat positive or positive, with neutral sentences discarded) refer to the dataset as SST-2 or SST binary.
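
The binary setup described above (merge the two negative classes, merge the two positive classes, discard neutral) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the five label strings are taken from the wording of the description, the helper names `to_binary` and `binarize` are hypothetical, and the toy sentences are invented for the example. The distributed files may encode sentiment differently.

```python
from typing import Iterable, List, Optional, Tuple

# Assumed label names, borrowed from the wording of the description;
# the real distribution files may use a different encoding.
NEGATIVE_LABELS = {"negative", "somewhat negative"}
POSITIVE_LABELS = {"somewhat positive", "positive"}
NEUTRAL_LABEL = "neutral"


def to_binary(label: str) -> Optional[int]:
    """Map a five-way label to 0 (negative) or 1 (positive); None means neutral."""
    if label in NEGATIVE_LABELS:
        return 0
    if label in POSITIVE_LABELS:
        return 1
    if label == NEUTRAL_LABEL:
        return None
    raise ValueError(f"unknown sentiment label: {label!r}")


def binarize(examples: Iterable[Tuple[str, str]]) -> List[Tuple[str, int]]:
    """Keep only non-neutral sentences and attach their binary label."""
    result = []
    for sentence, label in examples:
        binary = to_binary(label)
        if binary is not None:  # neutral sentences are discarded
            result.append((sentence, binary))
    return result


# Invented toy data, not taken from the corpus.
toy_examples = [
    ("a charming and often affecting journey", "positive"),
    ("the plot is thin but watchable", "neutral"),
    ("somewhat tedious in the middle", "somewhat negative"),
]
binary_examples = binarize(toy_examples)
```

Running `binarize` on the three toy sentences keeps two of them: the neutral one is dropped, and the remaining pair carries labels 1 and 0.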


Provider:

paperswithcode.com

