jsy52/sst2
收藏Hugging Face2025-12-17 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/jsy52/sst2
下载链接
链接失效反馈官方服务:
资源简介:
斯坦福情感树库是一个包含完全标注的解析树结构的语料库,用于全面分析语言中情感的组成效应。该语料库基于Pang和Lee(2005年)引入的数据集,包含从电影评论中提取的11,855个单句。这些句子由斯坦福解析器解析,并包含来自这些解析树的215,154个独特短语,每个短语由3名人类评委进行标注。在完整句子上进行的二元分类实验(负面或有些负面 vs 有些正面或正面,中性句子被丢弃)将该数据集称为SST-2或SST二元分类。
The Stanford Sentiment Treebank is a corpus with fully labeled parse trees that allows for a complete analysis of the compositional effects of sentiment in language. The corpus is based on the dataset introduced by Pang and Lee (2005) and consists of 11,855 single sentences extracted from movie reviews. It was parsed with the Stanford parser and includes a total of 215,154 unique phrases from those parse trees, each annotated by 3 human judges. Binary classification experiments on full sentences (negative or somewhat negative vs somewhat positive or positive with neutral sentences discarded) refer to the dataset as SST-2 or SST binary.
提供机构:
jsy52



