eraser-benchmark/movie_rationales
收藏数据集概述
基本信息
- 数据集名称: MovieRationales
- 语言: 英语
- 许可证: 未知
- 多语言性: 单语种
- 数据集大小: 1K<n<10K
- 源数据集: 原始数据
- 任务类别: 文本分类
- 任务ID: 情感分类
数据结构
数据字段
review: 字符串类型,表示电影评论。label: 分类标签,可能的值包括NEG(0) 和POS(1)。evidences: 字符串列表,表示评论的证据。
数据分割
- 训练集: 1600 条数据
- 验证集: 200 条数据
- 测试集: 199 条数据
数据示例
json { "evidences": ["Fun movie"], "label": 1, "review": "Fun movie " }
引用信息
bibtex @inproceedings{deyoung-etal-2020-eraser, title = "{ERASER}: {A} Benchmark to Evaluate Rationalized {NLP} Models", author = "DeYoung, Jay and Jain, Sarthak and Rajani, Nazneen Fatema and Lehman, Eric and Xiong, Caiming and Socher, Richard and Wallace, Byron C.", booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics", month = jul, year = "2020", address = "Online", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2020.acl-main.408", doi = "10.18653/v1/2020.acl-main.408", pages = "4443--4458", } @InProceedings{zaidan-eisner-piatko-2008:nips, author = {Omar F. Zaidan and Jason Eisner and Christine Piatko}, title = {Machine Learning with Annotator Rationales to Reduce Annotation Cost}, booktitle = {Proceedings of the NIPS*2008 Workshop on Cost Sensitive Learning}, month = {December}, year = {2008} }




