rohitsaxena/MENSA
收藏MENSA: Movie Scene Saliency Dataset
数据集概述
MENSA(Movie Scene Saliency Dataset)数据集来自论文《Select and Summarize: Scene Saliency for Movie Script Summarization》,包含电影剧本及其对应的摘要。每个场景都标注了场景显著性标签。训练集包含自动生成的银标签,而验证集和测试集包含人工标注的金标签。
数据集结构
数据集分为三部分:
- 训练集:包含电影剧本和摘要,带有自动生成的银场景显著性标签。
- 验证集:包含电影剧本和摘要,带有手工标注的金场景显著性标签。
- 测试集:包含电影剧本和摘要,带有手工标注的金场景显著性标签。
许可证
Creative Commons Attribution Non Commercial 4.0
引用
@misc{saxena2024select, title={Select and Summarize: Scene Saliency for Movie Script Summarization}, author={Rohit Saxena and Frank Keller}, year={2024}, eprint={2404.03561}, archivePrefix={arXiv}, primaryClass={cs.CL} }
@inproceedings{saxena-keller-2024-select, title = "Select and Summarize: Scene Saliency for Movie Script Summarization", author = "Saxena, Rohit and Keller, Frank", editor = "Duh, Kevin and Gomez, Helena and Bethard, Steven", booktitle = "Findings of the Association for Computational Linguistics: NAACL 2024", month = jun, year = "2024", address = "Mexico City, Mexico", publisher = "Association for Computational Linguistics", pages = "3439--3455", }



