CASE 2021
arXiv | Indexed 2025-09-30
Download link:
https://github.com/emerging-welfare/case-2021-shared-task
Resource description:
This dataset provides the training and test data for the multilingual protest event detection task, covering documents in English, Portuguese, and Spanish. The training data was annotated by native speakers and was used in the CASE 2022 shared task. The size of the training set is not specified. The test set contains 3,870 English, 670 Portuguese, and 399 Spanish samples (4,939 in total). The task type is document classification.



