Original Arabic Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://developer.twitter.com/en/docs/api-reference-index
下载链接
链接失效反馈官方服务:
资源简介:
该数据集为阿拉伯语的讽刺检测训练和测试数据集,由组织者为子任务A和B提供。数据集包含推文、是否讽刺以及方言等字段。规模方面,训练集包含2841条推文,测试集包含713条推文,任务旨在进行讽刺检测。
This is an Arabic sarcasm detection training and test dataset, provided by the organizers for Subtasks A and B. The dataset comprises fields including tweets, sarcasm status (whether the content is sarcastic), and dialect information. In terms of scale, the training set consists of 2841 tweets, while the test set contains 713 tweets. The core task of this dataset is sarcasm detection.
提供机构:
SemEval-2022 organizers



