Original Arabic Dataset

Name: Original Arabic Dataset
Creator: SemEval-2022 organizers
License: not specified

Source: arXiv, indexed 2025-09-30

Download link:

https://developer.twitter.com/en/docs/api-reference-index

Overview:

This is the Arabic sarcasm detection dataset that the organizers provided for Subtasks A and B, with separate training and test splits. Each record includes the tweet, a label indicating whether the tweet is sarcastic, and the tweet's dialect. The training set contains 2,841 tweets and the test set contains 713 tweets.
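The record layout and split sizes described above can be sketched in Python. This is only an illustration: the class name `SarcasmRecord`, the field names `tweet`, `sarcastic`, and `dialect`, and the helper `split_share` are hypothetical, since the listing names the kinds of fields but not the actual column names or file format.

```python
from dataclasses import dataclass

# Split sizes as stated in the description above.
TRAIN_SIZE = 2841
TEST_SIZE = 713


@dataclass
class SarcasmRecord:
    # Hypothetical field names; the listing only says each record carries
    # the tweet, a sarcasm flag, and the dialect.
    tweet: str
    sarcastic: bool
    dialect: str


def split_share(part_size: int, other_size: int) -> float:
    """Return the fraction of all tweets that fall in the first split."""
    return part_size / (part_size + other_size)


total = TRAIN_SIZE + TEST_SIZE
share = split_share(TEST_SIZE, TRAIN_SIZE)
print(f"total tweets: {total}")   # total tweets: 3554
print(f"test share: {share:.1%}")  # test share: 20.1%
```

By these counts the test set is roughly one fifth of the 3,554 tweets, so the split is close to 80/20.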

Provided by:

SemEval-2022 organizers
