ATD
Source: arXiv, indexed 2025-09-30
Download link:
https://semeval.github.io/SemEval2024/
Description:
This dataset is intended for detecting AI-generated text across five domains: Wikipedia, Reddit, WikiHow, PeerRead, and arXiv. Human-written and AI-generated samples are balanced, with 3,000 samples for each domain-model combination. The task it supports is cross-domain and cross-model detection of AI-generated text.
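A cross-domain or cross-model evaluation is usually set up by holding out one domain (or one generator model) for testing and training on the rest. The sketch below illustrates that idea only. The record layout (`text`, `domain`, `model`, `label`), the lowercase domain names, and the toy records are assumptions made for illustration; they are not the dataset's actual schema or contents.

```python
from collections import Counter

# Hypothetical domain identifiers; the real files may name them differently.
DOMAINS = ["wikipedia", "reddit", "wikihow", "peerread", "arxiv"]


def hold_out(records, key, value):
    """Split records so that every record with record[key] == value goes to
    the test set and all remaining records go to the training set."""
    train = [r for r in records if r[key] != value]
    test = [r for r in records if r[key] == value]
    return train, test


def label_balance(records):
    """Count records per label (assumed here: 0 = human, 1 = AI-generated)."""
    return dict(Counter(r["label"] for r in records))


# Toy stand-in data: 4 records per domain, alternating labels.
toy = [
    {"text": f"sample {i}", "domain": d, "model": "some-model", "label": i % 2}
    for d in DOMAINS
    for i in range(4)
]

# Cross-domain split: train on four domains, test on the unseen fifth.
train, test = hold_out(toy, "domain", "arxiv")
print(len(train), len(test), label_balance(test))
```

The same `hold_out` helper gives a cross-model split by passing `"model"` as the key, assuming each AI-generated record carries the name of its generator.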
Provided by:
SemEval-2024 competition



