five

ALTA 2023 Shared Task

收藏
arXiv2025-09-30 收录
下载链接:
https://www.alta.asn.au/events/sharedtask2023/index.html
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集分为三个子集:训练集、验证集和测试集,专门用于检测人工智能生成的文本。其中包含的条目被标记为人工生成或人工智能生成。训练集包含三列数据:'id'、'text'和'label',而验证集和测试集则各包含两列:'id'和'text'。数据集在人工智能生成文本和人工生成文本之间均匀分布。规模方面,训练集包含18,000个条目,验证集包含2,000个条目,测试集也包含2,000个条目。该数据集的任务是进行人工智能生成文本的检测。

This dataset is divided into three subsets: training set, validation set, and test set, which is specifically tailored for AI-generated text detection. Each entry within the dataset is labeled as either human-generated or AI-generated. The training set comprises three columns: 'id', 'text', and 'label', while both the validation set and test set include two columns: 'id' and 'text'. The dataset features a balanced distribution between AI-generated and human-generated texts. Regarding its scale, the training set contains 18,000 entries, the validation set has 2,000 entries, and the test set also consists of 2,000 entries. The core task of this dataset is AI-generated text detection.
提供机构:
ALTA 2023 Shared Task organizers
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作