d0rj/SemEval2024-task8
Hugging Face | Updated 2024-10-22 | Indexed 2024-12-14
Download link:
https://hf-mirror.com/datasets/d0rj/SemEval2024-task8
Overview:
SemEval2024-task8 is a multidomain, multimodel, and multilingual dataset for detecting machine-generated text. It is divided into four subtasks: Subtask A_monolingual, Subtask A_multilingual, Subtask B, and Subtask C. Each subtask has its own features, such as text, label, model, source, and id. The dataset is intended mainly for text classification, covers several languages (including English, Arabic, German, and Italian), and labels each text as machine-generated or human-written. Sources include Wikipedia, Wikihow, Peerread, Reddit, and Arxiv.
Subtasks A and B involve detecting machine-generated text, while Subtask C focuses on detecting where a text changes from human-written to machine-generated. Each subtask has train, dev, and test sets, with data file paths and size information provided for each.
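As a rough illustration of how the fields named above (text, label, model, source, id) might be used, the sketch below counts labels per source over a few in-memory rows. The field types, the label convention (0 for human-written, 1 for machine-generated), the helper name `label_counts_by_source`, and every sample value are assumptions made for this example; they are not taken from the dataset files.

```python
from collections import Counter
from typing import TypedDict


class Record(TypedDict):
    # Field names follow the overview; the types are assumptions.
    id: int
    text: str
    label: int   # assumed convention: 0 = human-written, 1 = machine-generated
    model: str
    source: str


def label_counts_by_source(records: list[Record]) -> dict[str, Counter]:
    """Count how many rows carry each label, grouped by source (hypothetical helper)."""
    counts: dict[str, Counter] = {}
    for rec in records:
        counts.setdefault(rec["source"], Counter())[rec["label"]] += 1
    return counts


# Invented rows for illustration only; they are not real dataset entries.
sample: list[Record] = [
    {"id": 0, "text": "A human-written paragraph.", "label": 0,
     "model": "human", "source": "wikipedia"},
    {"id": 1, "text": "A generated paragraph.", "label": 1,
     "model": "some-llm", "source": "wikipedia"},
    {"id": 2, "text": "Another generated paragraph.", "label": 1,
     "model": "some-llm", "source": "reddit"},
]

counts = label_counts_by_source(sample)
for source, per_label in sorted(counts.items()):
    print(source, dict(per_label))
```

Rows from the actual train, dev, or test files could be passed to the same function once they are loaded as dictionaries, provided the field names match.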
Provider:
d0rj



