ds-claudia/artificial_text_classification

Name: ds-claudia/artificial_text_classification
Creator: ds-claudia
Published: 2024-12-17 23:40:48
License: 暂无描述

Hugging Face2024-12-17 更新2024-12-21 收录

下载链接：

https://hf-mirror.com/datasets/ds-claudia/artificial_text_classification

下载链接

链接失效反馈

官方服务：

资源简介：

Artificial Text Classification数据集旨在区分人类生成和机器生成的文本。该数据集提供了带有标签的文本示例，适用于训练和评估文本分类任务的机器学习模型。数据集包含810个英文文本示例，分为训练集和验证集，每个示例包含ID、文本内容和标签。标签为二进制，1表示机器生成的文本，0表示人类生成的文本。数据集可用于训练模型检测AI生成的文本，评估分类器在区分人工文本和人类文本方面的性能，以及自然语言理解和对抗性文本生成的研究。

The **Artificial Text Classification** dataset is designed to distinguish between human-generated and machine-generated text. This dataset provides labeled examples of text, enabling researchers and developers to train and evaluate machine learning models for text classification tasks. Key features include: text samples (including both human-written and machine-generated text) and labels (binary target variable where 1 represents machine-generated text and 0 represents human-written text). This dataset is particularly useful for evaluating the performance of natural language processing models in detecting synthetic or artificially generated text. The dataset structure includes columns for ID, Text, and label, with a total of 810 examples in English. The dataset can be used for tasks such as training models to detect AI-generated text, evaluating classifiers, and conducting research in natural language understanding and adversarial text generation.

提供机构：

ds-claudia

5,000+

优质数据集

54 个

任务类型

进入经典数据集