ds-claudia/artificial_text_classification
收藏Hugging Face2024-12-17 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/ds-claudia/artificial_text_classification
下载链接
链接失效反馈官方服务:
资源简介:
Artificial Text Classification数据集旨在区分人类生成和机器生成的文本。该数据集提供了带有标签的文本示例,适用于训练和评估文本分类任务的机器学习模型。数据集包含810个英文文本示例,分为训练集和验证集,每个示例包含ID、文本内容和标签。标签为二进制,1表示机器生成的文本,0表示人类生成的文本。数据集可用于训练模型检测AI生成的文本,评估分类器在区分人工文本和人类文本方面的性能,以及自然语言理解和对抗性文本生成的研究。
The **Artificial Text Classification** dataset is designed to distinguish between human-generated and machine-generated text. This dataset provides labeled examples of text, enabling researchers and developers to train and evaluate machine learning models for text classification tasks. Key features include: text samples (including both human-written and machine-generated text) and labels (binary target variable where 1 represents machine-generated text and 0 represents human-written text). This dataset is particularly useful for evaluating the performance of natural language processing models in detecting synthetic or artificially generated text. The dataset structure includes columns for ID, Text, and label, with a total of 810 examples in English. The dataset can be used for tasks such as training models to detect AI-generated text, evaluating classifiers, and conducting research in natural language understanding and adversarial text generation.
提供机构:
ds-claudia



