agentlans/real-vs-gpt2-sentences
收藏Hugging Face2025-01-18 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/agentlans/real-vs-gpt2-sentences
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含30万+句子的数据集,用于比较人类撰写的文本和AI生成的文本完成情况。每条记录包含一个句子种子(seed)、原始完整句子(real)和AI生成的句子完成(gpt2)。数据集来源于高质量英语句子数据集,并使用GPT2模型生成句子完成。该数据集适用于比较语言分析、AI文本检测研究和自然语言理解研究。
This dataset contains over 300,000 sentences comparing human-written text with AI-generated completions. Each entry includes a sentence seed, the original complete sentence, and the AI-generated sentence completion. The sentences are sourced from a high-quality English sentences dataset and completions are generated using the GPT2 model. The dataset is suitable for comparative language analysis, AI text detection research, and natural language understanding studies.
提供机构:
agentlans



