ICCIES-2025-DetectAI/vietnamese_news_human_ai
收藏Hugging Face2025-09-19 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/ICCIES-2025-DetectAI/vietnamese_news_human_ai
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于区分越南语新闻文章是由人类撰写还是由AI生成的数据集。数据集包含了200,000篇文章,其中100,000篇是人类撰写的,来自Thanh Niên和VnExpress两个知名新闻平台;另外100,000篇是由AI生成的,使用GPT-4o Mini、Gemini Flash 1.5、Llama 3.3和DeepSeek等模型根据特定提示生成,以确保内容的多样性和风格。每个条目都包括一个越南语新闻段落(200-400字)和一个标签(0代表人类撰写,1代表AI生成)。
This is a dataset for distinguishing between Vietnamese news articles written by humans and those generated by AI. The dataset consists of 200,000 articles, with 100,000 written by humans from the reputable news platforms Thanh Niên and VnExpress, and another 100,000 generated by AI using models such as GPT-4o Mini, Gemini Flash 1.5, Llama 3.3, and DeepSeek based on specific prompts to ensure content diversity and style. Each entry includes a Vietnamese news passage (200-400 words) and a label (0 for human-written, 1 for AI-generated).
提供机构:
ICCIES-2025-DetectAI



