spam-email-5k5
Source: ModelScope community (魔搭社区). Updated: 2025-12-05. Indexed: 2025-11-15.
Download link:
https://modelscope.cn/datasets/mnemoraorg/spam-email-5k5
Resource description:
# Spam Email 5k5
## Description
SpamMail-Binary is a curated email corpus designed for training and evaluating spam-detection systems. Each record contains:
- **Message** – full email text, including subject and body
- **Category** – binary label: **Spam** or **Ham** (non-spam)
The collection spans a diverse range of phishing attempts, promotional blasts, newsletters, and legitimate correspondence, offering clean, real-world language patterns for natural-language-processing and machine-learning tasks.
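
As a minimal sketch of how the two fields might be consumed, the snippet below parses CSV text with `Category` and `Message` columns and maps the label to an integer (1 for Spam, 0 for Ham). The CSV layout, the column order, the sample rows, and the `load_records` helper are all assumptions made for illustration; the card itself only names the two fields and does not state the file format.

```python
import csv
import io

# Hypothetical sample rows for illustration only; not taken from the dataset.
# The CSV layout and column order are assumptions.
SAMPLE_CSV = """Category,Message
Ham,"Subject: Lunch? Are we still on for noon tomorrow?"
Spam,"Subject: You won! Click here to claim your prize now."
"""

# Binary encoding of the Category field: Ham -> 0, Spam -> 1.
LABELS = {"ham": 0, "spam": 1}


def load_records(handle):
    """Yield (message, label) pairs from a CSV file-like object."""
    for row in csv.DictReader(handle):
        category = row["Category"].strip().lower()
        if category not in LABELS:
            # Fail loudly on anything other than Spam or Ham.
            raise ValueError(f"unexpected category: {row['Category']!r}")
        yield row["Message"], LABELS[category]


records = list(load_records(io.StringIO(SAMPLE_CSV)))
for message, label in records:
    print(label, message)
```

To read from a downloaded file instead, pass an open file handle (for example `open(path, newline="", encoding="utf-8")`) in place of the `io.StringIO` object, adjusting the column names if the actual file differs.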
Provider:
maas
Created:
2025-09-08



