five

AreLit/PhishNChips

收藏
Hugging Face2026-04-08 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/AreLit/PhishNChips
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: PhishNChips language: - en task_categories: - text-classification tags: - phishing-detection - llm-security - benchmark - email-security size_categories: - 1K<n<10K license: other configs: - config_name: default data_files: - split: core path: core_emails.csv - split: benchmark_results path: benchmark_results.csv - split: cross_domain_legitimate_v5 path: cross_domain_legitimate_v5.csv - split: infrastructure_phishing_expanded path: infrastructure_phishing_expanded.csv - split: real_phishing_validation path: real_phishing_validation.csv --- # PhishNChips: A Benchmark for LLM Email-Agent Security PhishNChips is a large-scale benchmark for evaluating how system prompt configurations influence the security behavior of LLM-based email agents. This repository contains the canonical v5.2 release, featuring 2,000 email stimuli and 220,000 adjudicated model evaluations. ## Dataset Overview The benchmark measures a critical deployment variable: how strongly an LLM's system prompt shapes its phishing detection capabilities and false-positive characteristics. PhishNChips provides a controlled environment to study the trade-offs between security, helpfulness, and instruction following in agentic systems. ### Core Components - **Core Benchmark (2,000 emails):** 1,000 phishing emails (grounded in real malicious URLs) and 1,000 legitimate workplace emails (including 333 cross-domain samples). - **Adjudicated Evaluations (220,000):** A full result grid spanning 11 frontier models and 10 distinct system prompt strategies. - **URL Evasion Taxonomy:** Stratified samples covering zero-signal, hidden-signal, and inverted-signal (infrastructure phishing) evasion techniques. ## File Structure | File | Description | |---|---| | `core_emails.csv` | The primary 2,000-email benchmark stimuli. | | `benchmark_results.csv` | Full result grid (11 models x 10 strategies x 2,000 emails). | | `reference_results.csv` | Summary metrics and leaderboard rankings. | | `prompt_strategies.json` | Detailed definitions of the 10 evaluated system prompts. | | `cross_domain_legitimate_v5.csv` | Dedicated split for cross-domain false-positive analysis. | | `infrastructure_phishing_expanded.csv` | Auxiliary split for infrastructure-level stress testing. | | `real_phishing_validation.csv` | Historical real-phishing validation set (Nazario). | | `croissant.json` | Machine-readable metadata (MLCommons Croissant format). | | `SOURCE_LICENSES.md` | Comprehensive provenance and licensing documentation. | ## Datasource Composition The benchmark leverages high-quality, verified data from several major security feeds and research utilities: | Datasource | Count | Category | |---|---:|---:| | `phishtank` | 700 | Phishing | | `tranco` | 662 | Legitimate | | `cross_domain_expansion_v1` | 333 | Legitimate | | `github_phishing_db_live` | 172 | Phishing | | `github_phishing_db` | 62 | Phishing | | `openphish` | 66 | Phishing | | `adversarial_legit` | 5 | Legitimate | ## Responsible Use This dataset contains real malicious URL indicators. Treat all URLs as **offline text strings only**. Do not visit, crawl, or execute URLs from this repository. This resource is intended strictly for security research and defensive evaluation. ## License and Attribution The PhishNChips benchmark project code and synthetic content are released under an **MIT License**. However, the dataset incorporates third-party-derived malicious URL indicators and benign domain seeds. These components are redistributed for academic research with attribution. **Review [SOURCE_LICENSES.md](https://huggingface.co/datasets/AreLit/PhishNChips/blob/main/SOURCE_LICENSES.md) for full citations and provenance details.** - **Nazario Phishing Corpus:** CC-BY-4.0. - **OpenPhish:** Approved for academic research use (Apr 6, 2026). - **PhishTank:** Cleared via Cisco/PhishTank terms. - **Tranco:** Sourced via academic use norms (Le Pochat et al. 2019). ## Citation ```bibtex @article{litvak2026phishnchips, title={The System Prompt Is the Attack Surface: How {LLM} Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities}, author={Litvak, Ron}, journal={arXiv preprint}, year={2026} } ```
提供机构:
AreLit
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作