Phishing Dataset for Machine Learning: Feature Evaluation

Mendeley Data, updated 2018-03-24, indexed 2026-04-09

Download link:

https://data.mendeley.com/datasets/h3cgnj8hft/1

Description:

This dataset contains 48 features extracted from 5000 phishing webpages and 5000 legitimate webpages, which were downloaded from January to May 2015 and from May to June 2017. Features are extracted with an improved technique that leverages a browser automation framework (Selenium WebDriver), which is more precise and robust than a parsing approach based on regular expressions. The dataset is WEKA-ready.

Phishing webpage sources: PhishTank, OpenPhish
Legitimate webpage sources: Alexa, Common Crawl

Anti-phishing researchers and experts may find this dataset useful for analyzing phishing features, conducting rapid proof-of-concept experiments, or benchmarking phishing classification models.
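Because the dataset is described as WEKA-ready, it is presumably distributed in WEKA's ARFF format. The following is a minimal sketch, under that assumption, of how one might sanity-check a downloaded copy against the figures quoted above (48 features, 5000 pages per class). The attribute names in the inline sample (`NumDots`, `UrlLength`, `CLASS_LABEL`) and the helper names `read_arff` and `summarize` are hypothetical and used only for illustration; the parser handles only simple dense ARFF with unquoted attribute names, and it assumes the class label is the last attribute.

```python
import io
from collections import Counter


def read_arff(stream):
    """Parse a simple dense ARFF stream into (attribute_names, rows).

    Assumes unquoted attribute names and comma-separated data rows;
    this is an illustrative subset of ARFF, not a full parser.
    """
    names, rows, in_data = [], [], False
    for raw in stream:
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and ARFF comments
        if in_data:
            rows.append([value.strip() for value in line.split(",")])
        elif line.lower().startswith("@attribute"):
            names.append(line.split()[1])  # second token is the name
        elif line.lower().startswith("@data"):
            in_data = True
    return names, rows


def summarize(names, rows, label=None):
    """Count features, rows, and rows per class (label defaults to last attribute)."""
    label = label or names[-1]
    idx = names.index(label)
    return {
        "n_features": len(names) - 1,  # every attribute except the label
        "n_rows": len(rows),
        "class_counts": Counter(row[idx] for row in rows),
    }


# Tiny made-up sample standing in for the real file (hypothetical names and values).
SAMPLE = """@relation phishing_sample
@attribute NumDots numeric
@attribute UrlLength numeric
@attribute CLASS_LABEL {0,1}
@data
3,72,1
1,30,0
2,45,1
1,28,0
"""

names, rows = read_arff(io.StringIO(SAMPLE))
summary = summarize(names, rows)
print(summary)
```

To check a real download, pass an open file handle to `read_arff` in place of the `io.StringIO` sample. If the description above holds, `n_features` should be 48 and each class should have 5000 rows.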

Created:

2018-03-24
