PhiUSIIL Phishing URL Dataset
Source: Mendeley Data. Updated: 2024-01-31. Indexed: 2024-06-26.
Download link:
https://data.mendeley.com/datasets/shwpxscxy2
Description:
PhiUSIIL Phishing URL Dataset is a substantial dataset comprising 134,850 legitimate and 100,945 phishing URLs. Most of the URLs analyzed while constructing the dataset were recent at the time of collection. Features are extracted from the webpage source code and from the URL itself. Some features, such as CharContinuationRate, URLTitleMatchScore, URLCharProb, and TLDLegitimateProb, are derived from existing features. Citation: Prasad, A., & Chandra, S. (2023). PhiUSIIL: A diverse security profile empowered phishing URL detection framework based on similarity index and incremental learning. Computers & Security, 103545. doi: https://doi.org/10.1016/j.cose.2023.103545
Created:
2024-01-31
Collected summary
Dataset introduction

Background and challenges
Background overview
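PhiUSIIL Phishing URL Dataset is a large-scale dataset of 235,795 URLs (134,850 legitimate and 100,945 phishing). Its features are extracted from webpage source code and from the URLs themselves, and it is suited to cybersecurity and machine learning research. The dataset is distributed in CSV format, is 54.2 MB in size, and comes with a citation to the associated research paper.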
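Because the dataset ships as a single CSV file with a stated class split, a quick sanity check after download is to count the rows per class and compare them with the published figures. The sketch below is a minimal illustration using only the Python standard library. The column name `label`, its values, and the tiny in-memory sample are assumptions made for this example; this listing does not document the column layout, so check the actual file's header before relying on them.

```python
import csv
import io
from collections import Counter


def count_labels(csv_file, label_column="label"):
    """Count rows per class in a CSV file-like object.

    `label_column` is an assumed column name, not one documented in this listing.
    """
    reader = csv.DictReader(csv_file)
    return Counter(row[label_column] for row in reader)


def class_shares(counts):
    """Convert per-class counts into fractions of the total."""
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


# Hypothetical three-row stand-in for the real file (made-up URLs and labels).
sample = io.StringIO(
    "URL,label\n"
    "https://example.com,1\n"
    "http://example.test/login,0\n"
    "https://example.org,1\n"
)
print(count_labels(sample))

# Class shares implied by the counts published in the description.
published = {"legitimate": 134850, "phishing": 100945}
print(sum(published.values()))  # total number of URLs
print(class_shares(published))
```

With the published counts, the total is 235,795 URLs, of which roughly 57% are legitimate and 43% are phishing, so the classes are moderately imbalanced. To run the same check on the downloaded file, pass an open file handle (for example `open(path, newline="", encoding="utf-8")`) to `count_labels` in place of the in-memory sample.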
The content above was collected and summarized by 遇见数据集.



