Exploiting Statistical and Structural Features for the Detection of Domain Generation Algorithms
Source: NIAID Data Ecosystem (indexed 2026-03-12)
Download link:
https://zenodo.org/record/4010619
Description:
This repository contains a dataset for the research of domain generation algorithms (DGAs) and machine learning. More precisely, it targets dictionary-based DGAs.
Constantinos Patsakis, Fran Casino: "Exploiting Statistical and Structural Features for the Detection of Domain Generation Algorithms", Journal of Information Security and Applications, 2021.
Features ordered as in the shared dataset:
Family: DGA that the domain belongs to
SLD: Second-level domain (SLD) of the Domain
L-LEN: The length of Domain
L-DIG: The number of digits in Domain
L-CON-MAX: The maximum number of consecutive consonants in Domain
R-CON-VOW: Number of consonants divided by L-LEN
L-SYM: The number of special characters
R-SYM-LEN: L-SYM divided by L-LEN
R-Dom-3G: Ratio of benign grams in Dom-3G
R-Dom-4G: Ratio of benign grams in Dom-4G
R-Dom-5G: Ratio of benign grams in Dom-5G
L-W2: Number of words with more than 2 characters in Domain
L-W3: Number of words with more than 3 characters in Domain
R-WS-LEN: Dom-WS divided by L-LEN
R-WDS-LEN: Dom-WDS divided by L-LEN
R-W2-LEN: Dom-W2 divided by L-LEN
R-W3-LEN: Dom-W3 divided by L-LEN
M2-Dom-Ws: 2-Chain Markov English grams applied to Dom-WS
M2-Dom-WDS: 2-Chain Markov English grams applied to Dom-WDS
E-Dom-WS: Entropy of Dom-WS
E-Dom-WDS: Entropy of Dom-WDS
E-Dom-W2: Entropy of Dom-W2
E-Dom-W3: Entropy of Dom-W3
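To make the feature definitions above concrete, the following is a minimal Python sketch of how a few of them could be computed from an SLD string. It is an illustration only, not the authors' implementation: the function names (`lexical_features`, `shannon_entropy`, `benign_ngram_ratio`) are hypothetical, the input is assumed to be ASCII, "y" is treated as a consonant, the entropy is assumed to be character-level Shannon entropy, and the set of benign n-grams is a placeholder supplied by the caller. The derived strings Dom-WS, Dom-WDS, Dom-W2 and Dom-W3 are not defined in this description, so the sketch applies entropy to whatever string is passed in.

```python
import math
import re
from collections import Counter

VOWELS = set("aeiou")  # assumption: "y" counts as a consonant


def lexical_features(sld: str) -> dict:
    """Compute the simple length/count features for an ASCII SLD (sketch)."""
    s = sld.lower()
    length = len(s)
    digits = sum(ch.isdigit() for ch in s)
    consonants = sum(ch.isalpha() and ch not in VOWELS for ch in s)
    symbols = sum(not ch.isalnum() for ch in s)
    # Longest run of consecutive consonants (ASCII letters other than a, e, i, o, u)
    runs = re.findall(r"[b-df-hj-np-tv-z]+", s)
    con_max = max((len(r) for r in runs), default=0)
    return {
        "L-LEN": length,
        "L-DIG": digits,
        "L-CON-MAX": con_max,
        # Follows the description above: consonants divided by L-LEN
        "R-CON-VOW": consonants / length if length else 0.0,
        "L-SYM": symbols,
        "R-SYM-LEN": symbols / length if length else 0.0,
    }


def shannon_entropy(text: str) -> float:
    """Character-level Shannon entropy in bits (assumed definition)."""
    if not text:
        return 0.0
    n = len(text)
    return -sum((c / n) * math.log2(c / n) for c in Counter(text).values())


def benign_ngram_ratio(sld: str, n: int, benign_grams: set) -> float:
    """Share of the SLD's character n-grams found in a benign n-gram set."""
    grams = [sld[i:i + n] for i in range(len(sld) - n + 1)]
    if not grams:
        return 0.0
    return sum(g in benign_grams for g in grams) / len(grams)
```

For example, `lexical_features("strength7-x")` yields an L-LEN of 11, an L-DIG of 1, an L-CON-MAX of 4 (the run "ngth") and an L-SYM of 1. In practice the benign n-gram set would be built from a corpus of legitimate domains; how that corpus was chosen for this dataset is not stated here.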
Created:
2020-12-21



