Fast-Flux Dataset: Enhancing Cybersecurity Analysis and Defense
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/5h7n3xtb4x
下载链接
链接失效反馈官方服务:
资源简介:
The enhanced fast-flux domain dataset comprises a comprehensive collection of 91,530 domain records systematically gathered through active DNS monitoring and the integration of threat intelligence over a two-month observation period. The dataset represents one of the largest publicly available collections of fast-flux domain data, featuring detailed behavioral and structural analysis. It provides researchers with unprecedented access to real-world cybersecurity threat data.
The dataset structure incorporates 20 distinct features for each domain record, carefully engineered to capture the multifaceted nature of fast-flux network operations. These features span four primary categories: network connectivity characteristics, domain naming and structural properties, historical relationship data, and web infrastructure presence indicators. Each feature has been validated through statistical analysis and correlation studies to ensure discriminative power and analytical relevance.
Network connectivity features form the foundation of the dataset, capturing the distinctive IP rotation patterns that define fast-flux operations. The number of associated IP addresses per domain ranges from a minimum of 1 to a maximum of 440, with fast-flux domains typically demonstrating significantly higher IP counts compared to legitimate domains. Geographic distribution analysis reveals that fast-flux domains span multiple countries and cities, with a median of 2 countries and two cities per domain, reflecting the distributed nature of fast-flux infrastructure.
The Autonomous System Number (ASN) diversity metric provides insights into the network infrastructure preferences of fast-flux operators, with values ranging from 2 to 194 ASNs per domain. This high variability indicates that while most domains utilize relatively few ASNs, sophisticated fast-flux operations may leverage dozens or hundreds of different network providers to maximize resilience and evade detection.
Domain naming characteristics reveal significant behavioral differences between fast-flux and legitimate domains. Domain length analysis reveals a range of 4 to 244 characters, with a median length of 16 characters and a mean of approximately 19.7 characters. More significantly, domain entropy analysis reveals that fast-flux domains exhibit higher randomness in naming patterns, with entropy values averaging 3.5-4 bits, compared to 3.0-3.3 bits for legitimate domains. This entropy differential reflects the algorithmic or pseudo-random generation techniques commonly employed in fast-flux domain creation.
Top-level domain (TLD) distribution analysis reveals that 99.5% of domains in the dataset utilize Type 2 TLDs (primarily .com), indicating a strong preference for common, trusted TLD types that may help evade suspicion. Additionally, 16% of domains utilize private WHOIS registration, indicating that operational security considerations are a factor in the deployment of fast-flux networks.
创建时间:
2025-07-21



