CTU-50-FEEL
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7515405
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains HTTPS-based features of the CTU-50 dataset.
These features dataset was created for the purposes of a thesis on Distributed Federated Learning for Network Security. The code of the thesis, as well as the code used for generating these features, can be found in this repository.
The dataset spans five days and is split into benign client data, which comes from the day-to-day traffic of real users, and malicious data, which was generated from captured traffic of malware.
The dataset does not include any identifiable information. The features were created from the raw Zeek captures by splitting them into one-hour widows and grouping all connections with the same 4-tuple. This means every hour, a connection to one service will produce a single feature vector.
The data is in the form of CSV, with the first 41 columns being numerical features. Each feature vector contains two columns with label information. Every vector is uniquely identifiable using these columns:
id.orig_h - source IP address. The last octet of the addres is obfuscated with a value consistent across the dataset.
id.resp_h - target IP address
id.resp_p - target port
proto - used protocol
day - the day of the capture (value from 1 to 5)
hour - an hour of the day from which the feature vector was computed
创建时间:
2023-01-10



