five

WinMal25 Dataset

收藏
IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/winmal25-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
Obfuscated malware detection is a complex task where classification performance is seriously affected due to the evasion techniques presented in the input software samples. This research follows the novel memory analysis technique to examine features extracted from different RAM snapshots over compromised Windows Virtual Machines. For this, we use the CIC-MalMem-2022 dataset and create a new collection of data that we call WinMal25, which is based on fileless malware. Moreover, we apply the Self-Supervised Learning paradigm directly in the tabular data domain, leveraging the representation learning of massive amounts of unlabeled information to provide a strong generalization capacity to our models. To the best of our knowledge, this is the first work implementing Self-Supervised Tabular Learning for the malware detection problem. The results exhibit proven evidence that Self-Supervised Learning using Tabular Networks outperforms, in terms of detection rate and inference time performance, popular baselines like Multi-layer Perceptron and Random Forest, by 0.36% in accuracy and 1.85% in macro F1 score. The original experimentation detailed herein, encompassing Explainable Artificial Intelligence, yields relevant insights toward a simpler characterization of obfuscated malware and the considerations behind deploying a memory-based antivirus.
提供机构:
Almaraz-Rivera, Josue Genaro; Perez-Diaz, Jesus Arturo; Cantoral-Ceballos, Jose Antonio; Botero, Juan Felipe; Branco, Paula; Jourdan, Guy-Vincent
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作