检测恶意软件类型数据集
收藏帕依提提2024-03-04 收录
下载链接:
https://www.payititi.com/opendatasets/show-26030.html
下载链接
链接失效反馈官方服务:
资源简介:
Provide the names, email addresses, institutions, and other contact information of the donors and creators of the data set. Data Set Information: This study seeks to obtain data which will help to address machine learning based malware research gaps. The specific objective of this study is to build a benchmark dataset for Windows operating system API calls of various malware. This is the first study to undertake metamorphic malware to build sequential API calls. It is hoped that this research will contribute to a deeper understanding of how metamorphic malware change their behavior (i.e. API calls) by adding meaningless opcodes with their own dissembler/assembler parts. In our research, we have translated the families produced by each of the software into 8 main malware families: Trojan, Backdoor, Downloader, Worms, Spyware Adware, Dropper, Virus. Table 1 shows the number of malware belonging to malware families in our data set. As you can see in the table, the number of samples of other malware families except AdWare is quite close to each other. There is such a difference because we don't find too much of malware from the adware malware family. Attribute Information: Various Windows API calls Relevant Papers: Provide references to papers that have cited this data set in the past (if any). Citation Request: AF. Yaz?±, F?– ??atak, E. G??l, Classification of Metamorphic Malware with Deep Learning (LSTM), IEEE Signal Processing and Applications Conference, 2019. Catak, F?–., Yazi, AF., A Benchmark API Call Dataset for Windows PE Malware Classification, [Web Link], 2019.
请提供本数据集的捐赠者与创作者的姓名、电子邮箱、所属机构及其他联系方式。
数据集说明:本研究旨在获取相关数据,以填补基于机器学习的恶意软件研究空白。本研究的具体目标是构建一套针对各类恶意软件的Windows操作系统API调用基准数据集。本研究为首项基于变形恶意软件(metamorphic malware)构建序列API调用数据集的研究工作。本研究期望通过为变形恶意软件添加搭载自定义反汇编/汇编模块的无意义操作码,助力学界更深入地理解变形恶意软件如何改变自身行为(即API调用模式)。
在本研究中,我们将各工具生成的恶意软件家族划分为8个主要类别:特洛伊木马(Trojan)、后门程序(Backdoor)、下载器(Downloader)、蠕虫(Worms)、间谍广告软件(Spyware Adware)、投递器(Dropper)、病毒(Virus)。表1展示了本数据集内各恶意软件家族的样本数量。如表中所示,除广告软件(AdWare)家族外,其余恶意软件家族的样本数量均较为均衡。出现该数量差异的原因在于我们未能采集到足够多的广告软件家族恶意软件样本。
属性说明:本数据集涵盖各类Windows API调用。
相关论文:请提供过往引用本数据集的学术论文参考文献(若有)。
引用请求:
1. AF. Yazıcı, F. Öztürk, E. Göl, 《基于深度学习(LSTM)的变形恶意软件分类》,IEEE信号处理与应用会议,2019年。
2. Catak, F.Ö., Yazi, AF., 《面向Windows PE恶意软件分类的基准API调用数据集》,[网页链接],2019年。
提供机构:
帕依提提
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集是一个用于恶意软件分类的基准数据集,专注于Windows操作系统的API调用序列,涵盖特洛伊木马、后门、下载器等多种恶意软件类型。其特点是首次针对变形恶意软件构建顺序API调用数据,旨在研究恶意软件如何通过添加无意义操作码来改变行为,为机器学习模型提供训练和评估基础。
以上内容由遇见数据集搜集并总结生成



