five

Android Malware Dataset with VirusTotal Labels

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/11095699
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains labels of 2.47 million Android apk hashes extracted from VirusTotal reports. The dataset was used in the experiments of our publication titled An Analysis of Android Malware Classification ServicesThe csv of the labels that was extracted from the VirusTotal reports is provided in labeling_dataset.csv.gz . A cell's value of -1 is used whenever there was no result from theengine for the given apk file hash value. The column names are provided in cols_labeling_dataset.csv. Note -1 is a string and not an integer If you use information from this repo, please cite our paper Rashed M, Suarez-Tangil G. An Analysis of Android Malware Classification Services. Sensors. 2021; 21(16):5671. https://doi.org/10.3390/s21165671 BibTeX @Article{s21165671,AUTHOR = {Rashed, Mohammed and Suarez-Tangil, Guillermo},TITLE = {An Analysis of Android Malware Classification Services},JOURNAL = {Sensors},VOLUME = {21},YEAR = {2021},NUMBER = {16},ARTICLE-NUMBER = {5671},URL = {https://www.mdpi.com/1424-8220/21/16/5671},\ISSN = {1424-8220},DOI = {10.3390/s21165671}} Required Software gzip Debian-based Linux: you may install it using the following command apt-get install gzip MacOS: gzip is pre-installed Windows: you may download gzip from http://gnuwin32.sourceforge.net/packages/gzip.htm How to use the file? There are two ways to use the file: Extract the gzip file and then you will have a csv output file. For that you need to install gzip and then extracting .csv.gz. The user may use the command gunzip labelingDataset.csv.gz Extract information from the zipped file directly (following the same logic of AndroZoo's csv):To extract the first column and save to a file called list_of_selected_sha256, run the following command:zcat labelingDataset.csv.gz | cut -d',' -f1 > list_of_selected_sha256To obtain rows of apk hashes that were first seen after the 1st of May, 2016, run this command:zcat labeling_dataset.csv.gz | grep -v ',snaggamea' | awk -F, '{if ( $2 >= "2016-05" ) {print} }'
创建时间:
2024-05-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作