Ember: Endgame Malware BEnchmark for Research (2017-01-01 to 2017-12-31)
Source: DataCite Commons. Updated: 2020-07-15. Indexed: 2025-04-09
Download link:
https://www.impactcybertrust.org/dataset_view?idDataset=1146
Resource description:
The ember dataset is a collection of 1.1 million SHA-256 hashes from Portable Executable (PE) files that were scanned sometime in 2017. This repository makes it easy to reproducibly train the benchmark model, extend the provided feature set, or classify new PE files with the benchmark model.
The dataset includes features extracted from 1.1M binary files: 900K training samples (300K malicious, 300K benign, 300K unlabeled) and 200K test samples (100K malicious, 100K benign). The dataset is accompanied by open-source code for extracting features from additional binaries, so that features for new samples can be appended to the dataset. This dataset fills a void in the information security machine learning community: a benign/malicious dataset that is large, open, and general enough to cover several interesting use cases.
Creator:
Hyrum Anderson
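The split sizes in the description can be checked with a few lines of Python. This is only an illustrative sketch: the `SPLITS` dictionary and the helpers `split_total` and `labeled_only` are hypothetical names introduced here, not part of the dataset's own code, and the use of `-1` as the marker for unlabeled samples is an assumption that should be verified against the actual data files.

```python
# Split sizes as stated in the resource description (counts of samples).
SPLITS = {
    "train": {"malicious": 300_000, "benign": 300_000, "unlabeled": 300_000},
    "test": {"malicious": 100_000, "benign": 100_000},
}


def split_total(name):
    """Return the total number of samples in the named split."""
    return sum(SPLITS[name].values())


def labeled_only(labels, unlabeled_marker=-1):
    """Return the indices of samples that carry a label.

    ASSUMPTION: unlabeled samples are marked with -1; adjust
    `unlabeled_marker` if the data uses a different convention.
    """
    return [i for i, y in enumerate(labels) if y != unlabeled_marker]


# Total across both splits; should match the 1.1M figure in the description.
total = sum(split_total(name) for name in SPLITS)
```

A supervised model can only use the labeled part of the training split, so a helper like `labeled_only` would typically be applied to the training labels before fitting.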
Provider:
IMPACT
Created:
2018-12-18



