BPE formatted payload
Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://data.mendeley.com/datasets/h9gm92pjbc
Description:
The dataset is encoded in the BPE (byte pair encoding) format of the SentencePiece package (Kudo and Richardson, 2018). The training and test sets are encoded in the same way. The data is stored in CSV format with the following columns:
1. label_name: threat group name (for reference only)
2. label_code: threat group code, used as the ground-truth label (Y data)
3. raw_packet: encoded attack payload, used as the training input (X data)
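The column layout above can be read with a few lines of Python. This is a minimal sketch, not part of the dataset's documentation: it assumes the CSV has a header row using exactly the three column names listed, and it keeps every value as a string because the listing does not state the column types. The function name `load_payload_rows` and the sample row are hypothetical placeholders.

```python
import csv
import io


def load_payload_rows(csv_file):
    """Split rows into X (raw_packet), Y (label_code), and reference names.

    Assumes a header row with the columns label_name, label_code, raw_packet.
    Values are returned as strings; the listing does not specify their types.
    """
    reader = csv.DictReader(csv_file)
    payloads, codes, names = [], [], []
    for row in reader:
        payloads.append(row["raw_packet"])  # X data: BPE-encoded payload
        codes.append(row["label_code"])     # Y data: threat group code
        names.append(row["label_name"])     # reference only, not a target
    return payloads, codes, names


# Hypothetical in-memory sample standing in for a real file from the dataset.
sample = io.StringIO(
    "label_name,label_code,raw_packet\n"
    "example_group,0,tok1 tok2 tok3\n"
)
payloads, codes, names = load_payload_rows(sample)
print(len(payloads), codes[0], names[0])
```

For a file on disk, the same function can be called with `open(path, newline="", encoding="utf-8")`, where the path and encoding are again assumptions to check against the downloaded files.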
Created:
2022-05-03



