802.11 Managemement frames from a public location
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8003771
下载链接
链接失效反馈官方服务:
资源简介:
About
The following datasets were captured at a busy Belgian train station between 9pm and 10pm, it contains all 802.11 management frames that were captured. both datasets were captured with approximately 20 minutes between then.
Both datasets are represented by a pcap and CSV file. The CSV file contains the frame type, timestamps, signal strength, SSID and MAC addresses for every frame. In the pcap file, all generic 802.11 elements were removed for anonymization purposes.
Anonymization
All frames were anonymized by removing identifying information or renaming identifiers. Concretely, the following transformations were applied to both datasets:
All MAC addresses were renamed (e.g. 00:00:00:00:00:01)
All SSID's were renamed (e.g. NETWORK_1)
All generec 802.11 elements were removed from the pcap
In the pcap file, anonymization actions could lead to "corrupted" frames because length tags do not correspond with the actual data. However, the file and its frames are still readable in packet analyzing tools such as Wireshark or Scapy.
The script which was used to anonymize is available in the dataset.
Data
Specifications for the datasets
N/o
Dataset 1
dataset 2
Frames
36306
60984
Beacon frames
19693
27983
Request frames
798
1580
Response frames
15815
31421
Identified Wi-Fi Networks
54
70
Identified MAC addresses
2092
2705
Identified Wireless devices
128
186
Capturetime
480s
422s
Dataset contents
The two datasets are stored in the directories `1/` and `2/`. Each directory contains:
`capture-X.pcap`: an anonymized version of the original capture
`capture-X.csv`: content of each captured frame (timestamp, MAC address...) saved as a CSV file
`anonymization.py` is the script which was used to remove identifiers.
`README.md` contains the documentation about the datasets
License
Copyright 2022-2023 Benjamin Vermunicht, Beat Signer, Maxim Van de Wynckel, Vrije Universiteit Brussel
Permission is hereby granted, free of charge, to any person obtaining a copy of this dataset and associated documentation files (the “Dataset”), to deal in the Dataset without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Dataset, and to permit persons to whom the Dataset is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions that make use of the Dataset.
THE DATASET IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE DATASET OR THE USE OR OTHER DEALINGS IN THE DATASET.
关于本数据集
下述两个数据集采集自比利时某繁忙火车站的21时至22时时段,包含捕获到的全部802.11管理帧(802.11 management frames)。两个数据集的采集间隔约为20分钟。
两个数据集均以pcap格式与CSV格式存储。CSV文件中记录了每帧的帧类型、时间戳、信号强度、服务集标识符(SSID,Service Set Identifier)以及媒体访问控制(MAC,Media Access Control)地址。为实现匿名化处理,pcap文件中已移除所有通用802.11元素。
匿名化处理
所有数据帧均通过移除识别信息或重命名标识符的方式完成匿名化。具体而言,对两个数据集应用了如下转换操作:
- 所有MAC地址均已重命名(示例:00:00:00:00:00:01)
- 所有SSID均已重命名(示例:NETWORK_1)
- 所有通用802.11元素均已从pcap文件中移除。
在pcap文件中,匿名化操作可能会因长度标签与实际数据不匹配而导致“损坏”的数据帧,但该文件及其数据帧仍可在Wireshark、Scapy等数据包分析工具中正常读取。用于执行匿名化的脚本已随数据集一同提供。
数据
### 数据集规格
| 统计项 | 数据集1 | 数据集2 |
|--------|---------|---------|
| 总数据帧数量 | 36306 | 60984 |
| 信标帧(Beacon frames) | 19693 | 27983 |
| 请求帧(Request frames) | 798 | 1580 |
| 响应帧(Response frames) | 15815 | 31421 |
| 已识别Wi-Fi网络数量 | 54 | 70 |
| 已识别MAC地址数量 | 2092 | 2705 |
| 已识别无线设备数量 | 128 | 186 |
| 捕获时长 | 480秒 | 422秒 |
### 数据集内容
两个数据集分别存储于目录`1/`与`2/`中,每个目录包含以下文件:
- `capture-X.pcap`:原始捕获数据的匿名化版本
- `capture-X.csv`:以CSV格式存储的每帧捕获内容(包含时间戳、MAC地址等信息)
- `anonymization.py`:用于移除标识符的匿名化脚本
- `README.md`:包含数据集相关说明的文档
授权许可
版权所有©2022-2023 Benjamin Vermunicht、Beat Signer、Maxim Van de Wynckel、布鲁塞尔自由大学(Vrije Universiteit Brussel)
特此免费授予任何获得本数据集及相关文档文件(以下简称“本数据集”)的人员不受限制地处理本数据集的权利,包括但不限于使用、复制、修改、合并、发布、分发、再许可以及销售本数据集副本的权利,同时允许向其提供本数据集的人员行使前述权利,但需遵守以下条件:
上述版权声明与本许可声明应包含在本数据集的所有副本或实质性使用部分中。
本数据集按“现状”提供,不附带任何明示或暗示的担保,包括但不限于适销性、特定用途适用性以及非侵权性的担保。在任何情况下,作者或版权持有人均不对因本数据集或本数据集的使用或其他交易行为产生的任何索赔、损害或其他责任承担责任,无论是合同诉讼、侵权诉讼还是其他诉讼。
创建时间:
2023-06-07



