Spatially-diverse High-dimensional Channel State Information (CSI) based Dataset for Human Activity Recognition (HAR)
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/11201413
下载链接
链接失效反馈官方服务:
资源简介:
Introduction
The data collection for this novel channel state information (CSI)-based human activity recognition (HAR) dataset was conducted in a rigorously controlled laboratory environment. The space, measuring 5 meters x 8 meters x 3 meters, served as a well-defined testing ground. Four strategically positioned ESP32 devices formed transceiver pairs in a diagonal network, functioning as both transmitters and receivers, which are separated by a distance of 1.5 meters. The transmitters, powered by external power banks for consistent operation, were mounted on tripods at a height of 1.5 meters on the north and east corners. Their corresponding receivers, connected to laptops via USB for real-time CSI data acquisition, were positioned on the south and west corners.
This dataset captures a wider range of activities (including subtle movements) and accounts for variations in body type and environmental conditions. It achieves this by collecting CSI data in a controlled environment using multiple transmitter-receiver pairs positioned at different orientations. This setup captures CSI information across 166 subcarriers used in Wi-Fi Wi-Fi IEEE 802.11n on channel 11, providing a richer and more nuanced view of human movement compared to traditional datasets. This data is expected to lead to the development of more robust and generalizable HAR models with higher accuracy and real-world applicability.
Description of Dataset
The dataset is housed within a directory named "SHD-HAR-Dataset-main" in the repository. This directory is further divided into "raw" and "amplitude" subdirectories. Data in "raw" and "amplitude" directories is further categorized based on participant orientation relative to the transceivers ("front/side"). It's important to note that the samples are synchronized between the "front" and "side" folders. This is because both devices collected data simultaneously for a particular activity. To summarize, the file "activityX.csv" in the "front" folder was collected at the same time as the corresponding "activityX.csv" file in the "side" folder, where "X" represents a unique identifier for each sample.
The "raw" directory stores the unprocessed CSI data for each activity sample. These samples are stored as individual CSV files ("activityX.csv") containing 300-450 packets of CSI data obtained within 5 seconds. Each packet encompasses 25 distinct data fields, resulting in a total of 300-450 x 25 data entries per activity stored within the ".csv" file.
The "amplitude" directory contains the processed signal amplitude data for 166 subcarriers. This directory mirrors the structure of the "raw" directory, with subdirectories for "front" and "side" orientations and "activityX.csv" files for each sample. However, the data within these files is transformed into a 300-450 x 166 matrix, representing the extracted signal amplitudes from the original CSI data. This format significantly reduces dimensionality while preserving key information for HAR analysis.
创建时间:
2024-05-16



