A Smartphone Camera Based RGB Video Dataset of Natural Hand Gestures
Source: Mendeley Data (indexed 2026-04-18)
Download link:
https://data.mendeley.com/datasets/n66hhk695h
Description:
This dataset contains 1,352 high-quality RGB video recordings of 13 everyday hand gestures, captured entirely with a Samsung Galaxy S23 smartphone in a variety of real-world lighting conditions. It was created to support gesture recognition research on consumer-grade devices without relying on depth or infrared sensors, emphasizing practical and cost-effective solutions. The recordings feature 26 participants: all of them perform the gestures in a full-body view, and two additionally perform them in a hand-only view. Each gesture has 92 full-body videos and 12 hand-only videos, for 104 videos per gesture (13 × 104 = 1,352 videos in total).
The dataset captures natural variation in appearance and environment, with gestures performed under a wide range of lighting conditions, including outdoor daylight, dim indoor light, green and red LED lights, backlit scenes, natural white light, and warm light. Each video has been standardized to a resolution of 640×640 pixels at 30 frames per second, encoded in H.264 format, and contains no audio. Unlike some gesture datasets, this collection does not include extracted frames. All data is provided as complete video clips to allow maximum flexibility in preprocessing and model design.
The directory structure is organized by gesture class and capture mode, with a dedicated folder for each modality (“full_body” or “hand_only”). A comprehensive metadata file, metadata.csv, is provided at the root of the dataset. This file contains, for each video, the filename, gesture label, participant identifier, take number, capture mode, relative video path, duration in seconds, frame rate, and video resolution. This enables straightforward filtering, indexing, and integration into machine learning workflows.
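As an illustration of the filtering and indexing that metadata.csv is meant to support, the following is a minimal sketch using only the Python standard library. The column names (`gesture_label`, `capture_mode`, `video_path`, and so on) are assumptions based on the field descriptions above, not the file's confirmed header, and the gesture names, participant identifiers, and paths in the sample are made up for demonstration. Check the header row of the real file and adjust the constants before using it.

```python
import csv
import io
from collections import Counter

# Assumed column names; verify them against the header row of the real metadata.csv.
GESTURE_COL = "gesture_label"
MODE_COL = "capture_mode"
PATH_COL = "video_path"


def load_metadata(source):
    """Read metadata rows from an open text file (or file-like object) into a list of dicts."""
    return list(csv.DictReader(source))


def filter_by_mode(rows, mode):
    """Keep only the rows recorded in the given capture mode ("full_body" or "hand_only")."""
    return [row for row in rows if row[MODE_COL] == mode]


def count_per_gesture(rows):
    """Count how many videos each gesture label has in the given rows."""
    return Counter(row[GESTURE_COL] for row in rows)


# Tiny made-up sample standing in for the real file (hypothetical values throughout).
SAMPLE = """\
filename,gesture_label,participant_id,take,capture_mode,video_path
a.mp4,wave,P01,1,full_body,wave/full_body/a.mp4
b.mp4,wave,P02,1,hand_only,wave/hand_only/b.mp4
c.mp4,stop,P01,1,full_body,stop/full_body/c.mp4
"""

if __name__ == "__main__":
    rows = load_metadata(io.StringIO(SAMPLE))
    full_body = filter_by_mode(rows, "full_body")
    for row in full_body:
        print(row[PATH_COL])
    print(count_per_gesture(full_body))
```

To run this against the actual dataset, pass an open file instead of the in-memory sample, for example `open("metadata.csv", newline="", encoding="utf-8")`. If the column assumptions hold, counting per gesture over the full file should give 92 full-body and 12 hand-only videos for each of the 13 gestures.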
With its combination of diverse participants, varied lighting, and multiple capture perspectives, this dataset provides a realistic and challenging benchmark for developing and evaluating gesture recognition systems. It is particularly well-suited for research that aims to achieve robustness against lighting changes, performer variability, and environmental diversity, as well as for projects exploring mobile-friendly, real-world computer vision solutions.
Created:
2025-08-13



