N-WLASL2000
Source: Mendeley Data. Updated 2024-01-31. Indexed 2024-06-28.
Download link:
https://ieee-dataport.org/documents/n-wlasl2000
Description:
The N-WLASL dataset is a synthetic event-based dataset comprising 21,093 samples across 2,000 glosses. It was recorded by pointing an event camera at an LCD monitor that plays video frames from WLASL, the largest public word-level American Sign Language dataset. We use the DAVIS346 event camera, which has a resolution of 346x260, to record the display. The WLASL videos have a resolution of 256x256 and a frame rate of 25 Hz. To ensure accurate recording of the display, we implemented three video pre-processing procedures using the python-opencv and dv packages in Python:
1. Add black padding and red borders around the video frames to increase their size to 346x260.
2. Scale the frames to 1428x1080 in the original aspect ratio and center them on the monitor display.
3. Display all videos sequentially at the original frame rate of 25 Hz, pausing on the first frame of each video for 500 ms to prevent event bursts caused by switching videos.
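The layout and timing arithmetic behind these three procedures can be sketched in plain Python. This is only an illustration under stated assumptions, not the authors' python-opencv/dv code: the helper names `centered_offset` and `playback_ms` are hypothetical, the padding is assumed to be centered, the monitor is assumed to be 1920x1080 (the description gives only the scaled size), and the 500 ms pause is assumed to be added on top of the first frame's normal display time.

```python
# Sizes and rates taken from the dataset description.
SRC_SIZE = (256, 256)        # WLASL frame (width, height)
SENSOR_SIZE = (346, 260)     # DAVIS346 resolution
SCALED_SIZE = (1428, 1080)   # on-screen size after scaling
FPS = 25                     # original WLASL frame rate
PAUSE_MS = 500               # hold on the first frame of each video

# Assumption: the monitor resolution is not stated in the description.
DISPLAY_SIZE = (1920, 1080)


def centered_offset(inner, outer):
    """Return the top-left (x, y) offset that centers `inner` inside `outer`."""
    inner_w, inner_h = inner
    outer_w, outer_h = outer
    if inner_w > outer_w or inner_h > outer_h:
        raise ValueError("inner box does not fit inside outer box")
    return ((outer_w - inner_w) // 2, (outer_h - inner_h) // 2)


def playback_ms(num_frames, fps=FPS, pause_ms=PAUSE_MS):
    """Return the display time of one video in milliseconds.

    Assumption: the pause is added to the normal per-frame display time.
    """
    return pause_ms + num_frames * 1000 / fps


# Step 1: where the 256x256 frame sits inside the 346x260 padded canvas.
pad_x, pad_y = centered_offset(SRC_SIZE, SENSOR_SIZE)

# Step 2: where the scaled 1428x1080 image sits on the assumed display.
screen_x, screen_y = centered_offset(SCALED_SIZE, DISPLAY_SIZE)

# Step 3: display time for a hypothetical 50-frame clip.
clip_ms = playback_ms(50)
```

Under these assumptions the frame is padded by 45 pixels on the left and right and 2 pixels on the top and bottom, and a 50-frame clip occupies the display for 2.5 seconds.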
Created:
2024-01-31
Collected summary
Dataset introduction

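Background and challenges
Background overview
N-WLASL is a synthetic event-based sign language recognition dataset containing 21,093 samples that cover 2,000 American Sign Language glosses. It was generated by recording WLASL videos with a DAVIS346 event camera at 346x260 resolution, following a documented pre-processing pipeline, and is intended for research in event-based vision and sign language recognition.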
The above content was collected and summarized by 遇见数据集.



