SWL-LSE: SignaMed Word-Level LSE, a Dataset of Spanish Sign Language Health Signs
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13691886
下载链接
链接失效反馈官方服务:
资源简介:
SWL-LSE Dataset
The SWL-LSE dataset is coined from SignaMed Word-Level LSE (Lengua de Signos Española -Spanish Sign Language).
Overview
The dataset consists of 8,000 sign sequences from 300 different sign classes related to the health domain. Each class is represented by an RGB video that serves as the dictionary sign. These dictionary signs were reproduced by 124 signers, including deaf individuals, interpreters, and L2 Spanish Sign Language (LSE) students, using their webcams or mobile phones via the SignaMed platform (https://signamed.web.app). For privacy reasons, only the skeleton data is shared.
The process of collecting the dataset is described in:
Vázquez-Enríquez, M.; Alba-Castro, J.L.; Pérez-Pérez, A.; Cabeza-Pereiro, C.; Docío-Fernández, L. SignaMed: a CooperativeBilingual LSE-Spanish Dictionary in the Healthcare Domain. In Proceedings of the Proceedings of the LREC-COLING 202411th Workshop on the Representation and Processing of Sign Languages: Evaluation of Sign Language Resources; Efthimiou, E.; Fotinea, S.E.; Hanke, T.; Hochgesang, J.A.; Mesch, J.; Schulder, M., Eds., Torino, Italia, 2024; pp. 386–394.
The dataset itself and the pipeline for training and executing a baseline model based on skeletons is described in this github (https://github.com/mvazquezgts/SWL-LSE), and this paper:
Vázquez-Enríquez, M.; Alba-Castro, J.L.; Docío-Fernández, L.; Rodríguez-Banga, E. SWL-LSE: A Dataset of Spanish Sign Language Health Signs with an ISLR Baseline Method. Technologies 2024, 12(10), 205, D.O.I:10.3390/technologies12100205
Files
1. VIDEOS_REF.zip
Description: RGB videos recorded in lab conditions that represent each sign-class
Total files: 300
2. videos_ref_annotations.csv
Description: CSV file with the correspondence between the name of the video, its class ID and gloss in spanish: FILENAME,CLASS_ID,LABEL.
Total files: 1
3. ANNOTATIONS.zip
Description: 3 CSV files with train, validation and test file-class correspondences: FILENAME,CLASS_ID
Total files: 3
4. MEDIAPIPE.zip
Description: Pickle files containing the full output of Mediapipe using their Heavy model. Each .pkl file contains the outputs of Mediapipe Holistic legacy, Mediapipe Pose and Mediapipe Hands. Each file is package as a dictionary: dict_keys(['pose', 'hands', 'holistic_legacy'])
Total files: 8000
Usage
Researchers and practitioners in pattern recognition, machine learning, and sign language linguistics may find this dataset valuable for:
Training/testing machine learning models for isolated sign language recognition or gesture recognition.
Analyzing patterns on signs realization
Acknowledgments
This dataset is a collaborative effort of the next research goups and entities:
Group of Multimedia Technologies (GTM) from the atlanTTic Research Center of University of Vigo (Spain)
Group of Discourse and Society (GRADES) from the School of Philology and Translation of University of Vigo (Spain)
Federation of Deaf People Galician Associations (FAXPG)
Fundación CNSE-DILSE
Gratitude is extended to them for their contributions and support.
创建时间:
2025-02-17



