five

Replication Data for: Self-Learning for Personalized Keyword Spotting on Ultra-Low-Power Audio Sensors

收藏
DataCite Commons2025-03-25 更新2025-04-16 收录
下载链接:
https://rdr.kuleuven.be/citation?persistentId=doi:10.48804/SVEABM
下载链接
链接失效反馈
官方服务:
资源简介:
This repository includes the data for reproducing the results of the paper: "Self-Learning for Personalized Keyword Spotting on Ultra-Low-Power Audio Sensors". In this work, we recorded a dataset of speech samples with our microphone sensor after replaying a subset of data from the HeySnips dataset using a speaker. The collected speech data (total of 400 samples) are split between a testset and a trainset, both including "Hey Snips" utterances and non-"Hey Snips" utterances. In particular, the data of the testset is composed by recordings from 20 random speakers from the original testset. After the recording, the data were fed to our DNN models deployed on devices. Initially, a per-speaker prototype vector is computed by feeding three audio recordings of the target keywords. Next, the audio tracks of the training set are processed with a sliding window approach to compute the distance with respect to the prototype and assign pseudo-labels for the self-learning task. The dataset is, therefore, composed of two main partitions. First, the "recorded_speech_data" includes the audio recordings. Note that this dataset is under restricted access to not violate the terms of access of the original dataset. Second, the "processed_outputs" includes the output of the processing, i.e. the measured distances. By using the dataset in combination with the associated code, every user will be able to reproduce the results of the paper.
提供机构:
KU Leuven RDR
创建时间:
2024-10-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作