VocalSound 16k in WebDataset Format
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14649750
下载链接
链接失效反馈官方服务:
资源简介:
This dataset is the VocalSound dataset, formatted in the WebDataset format. WebDataset files are essentially tar archives, where each example in the dataset is represented by a pair of files: a WAV audio file and a corresponding JSON metadata file. The JSON file contains the class label and other relevant information for that particular audio sample.
$ tar tf wds-audio-test-000000.tar | headm3109_0_laughter.jsonm3109_0_laughter.wavf0238_0_sniff.jsonf0238_0_sniff.wavm0526_0_cough.jsonm0526_0_cough.wavm0886_0_sneeze.jsonm0886_0_sneeze.wavo1897_0_laughter.jsono1897_0_laughter.wav
$ cat m3109_0_laughter.json{"label": "laughter"}
创建时间:
2025-01-15



