SpeechCommands in WebDataset Format
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/14722646
Description:
This is the Speech Commands dataset, repackaged in the WebDataset format. WebDataset shards are plain tar archives in which each example is stored as a pair of files that share a base name: a WAV audio file and a corresponding JSON metadata file. The JSON file contains the class label (under the "labels" key) and any other relevant information for that audio sample.
$ tar tvf wds-audio-train_0000000.tar | head
-r--r--r-- bigdata/bigdata 19 2025-01-10 08:58 right_7e783e3f_nohash_1.json
-r--r--r-- bigdata/bigdata 32044 2025-01-10 08:58 right_7e783e3f_nohash_1.wav
-r--r--r-- bigdata/bigdata 16 2025-01-10 08:58 up_c79159aa_nohash_3.json
-r--r--r-- bigdata/bigdata 32044 2025-01-10 08:58 up_c79159aa_nohash_3.wav
-r--r--r-- bigdata/bigdata 18 2025-01-10 08:58 left_2b42e7a2_nohash_3.json
-r--r--r-- bigdata/bigdata 32044 2025-01-10 08:58 left_2b42e7a2_nohash_3.wav
-r--r--r-- bigdata/bigdata 18 2025-01-10 08:58 left_c79159aa_nohash_4.json
-r--r--r-- bigdata/bigdata 32044 2025-01-10 08:58 left_c79159aa_nohash_4.wav
-r--r--r-- bigdata/bigdata 18 2025-01-10 08:58 left_708b8d51_nohash_0.json
-r--r--r-- bigdata/bigdata 32044 2025-01-10 08:58 left_708b8d51_nohash_0.wav
$ cat right_7e783e3f_nohash_1.json
{"labels": "right"}
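The WAV/JSON pairing shown above can be read back with nothing but the Python standard library. The following is a minimal sketch, not the dataset authors' loader: the function name iter_samples is hypothetical, and it assumes that every sample consists of exactly one .wav and one .json member sharing a base name, as in the listing. It returns the raw WAV bytes and leaves audio decoding to the caller.

```python
import json
import tarfile
from pathlib import PurePosixPath


def iter_samples(tar_path):
    """Yield (key, wav_bytes, metadata) for each WAV/JSON pair in one shard.

    Assumption: the two files of a sample share a base name, e.g.
    right_7e783e3f_nohash_1.json and right_7e783e3f_nohash_1.wav.
    """
    pending = {}  # key -> parts of a sample seen so far
    with tarfile.open(tar_path, "r") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and other special entries
            path = PurePosixPath(member.name)
            key, ext = path.stem, path.suffix.lower()
            data = archive.extractfile(member).read()
            sample = pending.setdefault(key, {})
            if ext == ".wav":
                sample["wav"] = data
            elif ext == ".json":
                sample["meta"] = json.loads(data)
            # Emit the sample as soon as both halves have been read.
            if "wav" in sample and "meta" in sample:
                del pending[key]
                yield key, sample["wav"], sample["meta"]
```

Called on the shard listed above, iter_samples("wds-audio-train_0000000.tar") should yield right_7e783e3f_nohash_1 first, with 32044 bytes of WAV data and the metadata {"labels": "right"}. Because members are matched by base name only, a shard that stored samples in subdirectories with repeated file names would need a key that includes the directory.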
Created:
2025-01-23



