txya900619/vggsound-16k
收藏Hugging Face2025-03-07 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/txya900619/vggsound-16k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含文本和音频特征的数据集,文本特征为caption,音频特征为audio,音频采样率为16000Hz。数据集分为训练集和测试集,训练集包含183631个样本,测试集包含15463个样本。整个数据集的大小为约63.5GB,下载大小为约61.9GB。数据集提供了默认配置,指定了训练集和测试集的数据文件路径。
This dataset includes text and audio features, with the text feature named caption and the audio feature named audio at a sampling rate of 16000Hz. The dataset is split into a training set and a test set, with the training set containing 183631 samples and the test set containing 15463 samples. The total size of the dataset is approximately 63.5GB, and the download size is about 61.9GB. The dataset provides a default configuration specifying the data file paths for the training and test sets.
提供机构:
txya900619



