Internally Collected Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://nc-ai.github.io/speech/publications/nonhuman-vc/
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含82,008个音频样本,分为三类:表达性发音(37,332个样本)、声音设计非人声(42,800个样本)以及动物声音(1,886个样本)。所有音频样本均以44.1千赫兹、16位WAV格式存储。该数据集的规模为82,008个音频样本,其任务是实现人声到非人声的转换。
This dataset contains 82,008 audio samples categorized into three classes: expressive speech (37,332 samples), sound-designed non-vocal sounds (42,800 samples), and animal sounds (1,886 samples). All audio samples are stored in 44.1 kHz, 16-bit WAV format. The task of this dataset is vocal-to-non-vocal conversion.



