ifobito/viet_bud-tts-text-tags-500-v1
Source: Hugging Face. Updated: 2025-01-04. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/ifobito/viet_bud-tts-text-tags-500-v1
Description:
This dataset contains text together with acoustic features of speech: mean utterance pitch, standard deviation of utterance pitch, signal-to-noise ratio (snr), c50, speaking rate (speaking_rate), phonemes, stoi, si-sdr, pesq, noise type (noise), reverberation, speech monotony (speech_monotony), SDR under noise (sdr_noise), and PESQ speech quality (pesq_speech_quality). It is split into training, validation, and test sets of 634,158, 7,500, and 7,500 examples respectively.
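To put the split sizes in proportion, here is a minimal Python sketch that computes each split's share of the total from the counts quoted in the description. The `load_split` helper is hypothetical and not run here. It assumes the Hugging Face `datasets` library is installed, that the repository ID matches the download link, and that the splits are named `train`, `validation`, and `test`, none of which this listing confirms.

```python
# Split sizes as stated in the description.
SPLIT_SIZES = {"train": 634_158, "validation": 7_500, "test": 7_500}


def split_fractions(sizes):
    """Return each split's share of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def load_split(split="validation"):
    """Hypothetical helper: fetch one split from the Hugging Face Hub."""
    # Imported lazily so the arithmetic above works without the library.
    from datasets import load_dataset

    # Assumption: split names follow the train/validation/test convention.
    return load_dataset("ifobito/viet_bud-tts-text-tags-500-v1", split=split)


if __name__ == "__main__":
    print(f"total examples: {sum(SPLIT_SIZES.values())}")
    for name, fraction in split_fractions(SPLIT_SIZES).items():
        print(f"{name}: {fraction:.2%}")
```

By these counts the training split holds almost all of the data, so the much smaller validation split is the cheaper one to download when only inspecting the fields.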
Provider:
ifobito



