WenetSpeech4TTS

Name: WenetSpeech4TTS
Creator: maas
Published: 2026-05-15 22:07:37
License: 暂无描述

魔搭社区2026-05-15 更新2024-07-13 收录

下载链接：

https://modelscope.cn/datasets/dukguo/WenetSpeech4TTS

下载链接

链接失效反馈

官方服务：

资源简介：

WenetSpeech4TTS is a multi-domain Mandarin corpus derived from the open-sourced WenetSpeech dataset. Tailored for the text-to-speech tasks, we refined WenetSpeech by adjusting segment boundaries, enhancing the audio quality, and eliminating speaker mixing within each segment. Following a more accurate transcription process and quality-based data filtering process, the obtained WenetSpeech4TTS corpus contains 12,800 hours of paired audio-text data. Furthermore, we have created subsets of varying sizes, categorized by segment quality scores to allow for TTS model training and finetuning.

WenetSpeech4TTS是一款源自开源WenetSpeech数据集的多领域普通话语料库。本语料库专为文本到语音（text-to-speech，TTS）任务定制，我们对原始WenetSpeech数据集进行了优化处理：调整语音片段边界、提升音频质量，并消除各片段内的说话人混叠问题。经更精准的转录流程与基于质量的数据筛选流程后，最终构建的WenetSpeech4TTS语料库包含12800小时的音文配对数据。此外，我们还依据片段质量评分构建了不同规模的子集，以支持文本到语音模型的训练与微调。

提供机构：

maas

创建时间：

2024-07-03

搜集汇总

数据集介绍