Monolingual Speech Dataset

Indexed from arXiv, 2025-09-30

Download link:

https://lism13.github.io/demo/CrossSpeech


Description:

This dataset combines monolingual speech datasets in three languages (English, Chinese, and Korean) and is intended for training and evaluating cross-lingual text-to-speech (TTS) systems. All audio is sampled at 22,050 Hz, and the corresponding transcriptions have been converted to International Phonetic Alphabet (IPA) symbols.
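The stated 22,050 Hz sampling rate can be checked on downloaded audio before training. The following is a minimal sketch only: it assumes the audio is stored as PCM WAV files, which this listing does not confirm, and the helper name `has_expected_rate` is hypothetical. It uses only the Python standard library.

```python
import wave

# Sampling rate stated in the dataset description.
EXPECTED_RATE = 22_050


def has_expected_rate(path, expected=EXPECTED_RATE):
    """Return True if the WAV file at `path` uses the expected sampling rate.

    Assumption: the file is a PCM WAV file readable by the stdlib `wave`
    module. The actual file format of the dataset is not stated in the listing.
    """
    with wave.open(path, "rb") as wav:
        return wav.getframerate() == expected
```

Files that fail this check would need resampling, or should be excluded, before being mixed with the rest of the data.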
