ArmanTTS
arXiv, updated 2023-04-07, indexed 2024-08-06
Download link:
http://arxiv.org/abs/2304.03585v1
Resource description:
ArmanTTS is a single-speaker text-to-speech (TTS) dataset for Persian, created by the School of Computer Engineering at Iran University of Science and Technology. It contains 8,449 samples totaling 9 hours, 12 minutes, and 14 seconds of mono audio recorded at a 22.05 kHz sampling rate, with an average signal-to-noise ratio of 25 dB. To build the dataset, the creators collected Persian text from OpenSubtitles, mapped the text to phonemes, and recorded the speech in a professional studio. ArmanTTS is intended for Persian text-to-speech synthesis and aims to address the scarcity of data in this area.
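The figures in the description allow a quick sanity check of the dataset's scale. The following is a minimal sketch that uses only the numbers quoted above; the variable and function names are illustrative and are not part of any ArmanTTS tooling.

```python
# Figures taken from the dataset description; all names here are illustrative.
NUM_SAMPLES = 8449
SAMPLE_RATE_HZ = 22_050          # 22.05 kHz, mono
HOURS, MINUTES, SECONDS = 9, 12, 14


def to_seconds(hours: int, minutes: int, seconds: int) -> int:
    """Convert an h/m/s duration to whole seconds."""
    return hours * 3600 + minutes * 60 + seconds


total_s = to_seconds(HOURS, MINUTES, SECONDS)   # total recorded audio, in seconds
mean_clip_s = total_s / NUM_SAMPLES             # average length of one sample
total_frames = total_s * SAMPLE_RATE_HZ         # mono audio frames in the corpus

print(f"total duration:   {total_s} s")
print(f"mean clip length: {mean_clip_s:.2f} s")
print(f"audio frames:     {total_frames:,}")
```

Under these figures the average sample is a little under four seconds long, which is consistent with short subtitle-style sentences.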
Provided by:
School of Computer Engineering, Iran University of Science and Technology
Created:
2023-04-07



