Italian TTS Speech Corpus (Appen)

Name: Italian TTS Speech Corpus (Appen)
Creator: ELRA (European Language Resources Association)
Published: 2005-05-12 00:00:00
License: No description available

Source: catalog.elra.info (updated 2005-05-12, indexed 2025-01-21)

Download link:

https://catalog.elra.info/en-us/repository/browse/ELRA-S0148/


Resource description:

The Italian TTS Speech Corpus contains recordings of one native Italian speaker (male, 50 years old), made in a studio over a single channel with a Shure SM15 unidirectional professional head-worn condenser microphone. The data collection and transcription were performed by Appen (Australia). Speech samples are stored as 16-bit, 22.05 kHz PCM in uncompressed WAV files. The speaker read 3,300 prompted sentences covering all legal triphones and diphones. The database is provided with orthographic transcriptions in SAMPA, including canonical and alternative pronunciations, and markings for syllables, stress and acoustic events. All transcriptions were segmented at the utterance (sentence/command word) level, annotated at the word level and checked manually. A pronunciation lexicon of 7,300 headwords (plus variants) is also available. The database is intended for use in text-to-speech and speech synthesis applications.
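As a quick sanity check on a downloaded copy, the audio format stated above (one channel, 16-bit samples, 22.05 kHz) can be compared against each WAV header. The following is a minimal sketch using only Python's standard `wave` module. The function names and the `EXPECTED` dictionary are illustrative and not part of the corpus distribution, and nothing is assumed here about the corpus's directory layout or file naming.

```python
import wave

# Format stated in the corpus description: mono, 16-bit, 22.05 kHz PCM.
# These names are illustrative; they do not come from the corpus itself.
EXPECTED = {"channels": 1, "sample_width_bytes": 2, "sample_rate_hz": 22050}


def read_wav_format(path):
    """Return the header parameters of an uncompressed PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": wav.getframerate(),
        }


def matches_corpus_format(path):
    """True if the file header matches the format given in the description."""
    return read_wav_format(path) == EXPECTED
```

Calling `matches_corpus_format` on the path of any WAV file from the corpus should return `True` if the file follows the description. A `False` result, or a `wave.Error` raised while opening the file, would suggest the file was resampled, converted or corrupted.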


Provider:

ELRA (European Language Resources Association)
