波兰读音语料库
收藏arXiv2017-06-01 更新2024-06-21 收录
下载链接:
http://mowa.clarin-pl.eu
下载链接
链接失效反馈官方服务:
资源简介:
波兰读音语料库是由波兰-日本信息技术学院创建的大型高质量录音语料库,旨在支持波兰语音研究的发展。该语料库包含317名发言者的554次录音,总计约56小时的音频数据,包含356674个词汇,来自46361个词汇的词汇表。数据集在专业录音室环境中使用两种麦克风录制,包括高质量的录音室麦克风和普通消费者音频耳机。此外,还包含一个小型的电话质量语料库。该语料库的应用领域广泛,包括语音处理工具的开发、语音识别系统的训练以及语音学和发音研究。
The Polish Pronunciation Corpus is a large-scale high-quality recorded speech corpus created by the Polish-Japanese Academy of Information Technology, aiming to support the advancement of Polish phonetic research. It comprises 554 recordings from 317 speakers, totaling approximately 56 hours of audio data, containing 356,674 lexical items derived from a vocabulary list of 46,361 words. The dataset was recorded in a professional studio environment using two types of microphones: high-quality studio microphones and standard consumer audio headsets. Additionally, it includes a small-scale telephone-quality speech corpus. This corpus has a wide range of application scenarios, including the development of speech processing tools, the training of speech recognition systems, as well as phonetic and articulatory research.
提供机构:
波兰-日本信息技术学院
创建时间:
2017-06-01



