five

MASRI Synthetic

收藏
DataCite Commons2022-09-12 更新2024-07-13 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2022S08
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>MASRI (Maltese Automatic Speech Recognition I) Synthetic was developed by the <a href="https://www.um.edu.mt/projects/masri/">MASRI team </a> at the <a href="https://www.um.edu.mt/">University of Malta</a> and consists of approximately 99 hours of synthesized Maltese speech.</p><br> <h3>Data</h3><br> <p>Source sentences were extracted from the <a href="https://mlrs.research.um.edu.mt/index.php?page=corpora">Maltese Language Resource Server</a> (MLRS) corpus, comprised of written or transcribed Maltese covering various genres, including parliamentary debates, news, law, opinion, sports, culture, academic, literature and religious texts. Text was processed through the CrimsonWing text-to-speech system to generate speech files. Synthesized speech was created with 210 voices (105 male and 105 female).</p><br> <p>Audio files are presented as 16kHz, 16-bit, single channel flac files. When uncompressed, they produce PCM wav files.</p><br> <p>Transcripts are contained in a single plain text file encoded as UTF-8.</p><br> <h3>Samples</h3><br> <p>Please view the following samples:</p><br> <ul><br> <li><a href="desc/addenda/LDC2022S08.f.flac">Female Audio (FLAC)</a></li><br> <li><a href="desc/addenda/LDC2022S08.f.txt">Female Transcript (TXT)</a></li><br> <li><a href="desc/addenda/LDC2022S08.m.flac">Male Audio (FLAC)</a></li><br> <li><a href="desc/addenda/LDC2022S08.m.txt">Male Transcript (TXT)</a></li><br> </ul><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2022 University of Malta, © 2022 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2022-09-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作