MASRI Synthetic
收藏DataCite Commons2022-09-12 更新2024-07-13 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2022S08
下载链接
链接失效反馈官方服务:
资源简介:
<h3>Introduction</h3><br>
<p>MASRI (Maltese Automatic Speech Recognition I) Synthetic was developed by the <a href="https://www.um.edu.mt/projects/masri/">MASRI team </a> at the <a href="https://www.um.edu.mt/">University of Malta</a> and consists of approximately 99 hours of synthesized Maltese speech.</p><br>
<h3>Data</h3><br>
<p>Source sentences were extracted from the <a href="https://mlrs.research.um.edu.mt/index.php?page=corpora">Maltese Language Resource Server</a> (MLRS) corpus, comprised of written or transcribed Maltese covering various genres, including parliamentary debates, news, law, opinion, sports, culture, academic, literature and religious texts. Text was processed through the CrimsonWing text-to-speech system to generate speech files. Synthesized speech was created with 210 voices (105 male and 105 female).</p><br>
<p>Audio files are presented as 16kHz, 16-bit, single channel flac files. When uncompressed, they produce PCM wav files.</p><br>
<p>Transcripts are contained in a single plain text file encoded as UTF-8.</p><br>
<h3>Samples</h3><br>
<p>Please view the following samples:</p><br>
<ul><br>
<li><a href="desc/addenda/LDC2022S08.f.flac">Female Audio (FLAC)</a></li><br>
<li><a href="desc/addenda/LDC2022S08.f.txt">Female Transcript (TXT)</a></li><br>
<li><a href="desc/addenda/LDC2022S08.m.flac">Male Audio (FLAC)</a></li><br>
<li><a href="desc/addenda/LDC2022S08.m.txt">Male Transcript (TXT)</a></li><br>
</ul><br>
<h3>Updates</h3><br>
<p>None at this time.</p></br>
Portions © 2022 University of Malta, © 2022 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2022-09-12



