Polish Speech Database
Source: DataCite Commons | Updated: 2021-07-01 | Indexed: 2025-04-16
Download link:
https://catalog.ldc.upenn.edu/LDC2019S19
Description:
<h3>Introduction</h3><br>
<p>Polish Speech Database was developed by <a href="https://www.voicelab.ai/">VoiceLab</a>. It consists of 263,424 utterances of Polish speech data from 200 speakers, totaling approximately 280 hours, and corresponding transcripts.</p><br>
<p>Data collection was performed in Poland. Speakers were asked to record themselves for at least 60 minutes on their home computers, using a headset, while reading text displayed on a website. The text consisted of sentences covering most speech sounds in Polish.</p><br>
<p>The database includes speaker metadata. There were 103 male speakers and 97 female speakers, ranging in age from 15 to 60 years. Most were between 15 and 30 years old.</p><br>
<h3>Data</h3><br>
<p>Speech data is presented as 16 kHz, 16-bit, single-channel audio in FLAC-compressed WAV files. Transcripts are UTF-8 encoded plain text.</p><br>
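<p>As a minimal sketch (not part of the LDC release), the stated audio format can be checked by reading the STREAMINFO block at the start of a FLAC file. The function names and the file path below are hypothetical, and the parser assumes the stream begins directly with the <code>fLaC</code> marker, with no prepended tag data.</p><br>

```python
from pathlib import Path


def parse_flac_streaminfo(data: bytes) -> dict:
    """Parse STREAMINFO from the first 42 bytes of a FLAC stream."""
    if len(data) < 42 or data[:4] != b"fLaC":
        raise ValueError("not a FLAC stream (missing 'fLaC' marker)")
    # Metadata block header: 1 bit last-block flag, 7 bits type, 24 bits length.
    block_type = data[4] & 0x7F
    block_len = int.from_bytes(data[5:8], "big")
    if block_type != 0 or block_len != 34:
        raise ValueError("first metadata block is not STREAMINFO")
    info = data[8:42]
    # Bytes 10..17 pack: 20-bit sample rate, 3-bit (channels - 1),
    # 5-bit (bits per sample - 1), 36-bit total sample count.
    packed = int.from_bytes(info[10:18], "big")
    sample_rate = packed >> 44
    total_samples = packed & ((1 << 36) - 1)
    return {
        "sample_rate_hz": sample_rate,
        "channels": ((packed >> 41) & 0x7) + 1,
        "bits_per_sample": ((packed >> 36) & 0x1F) + 1,
        "total_samples": total_samples,
        "duration_s": total_samples / sample_rate if sample_rate else 0.0,
    }


def matches_stated_format(info: dict) -> bool:
    """True if the stream is 16 kHz, 16-bit, single channel."""
    return (
        info["sample_rate_hz"] == 16000
        and info["bits_per_sample"] == 16
        and info["channels"] == 1
    )


def read_flac_info(path: str) -> dict:
    """Read only the header bytes of a file and parse them."""
    with Path(path).open("rb") as f:
        return parse_flac_streaminfo(f.read(42))


# Usage (hypothetical local path to a downloaded sample):
# info = read_flac_info("LDC2019S19.k.flac")
# print(info, matches_stated_format(info))
```

<p>Only the first 42 bytes of each file are read, so the check stays cheap even across a few hundred thousand utterances.</p><br>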
<h3>Samples</h3><br>
<p>Please view the following samples.</p><br>
<ul><br>
<li><a href="desc/addenda/LDC2019S19.k.flac">Female Speech</a></li><br>
<li><a href="desc/addenda/LDC2019S19.k.txt">Female Transcript</a></li><br>
<li><a href="desc/addenda/LDC2019S19.m.flac">Male Speech</a></li><br>
<li><a href="desc/addenda/LDC2019S19.m.txt">Male Transcript</a></li><br>
</ul><br>
<h3>Updates</h3><br>
<p>None at this time.</p><br>
Portions © 2019 VoiceLab.ai, © 2019 Trustees of the University of Pennsylvania
Provider:
Linguistic Data Consortium
Created:
2020-11-30



