five

Polish Speech Database

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2019S19
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>Polish Speech Database was developed by <a href="https://www.voicelab.ai/">VoiceLab</a>. It consists of 263,424 utterances of Polish speech data from 200 speakers, totaling approximately 280 hours, and corresponding transcripts.</p><br> <p>Data collection was performed in Poland. Speakers were asked to record themselves for at least 60 minutes from their home computer using a headset while reading text on a website. The text was comprised of sentences covering most speech sounds in Polish.</p><br> <p>The database includes speaker metadata. There were 103 male speakers and 97 female speakers. Their ages ranged from 15 years to 60 years of age. Most were in the 15-30 years age range.</p><br> <h3>Data</h3><br> <p>Speech data is presented as 16,000 Hz, 16-bit, single channel, flac compressed wav files. Transcripts are UTF-8 encoded plain text.</p><br> <h3>Samples</h3><br> <p>Please view the following samples.</p><br> <ul><br> <li><a href="desc/addenda/LDC2019S19.k.flac">Female Speech</a></li><br> <li><a href="desc/addenda/LDC2019S19.k.txt">Female Transcript</a></li><br> <li><a href="desc/addenda/LDC2019S19.m.flac">Male Speech</a></li><br> <li><a href="desc/addenda/LDC2019S19.m.txt">Male Transcript</a></li><br> </ul><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2019 VoiceLab.ai, © 2019 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作