five

CHM150

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2016S04
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>CHM150 (Corpus Hecho en M&eacute;xico 150) was developed by the <a href="http://odin.fi-b.unam.mx/profesores/abelherrera/">Speech Processing Laboratory</a> of the Faculty of Engineering at the <a href="http://www.unam.mx/">National Autonomous University of Mexico</a> (UNAM) and consists of approximately 1.63 hours of Mexican Spanish speech, associated transcripts, and speaker metadata. The goal of this work was to support spoken term detection and forensic speaker identification.</p><br> <p>LDC has released the following data sets in the CIEMPIESS series:</p><br> <ul><br> <li>CIEMPIESS (<a href="../../../LDC2015S07">LDC2015S07</a>)</li><br> <li>CIEMPIESS Light (<a href="../../../LDC2017S23">LDC2017S23</a>)</li><br> <li>CIEMPIESS Balance (<a href="../../../LDC2018S11">LDC2018S11</a>)</li><br> <li>CIEMPIESS Experimentation (<a href="../../../LDC2019S07">LDC2019S07</a>)</li><br> </ul><br> <h3>Data</h3><br> <p>This corpus is comprised of Mexican Spanish microphone speech from 75 male speakers and 75 female speakers in a quiet office environment. Speakers could answer pre-selected open questions or describe a particular painting shown to them on a computer monitor.</p><br> <p>Speaker metadata in this release includes age, gender, place of birth, place of residence and parents' nationalities.</p><br> <p>The audio files are presented as to 16 kHz, 16-bit PCM flac compressed wav.</p><br> <h3>Samples</h3><br> <p>Please view this <a href="desc/addenda/LDC2016S04.wav">audio sample</a> and <a href="desc/addenda/LDC2016S04.txt">text sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2012, 2016 Carlos Daniel Hernández Mena, © 2016 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作