Articulation Index LSCP
收藏DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2015S12
下载链接
链接失效反馈官方服务:
资源简介:
<h3>Introduction</h3><br>
<p>Articulation Index LSCP was developed by researchers at <a href="http://www.lscp.net/index.php?lang=en">Laboratoire de Sciences Cognitives et Psycholinguistique (LSCP), Ecole Normale Supérieure</a>. It revises and enhances a subset of Articulation Index (AIC) (<a href="../../../LDC2005S22">LDC2005S22</a>), a corpus of persons speaking English syllables. Changes include the addition of forced alignment to sound files, time alignment of syllable utterances and format conversions.</p><br>
<p>AIC consists of 20 American English speakers (12 males, 8 females) pronouncing syllables, some of which form actual words, but most of which are nonsense syllables. All possible Consonant-Vowel (CV) and Vowel-Consonant (VC) combinations were recorded for each speaker twice, once in isolation and once within a carrier-sentence, for a total of 25768 recorded syllables.</p><br>
<h3>Data</h3><br>
<p>Articulation Index LSCP alters AIC in the following ways.</p><br>
<ol><br>
<li>Time-alignments for the onset and offset of each word and syllable were generated through forced-alignment with a standard HMM-GMM (Hidden Markov Model-Gaussian Mixture Model) ASR system.</li><br>
<li>The time-alignments for the beginning and end of the syllables (whether in isolation or within a carrier sentence) were manually adjusted. The time-alignments for the other words in carrier sentences were not manually adjusted.</li><br>
<li>The recordings of isolated syllables were cut according to the manual time-alignments to remove the silent portions at the beginning and end, and the time-alignments were altered to correspond to the cut recordings.</li><br>
<li>The file naming scheme was slightly altered for compatibility with the <a href="http://kaldi.sourceforge.net/">Kaldi speech recognition toolkit</a>.</li><br>
<li>AIC contains a wide-band (16 KHz, 16-bit PCM) and a narrow-band (8 KHz, 8 bit u-law) version of the recordings distributed in sphere format. The LSCP version contains the wide-band version only distributed as wave files.</li><br>
</ol><br>
<p>This release does not include certain AIC triphone recordings (CVC, CCV or VCC).</p><br>
<p>Audio data is presented as 16kHz 16-bit flac compressed .wav files. The flac compression was added for distribution, and documentation may refer to the files as .wav files.</p><br>
<h3>Samples</h3><br>
<p>Please listen to this <a href="desc/addenda/LDC2015S12.wav">audio sample</a>.</p><br>
<h3>Updates</h3><br>
<p>None at this time.</p></br>
Portions © 2015 Tomas Bergvelt, Anna Kolesnikov, Xuan-Nga Cao, Thomas Schatz, Emmanuel Dupoux, © 2005, 2015 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30



