Articulation Index LSCP

Name: Articulation Index LSCP
Creator: Linguistic Data Consortium
Published: 2021-07-01 16:27:25
License: 暂无描述

DataCite Commons2021-07-01 更新2025-04-16 收录

下载链接：

https://catalog.ldc.upenn.edu/LDC2015S12

下载链接

链接失效反馈

官方服务：

资源简介：

<h3>Introduction</h3><br> <p>Articulation Index LSCP was developed by researchers at <a href="http://www.lscp.net/index.php?lang=en">Laboratoire de Sciences Cognitives et Psycholinguistique (LSCP), Ecole Normale Supérieure</a>. It revises and enhances a subset of Articulation Index (AIC) (<a href="../../../LDC2005S22">LDC2005S22</a>), a corpus of persons speaking English syllables. Changes include the addition of forced alignment to sound files, time alignment of syllable utterances and format conversions.</p><br> <p>AIC consists of 20 American English speakers (12 males, 8 females) pronouncing syllables, some of which form actual words, but most of which are nonsense syllables. All possible Consonant-Vowel (CV) and Vowel-Consonant (VC) combinations were recorded for each speaker twice, once in isolation and once within a carrier-sentence, for a total of 25768 recorded syllables.</p><br> <h3>Data</h3><br> <p>Articulation Index LSCP alters AIC in the following ways.</p><br> <ol><br> <li>Time-alignments for the onset and offset of each word and syllable were generated through forced-alignment with a standard HMM-GMM (Hidden Markov Model-Gaussian Mixture Model) ASR system.</li><br> <li>The time-alignments for the beginning and end of the syllables (whether in isolation or within a carrier sentence) were manually adjusted. The time-alignments for the other words in carrier sentences were not manually adjusted.</li><br> <li>The recordings of isolated syllables were cut according to the manual time-alignments to remove the silent portions at the beginning and end, and the time-alignments were altered to correspond to the cut recordings.</li><br> <li>The file naming scheme was slightly altered for compatibility with the <a href="http://kaldi.sourceforge.net/">Kaldi speech recognition toolkit</a>.</li><br> <li>AIC contains a wide-band (16 KHz, 16-bit PCM) and a narrow-band (8 KHz, 8 bit u-law) version of the recordings distributed in sphere format. The LSCP version contains the wide-band version only distributed as wave files.</li><br> </ol><br> <p>This release does not include certain AIC triphone recordings (CVC, CCV or VCC).</p><br> <p>Audio data is presented as 16kHz 16-bit flac compressed .wav files. The flac compression was added for distribution, and documentation may refer to the files as .wav files.</p><br> <h3>Samples</h3><br> <p>Please listen to this <a href="desc/addenda/LDC2015S12.wav">audio sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2015 Tomas Bergvelt, Anna Kolesnikov, Xuan-Nga Cao, Thomas Schatz, Emmanuel Dupoux, © 2005, 2015 Trustees of the University of Pennsylvania

提供机构：

Linguistic Data Consortium

创建时间：

2020-11-30

5,000+

优质数据集

54 个

任务类型

进入经典数据集