five

West Point Russian Speech

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2003S05
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>West Point Russian Speech was developed at the Department of Foreign Languages (DFL) and the Center for Technology Enhanced Language Learning (CTELL) at the United States Military Academy at West Point. The purpose of the corpus is to provide a set of recordings for the training and development of speaker-independent speech recognition systems for use by West Point cadets enrolled in the Russian language program.</p><br> <h3>Data</h3><br> <p>The corpus consists of 4,181 speech files in SPHERE format, totalling approximately four hours of speech. Approximately 2,290 files are from native informants and 1,891 are from non-native informants.</p><br> <p>The following tables show the breakdown of corpus content in terms of male, female, native and non-native speakers.</p><br> <p>Number of speakers:</p><br> <table><br> <tbody><br> <tr><br> <td>&nbsp;</td><br> <td>male</td><br> <td>female</td><br> <td>total</td><br> </tr><br> <tr><br> <td>native</td><br> <td>13</td><br> <td>16</td><br> <td>29</td><br> </tr><br> <tr><br> <td>non-native</td><br> <td>16</td><br> <td>10</td><br> <td>26</td><br> </tr><br> <tr><br> <td>totals</td><br> <td>29</td><br> <td>26</td><br> <td>55</td><br> </tr><br> </tbody><br> </table><br> <p>Number of speech files:</p><br> <table><br> <tbody><br> <tr><br> <td>&nbsp;</td><br> <td>male</td><br> <td>female</td><br> <td>total</td><br> </tr><br> <tr><br> <td>native</td><br> <td>1027</td><br> <td>1263</td><br> <td>2290</td><br> </tr><br> <tr><br> <td>non-native</td><br> <td>1103</td><br> <td>788</td><br> <td>1891</td><br> </tr><br> <tr><br> <td>totals</td><br> <td>2130</td><br> <td>2050</td><br> <td>4181</td><br> </tr><br> </tbody><br> </table><br> <p>The speech data was collected using laptop computers running Windows NT. Recordings were captured at a sampling rate of 16-bit at 22,050 Hz pcm using a Shure SM10A microphone and a RANE Model MS1 pre-amplifier. A visual display of the sentence, along with a digital recording of the sentence as read by a native speaker, was presented. The informant pressed the Enter key to record the utterance. The informant's recording was played back for review and the utterance was re-recorded if necessary.</p><br> <p>The collection script consists of 96 sentences with a total of 528 tokens and 351 types.</p><br> <p>Each waveform file has a monophone and word level master label file transcription in HTK-format. A concatenated version of the master label files at both the word level and the phone level is provided.</p><br> <p>The lexicon contains 690 distinct orthographic word forms, including all words found in the collection script.</p><br> <h3>Samples</h3><br> <p>Please view the following samples:</p><br> <ul><br> <li><a href="desc/addenda/LDC2003S05.f.sph">Female Speaker (S31)</a></li><br> <li><a href="desc/addenda/LDC2003S05.m.sph">Male Speaker (S08)</a></li><br> <li><a href="desc/addenda/LDC2003S05.monophone.mlf">Phone Level Transcript</a></li><br> <li><a href="desc/addenda/LDC2003S05.word.mlf">Word Level Transcript</a></li><br> </ul><br> <h3>Updates</h3><br> <p>There are no updates available at this time.</p></br> Portions © 2003 United States Military Academy, © 2003 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作