five

Noisy TIMIT Speech

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2017S04
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>Noisy TIMIT Speech was developed by the <a href="http://www.fit.edu/">Florida Institute of Technology</a> and contains approximately 322 hours of speech from the TIMIT Acoustic-Phonetic Continuous Speech Corpus (<a href="../../../LDC93S1">LDC93S1</a>) modified with different additive noise levels. Only the audio has been modified; the original arrangement of the TIMIT corpus is still as described by the TIMIT documentation.</p><br> <h3>Data</h3><br> <p>The additive noise are white, pink, blue, red, violet and babble noise with noise levels varying in 5 dB (decibel) steps and ranges from 5 to 50 dB.</p><br> <p>The color of noise refers to the power spectrum of a noise signal. Sound waves have two characteristics: frequency, which describes how fast the waveform vibrates per second; and amplitude, the size of the waveform. Colored noises are named in an analogy to the colors of light.&nbsp; For instance, white noise contains all audible frequencies just as white light contains all frequencies in the visible range. Non-white colored noises have more energy concentrated at the high or low end of the sound spectrum. White, pink and blue noise are officially defined in the <a href="https://www.its.bldrdoc.gov/fs-1037/fs-1037c.htm">federal telecommunications standard</a>.</p><br> <p>The white, pink, blue, red and violet noise types added to the TIMIT data in this release were generated artificially using MATLAB. For the babble noise, a random segment of recorded babble speech was selected and scaled relative to the power of the original TIMIT audio signal.</p><br> <p>All audio files are presented as single channel 16kHz 16-flac.</p><br> <h3>Samples</h3><br> <p>Please listen to the following samples:</p><br> <ul><br> <li><a href="desc/addenda/LDC2017S04.bab.5db.flac">5db Babble</a></li><br> <li><a href="desc/addenda/LDC2017S04.blu.15db.flac">15db Blue</a></li><br> <li><a href="desc/addenda/LDC2017S04.pin.25db.flac">25db Pink</a></li><br> <li><a href="desc/addenda/LDC2017S04.red.35db.flac">35db Red</a></li><br> <li><a href="desc/addenda/LDC2017S04.vio.45db.flac">45db Violet</a></li><br> <li><a href="desc/addenda/LDC2017S04.whi.50db.flac">50db White</a></li><br> </ul><br> <h3>Updates</h3><br> <p>None at this time.</p><br> <h3>Related Works incorporating TIMIT</h3><br> <p>TIMIT was designed to provide speech data for acoustic-phonetic studies and for the development and evaluation of automatic speech recognition systems. Since its release in 1993, several corpora have been developed using the TIMIT database:</p><br> <p>NTIMIT (<a href="../../../LDC93S2">LDC93S2</a>): transmitting TIMIT recordings through a telephone handset and over various channels in the NYNEX telephone network</p><br> <p>CTIMIT (<a href="../../../LDC96S30">LDC96S30</a>): passing TIMIT files through cellular telephone circuits</p><br> <p>FFMTIMIT (<a href="../../../LDC96S32">LDC96S32</a>): re-recording TIMIT files with a free-field microphone</p><br> <p>HTIMIT (<a href="../../../LDC98S67">LDC98S67</a>): re-recording a subset of TIMIT files throgh different telephone handsets</p><br> <p>STC-TIMIT (<a href="../../../LDC2008S03">LDC2008S03</a>): passing TIMIT files through an actual telephone channel in a single call</p><br> <p>WTIMIT 1.0 (<a href="../../../LDC2010S02">LDC2010S02</a>): wideband mobile telephony TIMIT version</p></br> Portions © 2017 Florida Institute of Technology, © 1993, 2017 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作