Noisy TIMIT Speech

Mendeley Data | Updated 2024-01-31 | Indexed 2024-06-28

Download link:

https://catalog.ldc.upenn.edu/LDC2017S04

Resource description:

Introduction

Noisy TIMIT Speech was developed by the Florida Institute of Technology and contains approximately 322 hours of speech from the TIMIT Acoustic-Phonetic Continuous Speech Corpus (LDC93S1) modified with different additive noise levels. Only the audio has been modified; the original arrangement of the TIMIT corpus is still as described by the TIMIT documentation.

Data

The additive noise types are white, pink, blue, red, violet and babble noise, with noise levels ranging from 5 to 50 dB (decibels) in 5 dB steps.

The color of noise refers to the power spectrum of a noise signal. Sound waves have two characteristics: frequency, which describes how fast the waveform vibrates per second, and amplitude, the size of the waveform. Colored noises are named by analogy to the colors of light. For instance, white noise contains all audible frequencies just as white light contains all frequencies in the visible range. Non-white colored noises have more energy concentrated at the high or low end of the sound spectrum. White, pink and blue noise are officially defined in the federal telecommunications standard.

The white, pink, blue, red and violet noise types added to the TIMIT data in this release were generated artificially using MATLAB. For the babble noise, a random segment of recorded babble speech was selected and scaled relative to the power of the original TIMIT audio signal.

All audio files are presented as single-channel 16 kHz 16-bit FLAC.

Samples

Samples are provided for the following conditions: 5 dB babble, 15 dB blue, 25 dB pink, 35 dB red, 45 dB violet, 50 dB white.

Updates

None at this time.

Related Works incorporating TIMIT

TIMIT was designed to provide speech data for acoustic-phonetic studies and for the development and evaluation of automatic speech recognition systems. Since its release in 1993, several corpora have been developed using the TIMIT database:

NTIMIT (LDC93S2): transmitting TIMIT recordings through a telephone handset and over various channels in the NYNEX telephone network
CTIMIT (LDC96S30): passing TIMIT files through cellular telephone circuits
FFMTIMIT (LDC96S32): re-recording TIMIT files with a free-field microphone
HTIMIT (LDC98S67): re-recording a subset of TIMIT files through different telephone handsets
STC-TIMIT (LDC2008S03): passing TIMIT files through an actual telephone channel in a single call
WTIMIT 1.0 (LDC2010S02): wideband mobile telephony TIMIT version

Portions © 2017 Florida Institute of Technology, © 1993, 2017 Trustees of the University of Pennsylvania
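The noise generation and scaling described under Data can be illustrated with a minimal Python sketch. This is not the MATLAB code used to build the corpus. It assumes that each "noise level" is a signal-to-noise ratio in dB (the description does not say so explicitly), that colored noise is produced by shaping white Gaussian noise in the frequency domain so that power follows f^alpha, and it uses hypothetical function names (`colored_noise`, `mix_at_snr`, `measured_snr_db`). A synthetic tone stands in for a TIMIT utterance.

```python
import numpy as np

# Power spectral density ~ f**alpha for each noise color (common textbook
# definitions; assumed here, not taken from the corpus documentation).
NOISE_EXPONENTS = {"white": 0.0, "pink": -1.0, "red": -2.0, "blue": 1.0, "violet": 2.0}


def colored_noise(color, n_samples, rng):
    """Shape white Gaussian noise in the frequency domain; returns unit-power noise."""
    alpha = NOISE_EXPONENTS[color]
    spectrum = np.fft.rfft(rng.standard_normal(n_samples))
    freqs = np.fft.rfftfreq(n_samples)
    freqs[0] = freqs[1]  # avoid 0**negative at the DC bin
    spectrum = spectrum * freqs ** (alpha / 2.0)  # amplitude ~ sqrt(power)
    noise = np.fft.irfft(spectrum, n=n_samples)
    return noise / np.sqrt(np.mean(noise ** 2))


def mix_at_snr(speech, noise, snr_db):
    """Scale noise relative to the speech power so the mix has the requested SNR."""
    p_speech = np.mean(speech ** 2)
    p_noise = np.mean(noise ** 2)
    gain = np.sqrt(p_speech / (p_noise * 10.0 ** (snr_db / 10.0)))
    return speech + gain * noise


def measured_snr_db(speech, noisy):
    """SNR in dB of a mix, given the clean signal it was built from."""
    residual = noisy - speech
    return 10.0 * np.log10(np.mean(speech ** 2) / np.mean(residual ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sample_rate = 16000  # matches the 16 kHz audio in the corpus
    t = np.arange(sample_rate) / sample_rate
    speech = 0.1 * np.sin(2 * np.pi * 220.0 * t)  # stand-in for one utterance
    for color in NOISE_EXPONENTS:
        noisy = mix_at_snr(speech, colored_noise(color, speech.size, rng), 15.0)
        print(color, measured_snr_db(speech, noisy))
```

The same `mix_at_snr` step would apply to babble noise: instead of synthesizing the noise, one would cut a random segment from a babble recording and pass it in as `noise`.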

Created:

2024-01-31

Dataset introduction

Background overview

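Noisy TIMIT Speech is a dataset of approximately 322 hours of speech, built on the TIMIT corpus by adding six types of noise at a range of levels to the audio. It is intended for speech recognition research: it provides speech samples under many noise conditions and supports the development and testing of speech recognition systems for noisy environments.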

The above content was collected and summarized by 遇见数据集.
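As a rough consistency check on the figures above, the noise conditions can be enumerated and the total duration divided among them. The sketch below assumes that every combination of noise type and level covers the same amount of audio, which the description does not state; the variable names are illustrative only.

```python
from itertools import product

NOISE_TYPES = ["white", "pink", "blue", "red", "violet", "babble"]
LEVELS_DB = list(range(5, 51, 5))  # 5, 10, ..., 50 dB in 5 dB steps
TOTAL_HOURS = 322  # approximate total from the description

# Every (noise type, level) pair is one condition.
conditions = list(product(NOISE_TYPES, LEVELS_DB))

# Assumption: audio is spread evenly across all conditions.
hours_per_condition = TOTAL_HOURS / len(conditions)

print(len(conditions), round(hours_per_condition, 2))
```

Under that assumption there are 6 × 10 = 60 conditions, each covering a little over five hours of audio.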
