Speech Utterance Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/ehughson/voice_toolbox
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了参与者在不同环境条件下互动时的语音发声数据及提取的声学特征。具体环境条件包括高级餐厅、咖啡馆、热闹的餐馆、安静的酒吧、喧闹的酒吧、夜店以及基准环境。提取的声学特征包括音量、频谱特征和语速特征。该数据集规模为2341个语音片段,来自12位参与者(其中8位女性,4位男性)。其任务是对语音进行分析及特征提取,以适应不同语境下的语音调整需求。
This dataset contains speech utterance data and extracted acoustic features collected from participants interacting under various environmental conditions. The specific environmental conditions include fine-dining restaurants, cafes, bustling restaurants, quiet bars, noisy bars, nightclubs, and a baseline environment. The extracted acoustic features cover volume, spectral features, and speech rate features. The dataset consists of 2,341 speech utterances from 12 participants, of whom 8 are female and 4 are male. The core task of this dataset is speech analysis and feature extraction, designed to meet speech adjustment requirements across diverse contextual scenarios.
提供机构:
Simon Fraser University



