five

Human Vocal Tract Length

收藏
doi.org2025-03-26 收录
下载链接:
http://doi.org/10.17632/gccfw77yc7.1
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset comprises recordings collected from nine native American English speakers, including five males (M1, M2, M3, M4, M5) and four females (F1, F3, F4, F5), aged between 20 and 46 years. All participants reported no hearing or speaking impairments. Vocal Tract Length (VTL) Measurement 1st column: The lowest resonance frequency, Φ=c/4L​, was calculated for a lossless uniform tube of length L, where c=35300cm/s (speed of sound). Here, L represents the Vocal Tract Length (VTL), which was measured using real-time magnetic resonance imaging (MRI) videos. Acoustic Features 2nd column: Fundamental Frequency (F0​) in Hz. 3rd to 6th columns: Scaled formant values: f1/1: first formant scaled by 1 f2/3: second formant scaled by 3 f3/5: third formant scaled by 5 f4/7: fourth formant scaled by 7 12 Mel-Frequency Cepstral Coefficients (MFCCs). The 0th-order MFCC coefficient was excluded. For additional information: P. Vasquez-Serrano, J. Reyes-Moreno, Rodrigo Capobianco Guido, Alexander Sepúlveda-Sepúlveda, MFCC Parameters of the Speech Signal: An Alternative to Formant-Based Instantaneous Vocal Tract Length Estimation, Journal of Voice, 2023, ISSN 0892-1997, https://doi.org/10.1016/j.jvoice.2023.05.012

本数据集收录了九位母语为美式英语的本土人士的录音,其中男性五名(M1、M2、M3、M4、M5)和女性四名(F1、F3、F4、F5),年龄介于20至46岁之间。所有参与者均报告没有听力或言语障碍。 声道长度(VTL)测量 第1列:最低共振频率Φ=c/4L,其中c=35300cm/s(声速),L代表声道长度(VTL),通过实时磁共振成像(MRI)视频进行测量。 声学特征 第2列:基频(F0)以赫兹为单位。 第3至第6列:缩放形式频率值: f1/1:第一个形式频率乘以1 f2/3:第二个形式频率乘以3 f3/5:第三个形式频率乘以5 f4/7:第四个形式频率乘以7 12个梅尔频率倒谱系数(MFCC)。排除第0阶MFCC系数。 补充信息:P. Vasquez-Serrano, J. Reyes-Moreno, Rodrigo Capobianco Guido, Alexander Sepúlveda-Sepúlveda, 《语音信号中的MFCC参数:基于形式频率的瞬时声道长度估计的替代方法》,《语音》杂志,2023年,ISSN 0892-1997,https://doi.org/10.1016/j.jvoice.2023.05.012
提供机构:
Mendeley Data
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作