Multiparametric Analysis of Speaking Fundamental Frequency in Genetically Related Speakers Using Different Speech Materials
DataCite Commons, updated 2021-09-28, indexed 2024-07-28
Download link:
https://figshare.com/articles/dataset/Multiparametric_Analysis_of_Speaking_Fundamental_Frequency_in_Genetically_Related_Speakers_Using_Different_Speech_Materials/16689433
Description:
The present data derive from Ph.D. research on the inter-speaker discriminatory potential of fundamental frequency (f0) descriptors in comparisons involving genetically related and non-genetically related speakers.

A set of 15 f0 measures was considered for assessment in connected speech (i.e., at the level of sentences), including descriptors of f0 dispersion, central tendency, and modulation (f0M), as listed below. For lengthened vowels, given the more stationary f0 patterns observed, only the first seven parameters were considered for the analysis:

f0mean: f0 mean, in semitones (ref. 1 Hz) and in Hertz
f0med: f0 median, in semitones (ref. 1 Hz) and in Hertz
f0min: f0 minimum, in semitones (ref. 1 Hz) and in Hertz
f0max: f0 maximum, in semitones (ref. 1 Hz) and in Hertz
f0sd: f0 standard deviation, in semitones (ref. 1 Hz) and in Hertz
f0base: Base value of f0, in semitones (ref. 1 Hz) and in Hertz (i.e., equivalent to the 7.4th quantile of the f0 sample)
f0SAQ: f0 semi-amplitude between quartiles, in semitones (ref. 1 Hz) and in Hertz (i.e., a non-parametric measure of f0 dispersion)
f0M1: Smoothed f0 peak rate, in peaks per second (i.e., f0 peak rate/s)
f0M2: Standard deviation of the f0 maxima, in semitones (ref. 1 Hz) and in Hertz (i.e., when there is more than one peak in the interval)
f0M3: Standard deviation of the f0 maxima positions, in seconds (i.e., standard deviation of the peaks' duration)
f0M4: Mean of the positive f0 first derivatives, in Hertz/frame (i.e., f0 rising rate in the peaks)
f0M5: Mean of the negative f0 first derivatives, in Hertz/frame (i.e., f0 falling rate in the peaks)
f0M6: Standard deviation of the positive f0 first derivatives, in Hertz/frame (i.e., standard deviation of the f0 rising rate)
f0M7: Standard deviation of the negative f0 first derivatives, in Hertz/frame (i.e., standard deviation of the f0 falling rate)
f0M8: Mean peakness of the f0 maxima, in semitones relative to the f0 range, multiplied by 1000 (i.e., corresponding to the width of f0 peaks)

AN IMPORTANT NOTE:
Subsetting is required for "Lengthened Vowels", so that vowel duration (DUR) is ≥ 160 ms.
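
As an illustration only, the first seven descriptors and the duration subsetting could be sketched in Python as below. This is not the script used to produce the dataset. The function names are hypothetical, and several choices are assumptions: non-positive f0 values are treated as unvoiced frames, the "7.4th quantile" is read as the 7.4th percentile with linear interpolation, the standard deviation uses the sample (n - 1) form, the semitone statistics are computed on semitone-converted samples, and the DUR field is taken to be in milliseconds.

```python
import math
import statistics


def hz_to_semitones(f0_hz, ref_hz=1.0):
    """Convert a frequency in Hz to semitones relative to ref_hz."""
    return 12.0 * math.log2(f0_hz / ref_hz)


def quantile(values, q):
    """Quantile q in [0, 1], by linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def static_descriptors(samples):
    """Seven static descriptors for one sequence of voiced f0 samples."""
    return {
        "f0mean": statistics.mean(samples),
        "f0med": statistics.median(samples),
        "f0min": min(samples),
        "f0max": max(samples),
        # Sample standard deviation (n - 1 denominator); an assumption.
        "f0sd": statistics.stdev(samples),
        # "7.4th quantile" interpreted as the 7.4th percentile; an assumption.
        "f0base": quantile(samples, 0.074),
        # Semi-amplitude between quartiles: half the interquartile range.
        "f0SAQ": (quantile(samples, 0.75) - quantile(samples, 0.25)) / 2.0,
    }


def describe_f0(f0_hz):
    """Return the seven descriptors in Hz and in semitones (ref. 1 Hz)."""
    # Non-positive values are treated as unvoiced frames and dropped.
    voiced = [f for f in f0_hz if f > 0]
    semitones = [hz_to_semitones(f) for f in voiced]
    return {"hz": static_descriptors(voiced), "st": static_descriptors(semitones)}


def keep_long_vowels(rows, min_dur_ms=160.0):
    """Keep rows whose DUR value (assumed to be in ms) is >= min_dur_ms."""
    return [row for row in rows if row["DUR"] >= min_dur_ms]
```

For example, `describe_f0([100, 110, 120, 130, 140, 0])["hz"]["f0SAQ"]` evaluates the quartiles at 110 Hz and 130 Hz and halves their difference. A different quantile interpolation rule or standard deviation convention would give slightly different values from those in the dataset.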
Provider:
figshare
Created:
2021-09-28



