five

Multiparametric Analysis of Speaking Fundamental Frequency in Genetically Related Speakers Using Different Speech Materials

收藏
DataCite Commons2021-09-28 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/Multiparametric_Analysis_of_Speaking_Fundamental_Frequency_in_Genetically_Related_Speakers_Using_Different_Speech_Materials/16689433/1
下载链接
链接失效反馈
官方服务:
资源简介:
The present data derives from a Ph.D. research on the inter-speaker discriminatory potential of Fundamental Frequency descriptors in comparisons involving genetically-related and non-genetically related speakers.<br>A set of 15 f0 measures were considered for assessment in connected speech (i.e., at the domain of sentences), including descriptors of f0 dispersion, central tendency, and modulation f0M, as presented below. As for lengthened vowels, given the more stationary f0 patterns observed, only the first seven parameters were considered for the analysis:<br><br><br>f0mean: f0 mean in semitones ref 1 Hz/ and in Hertz f0med: f0 median in semitones ref 1 Hz/ and in Hertz<br>f0min: f0 minimum in semitones ref 1 Hz/ and in Hertz<br>f0max: f0 maximum in semitones ref 1 Hz/ and in Hertz<br>f0sd: f0 standard-deviation in semitones ref 1 Hz/ and in Hertz<br>f0base: Base value of f0 in semitones ref 1 Hz/ and in Hertz (i.e., equivalent to the 7.4th quantile of the f0 sample)<br>f0SAQ: f0 semi-amplitude between quartiles in semitones ref 1 Hz/ and in Hertz (i.e., a non-parametric measure of f0 dispersion)<br>f0M1: Smoothed f0 peak rate in peaks per second (i.e., f0 peak rate/s)<br>f0M2: Standard-deviation of f0 maxima in semitones ref 1 Hz/ and in Hertz (i.e., when there is more than one peak in the interval)<br>f0M3: Standard-deviation of the F0 maxima positions in seconds (i.e., standard-deviation of peaks' duration)<br>f0M4: 1st-derivative f0 mean in Hertz/frame of the positive derivatives (i.e., f0 rising rate in the peaks)<br>f0M5: 1st-derivative f0 mean in Hertz/frame of the negative derivatives (i.e., f0 falling rate in the peaks)<br>f0M6: 1st-derivative f0 standard-deviation in Hertz/frame of the positive derivatives (i.e., standard deviation of f0 rising rate)<br>f0M7: 1st-derivative f0 standard-deviation in Hertz/frame of the negative derivatives (i.e., standard deviation of f0 falling rate)<br>f0M8: Mean peakness of f0 max in semitones relatively to f0 range multiplied by 1000 (i.e., corresponding to the width of f0 peaks)<br><br>AN IMPORTANT NOTE:<br>Subsetting is required for "Lengthened Vowels", so vowel duration (DUR) is <b>≥ 160 ms.</b>

本数据集源自一项博士研究,探究基因相关与非基因相关说话者对比场景下,基频(Fundamental Frequency)描述子的说话人区分潜力。 研究选取了15项f0指标,用于连贯语音(即语句域)场景下的评估,涵盖f0离散程度、集中趋势与调制特征类描述子,具体如下。针对延长元音场景,由于其f0模式更为平稳,仅选取前7项参数开展分析: f0mean:f0均值,单位为半音(参考基准1Hz)与赫兹 f0med:f0中位数,单位为半音(参考基准1Hz)与赫兹 f0min:f0最小值,单位为半音(参考基准1Hz)与赫兹 f0max:f0最大值,单位为半音(参考基准1Hz)与赫兹 f0sd:f0标准差,单位为半音(参考基准1Hz)与赫兹 f0base:f0基准值,单位为半音(参考基准1Hz)与赫兹(等效于f0样本的7.4分位数) f0SAQ:四分位间距的f0半幅值,单位为半音(参考基准1Hz)与赫兹(即一种非参数化的f0离散程度度量指标) f0M1:每秒平滑f0峰值速率(即f0峰值数/秒) f0M2:f0极大值的标准差,单位为半音(参考基准1Hz)与赫兹(即区间内存在多个峰值时的指标) f0M3:f0极大值位置的标准差,单位为秒(即峰值时长的标准差) f0M4:正导数帧的f0一阶导数均值,单位为赫兹/帧(即峰值处的f0上升速率) f0M5:负导数帧的f0一阶导数均值,单位为赫兹/帧(即峰值处的f0下降速率) f0M6:正导数帧的f0一阶导数标准差,单位为赫兹/帧(即f0上升速率的标准差) f0M7:负导数帧的f0一阶导数标准差,单位为赫兹/帧(即f0下降速率的标准差) f0M8:f0极大值的平均峰度(以半音为单位,相对于f0取值范围),并乘以1000(即对应f0峰值的宽度) 重要说明:针对"延长元音"场景需进行子集筛选,即元音时长(DUR)需≥160毫秒。
提供机构:
figshare
创建时间:
2021-09-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作