Preference of voice and performance style: Data of the listening test and acoustic analysis

Name: Preference of voice and performance style: Data of the listening test and acoustic analysis
Creator: figshare
Published: 2024-01-19 13:26:37
License: 暂无描述

DataCite Commons2024-01-19 更新2024-08-19 收录

下载链接：

https://figshare.com/articles/dataset/Preference_of_voice_and_performance_style_Data_of_the_listening_test_and_acoustic_analysis/25028036

下载链接

链接失效反馈

官方服务：

资源简介：

This study sought to find out which style of radio advertisement performance listeners consider likable and which acoustic features differentiate the likable from the unlikable. The same speakers presented a gender neutral pretend-advertisement in two styles: calm and energetic. Listeners had to rate the likability of the performances. The results showed that listener likability scores were consistent and did not depend on listener gender. The listeners overwhelmingly preferred advertisements presented in a calm style, regardless of performer or their age or gender. For each advertisement, 88 parameters of the extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) were calculated. Most of these significantly differentiated likable and unlikable performances. Likable performances were characterised by lower pitch, faster articulation rate, a quieter voice with no abrupt changes in loudness, and a breathy voice. The study showed the importance of determining which performance style listeners prefer, as the voice of the performer is directly affected by the performance style. Listeners might like a voice in one style, but not the other.List of eGeMAPS parameter abbreviationsA1,2,3 = difference of log amplitude of first, second and third harmonic to f0 amplitudeAF1,F2,F3 = difference of log amplitude of first, second and third formant to f0 amplitudealpha ratio = ratio of the summed energy from 50–1000 Hz and 1–5 kHzBF1,2,3 = bandwidth of first, second and third formantf0 = logarithmic fundamental frequency on a semitone frequency scale, starting at 27.5 Hz (semitone 0)F1,2,3 = centre frequency of first, second and third formantHammarberg index = ratio of the strongest energy peak in the 0–2 kHz region to the strongest peak in the 2–5 kHz regionharmonic difference H1–H2 = ratio of energy of the first f0 harmonic (H1) to the energy of the second f0 harmonic (H2)harmonic difference H1–A3 = ratio of energy of the first f0 harmonic (H1) to the energy of the highest harmonic in the third formant range (A3)HNR = harmonics-to-noise ratioLEq = equivalent sound level, computed by converting the average of the per-frame RMS energies to a logarithmic (dB) scaleMFCC1,2,3,4 = first, second, third and fourth Mel-frequency cepstral coefficientloudness = estimate of perceived signal intensity from an auditory spectrumpctl = percentilepctlrg = range of the 20th to 80th percentileSDnorm = standard deviation normalised by the arithmetic mean (coefficient of variation)shimmer = difference of the peak amplitudes of consecutive f0 periodsspectral flux = difference of the spectra of two consecutive framesspectral slope 0–500 Hz or 500–1500 Hz = linear regression slope of the logarithmic power spectrum for 0–500 Hz or 500–1500 Hz regionUVR = unvoiced regionsVR = voiced regions

提供机构：

figshare

创建时间：

2024-01-19