five

Synthetic vowels of speakers with Parkinson’s disease and Parkinsonism

收藏
DataCite Commons2020-08-27 更新2024-08-17 收录
下载链接:
https://figshare.com/articles/Synthetic_vowels_of_speakers_with_Parkinson_s_disease_and_Parkinsonism/7628819
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset contains synthesized replicas of sustained vowels /A/ and /I/ performed by healthy controls, patients with Parkinson’s disease, multiple system atrophy and progressive supranuclear palsy. The dataset can be used as a reference for evaluation of pitch detectors, detectors of modal fundamental frequency, and detectors of subharmonics.<br><b>Coding system </b>Each recording is named by a unique alphanumeric code in the format <i>Uvxy</i>, where <i>U</i> means abbreviation of the group (HC = healthy control, PD = Parkinson’s disease, MSA = multiple system atrophy, PSP = progressive supranuclear palsy) in upper case characters, <i>v</i> is the numeric identifier of the subject within the group, <i>x</i> denotes type of vowel (a = vowel /A/, i = vowel /I/), and <i>y</i> is the number of repetition. The part of the code of <i>U</i> and <i>v</i> uniquely determine each speaker, whereas <i>x</i> and <i>y</i> determine speaker’s recordings.<br><b> Recordings </b>All recordings are briefly described in table dataset.csv. All files of each record (see records.zip) are identified by the corresponding code and suffix. Suffix describe type of the file and is separated from the code by underscore. Naming of the files is illustrated on the record HC8a1. The code describes first repetition of the vowel /A/ performed by healthy speaker HC8. The record HC8a1 consists of following files:<i>HC8a1.wav</i> = waveform of the synthesized replica. This is the reference signal used for the evaluation. Parameters of jitter, shimmer and harmonic to noise ratio (HNR) can be found in dataset.csv.<br><i>HC8a1_clean.wav</i> = waveform of the synthesized replica without added noise. We provide this signal to make the model more versatile. Authors may add a different kind of noise to this signal or manipulate with HNR. Note that that both signals required normalization prior to writing into wav-file. Original scaling factor between HC8a1 and HCa1_clean can be determined from total power of signals and reference HNR value.<br><i>HC8a1_LF.wav</i> = sample of the glottal pulse used for the synthesis.<br><i>HC8a2_impulses.csv</i> = list of impulses’ locations in seconds and corresponding amplitudes. The position of pulses was corrected to match with the beginning of glottal pulse, i.e., first sample of the signal HC8a1_LF.wav begins at each of these positions. The jitter and shimmer listed in dataset.csv were are median values. Jitter and shimmer by other definitions can be calculated from positions and amplitudes of pulses provided by this file. <i>HC8a1_subharmonics.csv</i> = list of subharmonic intervals described by the start time in seconds and end time in seconds. Corresponding index of amplitude modulation expressed as SHR in percent can be found in the table dataset.csv. When no subharmonic was determined by the supervised parameterization, no file was included for the speaker and SHR in the dataset.csv was set to zero [1].<br> <br> [1] Note that the supervised detection had lower sensitivity due to senzitivity of pitch trace in PRAAT, so the occurrence of subharmonics in synthesized data is much lower than in the original dataset analyzed by automated segmentation. This is not a problem because subharmonics were synthesized only at the given interval - this illustrates why it is important to detect subharmonics in other way than by pitch. <br>
提供机构:
figshare
创建时间:
2019-01-25
二维码
社区交流群
二维码
科研交流群
商业服务