
Corpus of Age-related Voice Disguise

Source: Mendeley Data. Updated 2024-03-27; indexed 2024-06-28.
Download link:
https://etsin.fairdata.fi/dataset/3430d5da-3a70-4de0-86ea-4469eef21750
Description:
This corpus contains normal and age-related disguised speech from 60 native Finnish speakers (31 females and 29 males). The speakers read the same text fragments several times: in their modal voice and in two disguised voices, first pretending to be an elderly speaker and then pretending to be a child. The texts were the Finnish translations of The Rainbow Passage and The North Wind and the Sun, plus two English sentences selected from the TIMIT[1] corpus (SA1, SA2). The corpus includes samples of 78 different sentences per speaker (66 Finnish, 12 English). The speech was recorded simultaneously with a portable recorder fitted with a close-talking microphone and with two smartphone applications, yielding a total of 14040 audio files (3 * 4680). The material was recorded in summer 2015 to study the effect of voice disguise on automatic speaker recognition.

Access to the corpus requires a personal application; apply here: https://lbr.csc.fi

Further information is available in the following publications:

Rosa González Hautamäki, Md Sahidullah, Tomi Kinnunen and Ville Hautamäki, "Age-Related Voice Disguise and its Impact in Speaker Verification Accuracy", Proc. Odyssey: the Speaker and Language Recognition Workshop, Bilbao, Spain, June 2016.

Rosa González Hautamäki, Md Sahidullah, Ville Hautamäki and Tomi Kinnunen, "Acoustical and perceptual study of voice disguise by age modification in speaker verification", Speech Communication, Volume 95, December 2017, Pages 1-15, doi: doi.org/10.1016/j.specom.2017.10.002
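The file counts quoted in the description can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the constant and function names are hypothetical, and it assumes one audio file per sentence sample, per speaker, per recording device, which is consistent with the stated totals but is not spelled out in the description.

```python
# Counts taken from the corpus description.
SPEAKERS = 60            # 31 female + 29 male
FINNISH_SENTENCES = 66   # sentence samples per speaker
ENGLISH_SENTENCES = 12   # sentence samples per speaker
RECORDING_DEVICES = 3    # portable recorder + two smartphone applications


def corpus_file_counts():
    """Return (sentences per speaker, files per device, total files).

    Assumes one audio file per sentence sample, speaker, and device.
    """
    sentences_per_speaker = FINNISH_SENTENCES + ENGLISH_SENTENCES
    files_per_device = SPEAKERS * sentences_per_speaker
    total_files = RECORDING_DEVICES * files_per_device
    return sentences_per_speaker, files_per_device, total_files


print(corpus_file_counts())  # (78, 4680, 14040)
```

Under that assumption the numbers agree with the description: 78 sentences per speaker, 4680 files per device, and 14040 files in total.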

Created:
2023-10-10