five

Malay Speech Data by Mobile Phone_Reading - 134 Hours

收藏
catalogue.elra.info2022-10-06 更新2025-03-24 收录
下载链接:
https://catalogue.elra.info/en-us/repository/browse/ELRA-S0469/
下载链接
链接失效反馈
官方服务:
资源简介:
156 Speakers - Mobile Telephony Malay Speech Data_Reading is recorded by native Malay speakers in the quiet environment. The recording is rich in content, covering multiple categories such as economy, entertainment, news, oral language, numbers, and letters. Around 450 sentences for each speaker. The effective time is 134 hours. All texts are manually transcribed to ensure high accuracy.Format:16kHz, 16bit, mono channel , no compact WAV, text format: metadataLanguage:MalayEnvironment:Quiet ,echolessRecording Text:economy, entertainment, news spoken language, figure and letterSpeaker:156 Malay speakers with 65% females (102 speakers); about 450 prompts per speaker.Device:Android phone : iPhone = 5.5 : 1Application Scenario:speech recognition machine translation, voiceprint recognition

本数据集汇聚了156位马来语母语者的移动电话语音数据,录音环境静谧宜人。内容丰富,涵盖经济、娱乐、新闻、口语、数字与字母等多个类别。每位说话者约有450句录音,总计有效时长达到134小时。所有文本均经人工转录,以确保极高的准确性。录音格式为16kHz、16位单声道,无压缩WAV格式,文本格式包含元数据语言:马来语;录音环境:安静无回声;录音内容:经济、娱乐、新闻口语、数字与字母;说话者:156位马来语说话者,其中女性占65%(102位);每位说话者约450个提示词;设备:Android手机与iPhone之比为5.5:1;应用场景:语音识别、机器翻译、声纹识别。
提供机构:
catalogue.elra.info
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作