five

Mandarin Speech Data by Mobile Phone - 2,028 Hours

收藏
catalogue.elra.info2025-03-26 收录
下载链接:
https://catalogue.elra.info/en-us/repository/browse/ELRA-S0417/
下载链接
链接失效反馈
官方服务:
资源简介:
4,787 Chinese native speakers participated in the recording with equal gender. Speakers are from various provinces of China. The recording content is rich, covering mobile phone voice assistant interaction, smart home command and control, In-car command and control, numbers, and other fields, which is accurately matching the smart home, intelligent car, and other practical application scenarios.Format:16kHz, 16bit, uncompressed wav, mono channel;Recording environment:quiet indoor environment, without echo;Recording content (read speech):generic category; human-machine interaction category; smart home command and control category; in-car command and control category; numbers;Speaker:4,787 speakers totally, with 49% males and 51% females;Device:Android mobile phone; iPhoneLanguage:MandarinApplication scenarios:speech recognition; voiceprint recognition.

本数据集由来自中国各省份的4,787名母语为中文的演讲者参与录制,其中男女比例均衡。录音内容丰富,涵盖移动手机语音助手交互、智能家居命令与控制、车载命令与控制、数字以及其他领域,与智能家居、智能汽车等实际应用场景高度契合。格式为16kHz,16位,未压缩的WAV格式,单声道;录音环境为安静的室内环境,无回声;录音内容(朗读语音)包括通用类别、人机交互类别、智能家居命令与控制类别、车载命令与控制类别、数字等;演讲者总计4,787人,其中男性占49%,女性占51%;设备为Android智能手机;iPhone;语言为普通话;应用场景为语音识别、声纹识别。
提供机构:
catalogue.elra.info
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作