海天瑞声-中文普通话老人儿童识别语音库(桌面)
收藏魔搭社区2026-05-21 更新2024-06-08 收录
下载链接:
https://modelscope.cn/datasets/haitianruisheng/ChineseMandarinelderlyandkidsSpeechRecognitionCorpus
下载链接
链接失效反馈官方服务:
资源简介:
该语音数据集涵盖了特色年龄段,包括儿童和老年人,确保了性别均衡,同时发音人来自中国七大方言区,实现了地域的广泛覆盖。儿童录制包括车控指令、有声书(特别是儿童节目)、儿童视频节目以及儿歌和抖音上热门的儿童歌曲。老年录制包括车控指令、地图导航、有声书(特别侧重老年人喜爱的节目),以及老年人偏爱的音乐类型
This speech dataset covers representative age cohorts including children and the elderly, maintains a balanced gender distribution, and its speakers are sourced from seven major dialect regions across China, ensuring extensive geographic coverage. Recordings for child participants include vehicle control commands, audiobooks (particularly children's programs), children's video programs, as well as nursery rhymes and trending children's songs from Douyin. Recordings for elderly participants cover vehicle control commands, map navigation, audiobooks with a specific focus on programs favored by seniors, and music genres preferred by the elderly.
提供机构:
maas
创建时间:
2024-06-05
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集是一个中文普通话语音库,专门针对儿童和老年人群体设计,覆盖中国七大方言区域,包含车载命令、有声读物和地图导航等录音内容,总时长为400小时,由800名说话人录制,数据格式为16KHz/16bit WAV。它适用于大型语音模型的测试和训练,并受Apache 2.0许可,但需注意可能包含个人敏感信息。
以上内容由遇见数据集搜集并总结生成



