Children Speech Recognition Challenge (CSRC) 2021
收藏arXiv2020-11-16 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2011.06724v2
下载链接
链接失效反馈官方服务:
资源简介:
儿童语音识别挑战赛(CSRC)2021数据集由西北工业大学音频、语音与语言处理组等机构创建,旨在通过提供大规模的儿童语音数据来推动儿童语音识别技术的发展。该数据集包含400小时的普通话语音数据,其中包括340小时的成人语音、30小时的儿童朗读语音和30小时的儿童对话语音。数据集的创建过程涉及对不同年龄段儿童和成人语音的收集与标注。该数据集主要应用于儿童语音识别系统的开发和评估,特别是在计算机辅助语言学习(CALL)和智能玩具等领域。
The Children's Speech Recognition Challenge (CSRC) 2021 Dataset was created by the Audio, Speech and Language Processing Group of Northwestern Polytechnical University and other institutions. It aims to promote the development of children's speech recognition technologies by providing large-scale Mandarin speech data. This dataset contains 400 hours of Mandarin speech data, consisting of 340 hours of adult speech, 30 hours of children's read speech, and 30 hours of children's conversational speech. The construction of this dataset involves the collection and annotation of speech data from children and adults across various age groups. It is primarily used for the development and evaluation of children's speech recognition systems, particularly in domains such as Computer-Assisted Language Learning (CALL) and smart toys.
提供机构:
西北工业大学音频、语音与语言处理组
创建时间:
2020-11-13



