数据堂—593小时中国人说英语手机采集语音数据

Name: 数据堂—593小时中国人说英语手机采集语音数据
Creator: maas
Published: 2026-01-06 16:14:42
License: 暂无描述

魔搭社区2026-01-06 更新2024-05-15 收录

下载链接：

https://modelscope.cn/datasets/DatatangBeijing/593Hour-ChineseSpeakingEnglishSpeechDataByMobilephone

下载链接

链接失效反馈

官方服务：

资源简介：

593小时中国人说英语手机采集语音数据是由3691名中国人参与录制的10万句常用英语句子，覆盖国内江苏、山东、北京、河南等方言区，符合中国人说英语的特定口音。录音文本涵盖常用英语句子，内容丰富，领域广泛，音素均衡。593小时中国人说英语手机采集语音数据可用于改善语音识别系统对中国人说英语的识别效果

The 593-hour mobile-collected speech dataset of Chinese-spoken English consists of 100,000 common English sentences recorded by 3,691 Chinese participants. It covers major Chinese dialect regions including Jiangsu, Shandong, Beijing, Henan and others, and conforms to the specific accent of Chinese speakers when speaking English. The recorded sentences cover a variety of daily English utterances, with rich content, wide domain coverage and balanced phoneme distribution. This dataset can be used to improve the recognition performance of speech recognition systems for English spoken by Chinese speakers.

提供机构：

maas

创建时间：

2024-05-06

搜集汇总

数据集介绍