数据堂—1,012小时印度英语手机采集语音数据

Name: 数据堂—1,012小时印度英语手机采集语音数据
Creator: maas
Published: 2025-11-03 17:43:01
License: 暂无描述

魔搭社区2025-11-03 更新2024-05-15 收录

下载链接：

https://modelscope.cn/datasets/DatatangBeijing/1012Hours-IndianEnglishSpeechDataByMobilePhone

下载链接

链接失效反馈

官方服务：

资源简介：

1,012小时印度英语手机采集语音数据是由2,100名印度本土发音人参与录制；录音文本由语言专家参与设计，涵盖通用、交互、车载、家居等多类别；文本经过人工校对，准确率高；本套印度英语手机采集语音数据可应用于语音识别、机器翻译、声纹识别

This dataset contains 1,012 hours of Indian English speech data collected via mobile phones, with 2,100 native Indian speakers participating in the recording. The accompanying recording texts were designed by language experts, covering various scenarios such as general daily use, interactive communication, in-vehicle environments, and smart home scenarios. All texts have undergone manual proofreading to ensure high accuracy. This Indian English mobile-collected speech dataset can be applied to tasks including speech recognition, machine translation, and speaker verification.

提供机构：

maas

创建时间：

2024-05-07

搜集汇总

数据集介绍