Datatang: 240 Hours of Hindi Speech Data Collected by Mobile Phone (Read Speech)

Name: Datatang: 240 Hours of Hindi Speech Data Collected by Mobile Phone (Read Speech)
Creator: maas
Published: 2025-11-03 18:03:07
License: No description available

ModelScope community. Updated 2025-11-03; indexed 2024-05-15.

Download link:

https://modelscope.cn/datasets/DatatangBeijing/240Hours-HindiSpeechDataByMobilePhone

Resource overview:

This 240-hour Hindi speech dataset was collected on mobile phones and recorded by 401 speakers from India. The recordings cover both quiet and noisy environments, which brings them closer to real-world speech recognition conditions. The content spans multiple domains, including economy, entertainment, news, and spoken language. All text was transcribed manually with high accuracy. The dataset can be applied to speech recognition, machine translation, and speaker (voiceprint) recognition.

Provider:

maas

Created:

2024-05-06

Collection summary

Dataset introduction