数据堂—738小时维语手机采集语音数据

Name: 数据堂—738小时维语手机采集语音数据
Creator: maas
Published: 2025-12-04 11:31:35
License: 暂无描述

魔搭社区2025-12-04 更新2024-05-15 收录

下载链接：

https://modelscope.cn/datasets/DatatangBeijing/738Hours_UyghurSpeechDataByMobilePhone

下载链接

链接失效反馈

官方服务：

资源简介：

738小时维语手机采集语音数据由2,058名来自维吾尔族聚居区的人参与录制，男女均衡。录音内容为30万维语口语化句子，录音环境为安静的室内。738小时维语手机采集语音数据所有句子均经过人工精准转写，并标注了噪音标识。

738-hour Uyghur speech data collected via mobile phones was recorded by 2,058 participants from Uyghur-concentrated regions, with an equal gender distribution. The dataset includes 300,000 colloquial Uyghur sentences, and all recordings were carried out in quiet indoor environments. All sentences in this dataset have undergone precise manual transcription and are annotated with noise labels.

提供机构：

maas

创建时间：

2024-05-06

搜集汇总

数据集介绍