哈萨克语ASR标注
收藏魔搭社区2024-11-09 更新2024-11-16 收录
下载链接:
https://modelscope.cn/datasets/LONGMAOSOFT/Kazakh-ASR-annotation
下载链接
链接失效反馈官方服务:
资源简介:
数据集文件元信息以及数据文件,请浏览“数据集文件”页面获取。
当前数据集卡片使用的是默认模版,数据集的贡献者未提供更加详细的数据集介绍,但是您可以通过如下GIT Clone命令,或者ModelScope SDK来下载数据集
#### 下载方法
:modelscope-code[]{type="sdk"}
:modelscope-code[]{type="git"}
# Complete data size
## 47.4GB
# Join the group
https://t.me/+Y5kL2iHis9A0ZWI1
Obtain a complete dataset
Mutual communication within the industry
Get more information and consultation
Timely dataset update notifications
## Or add enterprise WeChat

## Dataset Introduction
### Kazakh ASR annotation
### Version
v1.0
### Release Date
2024-10-23
### Data Description
Data type: Standard Halaqah Arabic, phonetic pronunciation clear and standard, volume moderate and natural; No crying/laughing/sneezing/yawning/snoring/other noises etc.
Collection environment: Indoor or outdoor, no obvious background noise; Not wearing earphones. Mouth distance from the microphone is about 10cm.
Data format: wav format, 16bit, 16KHz, single channel, not subjected to any form of format compression, noise reduction processing, etc.
Collection equipment: Smart phone
Data quantity: 516 people, 300 sentences per person, 10-20 words per sentence, male-female ratio is 3:2
Authorization situation: Signed authorization letter by the recorder
File name: Gend
## Directory Structure
```
root_directory/
├── audio/
│ ├── audio1.wav
```
提供机构:
maas
创建时间:
2024-11-09
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集为哈萨克语自动语音识别(ASR)标注数据,包含47.4GB的WAV格式语音文件,采集自516位说话者,每人录制300句,每句10-20词,数据采集环境标准且已获授权。数据集发布于2024年10月23日,采用Apache 2.0许可证。
以上内容由遇见数据集搜集并总结生成



