Uzbek Speech Corpus

Name: Uzbek Speech Corpus
Creator: Computer Systems, Tashkent University of Information Technology named after Muhammad Al-Khwarizmi and Institute of Smart Systems and Artificial Intelligence, Nazarbayev University
Published: 2021-07-27 04:18:49
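License: Creative Commons Attribution 4.0 International (per the dataset description; the listing itself gives no license details)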

Source: DataCite Commons. Updated 2021-07-27; indexed 2024-07-13.

Download link:

https://issai.nu.edu.kz/uzbek-asr/

Description:

The Uzbek Speech Corpus (USC) was developed through an academic collaboration between the Institute of Smart Systems and Artificial Intelligence (ISSAI) and Tashkent University of Information Technology (TUIT). It comprises 105 hours of transcribed audio recordings from 958 speakers. To ensure high quality, the corpus was manually checked by native speakers. The USC is designed primarily for automatic speech recognition (ASR), but it can also support other speech-related tasks such as speech synthesis and speech translation. To the best of our knowledge, the USC is the first open-source Uzbek speech corpus available for both academic and commercial use under the Creative Commons Attribution 4.0 International License. We expect the USC to be a valuable resource for the general speech research community and to become the baseline dataset for Uzbek ASR research.

Provider:

Computer Systems, Tashkent University of Information Technology named after Muhammad Al-Khwarizmi and Institute of Smart Systems and Artificial Intelligence, Nazarbayev University

Created:

2021-07-27
