Uzbek Speech Corpus
收藏DataCite Commons2021-07-27 更新2024-07-13 收录
下载链接:
https://issai.nu.edu.kz/uzbek-asr/
下载链接
链接失效反馈官方服务:
资源简介:
The Uzbek speech corpus (USC) has been developed in academic collaboration between ISSAI and TUIT. The USC comprises 958 different speakers with a total of 105 hours of transcribed audio recordings. To ensure high quality, the USC has been manually checked by native speakers. The USC is primarily designed for the ASR task, however, it can also be used to aid other speech-related tasks, such as speech synthesis and speech translation. To the best of our knowledge, the USC is the first open-source Uzbek speech corpus available for both academic and commercial use under the Creative Commons Attribution 4.0 International License. We expect that the USC will be a valuable resource for the general speech research community and become the baseline dataset for Uzbek ASR research.
提供机构:
Computer Systems, Tashkent University of Information Technology named after Muhammad Al-Khwarizmi and Institute of Smart Systems and Artificial Intelligence, Nazarbayev University
创建时间:
2021-07-27



