five

murodbek/uzbek-speech-corpus

收藏
Hugging Face2025-01-24 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/murodbek/uzbek-speech-corpus
下载链接
链接失效反馈
官方服务:
资源简介:
乌兹别克语音语料库(USC)是由ISSAI和塔什干信息技术大学计算机系统系的图像和语音处理实验室合作开发的。该语料库包含958位不同说话者的转录音频记录,总时长为105小时。为确保高质量,该语料库已由母语说话者手动检查。USC主要用于自动语音识别(ASR),但也可用于辅助其他与语音相关的任务,如语音合成和语音翻译。据我们所知,USC是第一个在学术和商业用途下开放的乌兹别克语音语料库,遵循Creative Commons Attribution 4.0国际许可。

The Uzbek Speech Corpus (USC) is developed in collaboration between ISSAI and the Image and Speech Processing Laboratory in the Department of Computer Systems of the Tashkent University of Information Technologies. The USC comprises 958 different speakers with a total of 105 hours of transcribed audio recordings. It has been manually checked by native speakers to ensure high quality. The USC is primarily designed for automatic speech recognition (ASR), but it can also be used for other speech-related tasks such as speech synthesis and speech translation. To the best of our knowledge, the USC is the first open-source Uzbek speech corpus available for both academic and commercial use under the Creative Commons Attribution 4.0 International License.
提供机构:
murodbek
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作