
A dataset of Mongolian, Tibetan and Uyghur speech fragments based on voice activity detection

Source: Science Data Bank. Updated 2020-08-14. Indexed 2026-04-23.
Download link:
https://www.scidb.cn/en/detail?dataSetId=743522838262054912
Resource description:
Starting from Mongolian, Tibetan, and Uyghur speech data from Chinese minority regions in 2015, we applied a double-threshold voice activity detection (VAD) method based on short-time energy and short-time zero-crossing rate to split each spoken sentence into multiple voice fragments. The resulting dataset contains 1657 Mongolian, 666 Tibetan, and 756 Uyghur speech fragments, with a total volume of about 111 MB. The fragments were produced by automatic software segmentation and then audited and proofread multiple times by language experts. The resulting high-quality voice fragment data can be applied to minority-language speech recognition, voice activity detection, speech enhancement, speech synthesis, and language teaching.
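For readers unfamiliar with the technique named in the description, the following is a minimal sketch of a double-threshold VAD that uses short-time energy and short-time zero-crossing rate. It is not the dataset authors' implementation. The function names, frame sizes (400-sample frames with a 160-sample hop, i.e. 25 ms and 10 ms at an assumed 16 kHz sample rate), and every threshold value are assumptions chosen for illustration, since the description gives none of them.

```python
def frame_features(samples, frame_len=400, hop=160):
    """Return per-frame short-time energy and zero-crossing rate."""
    energies, zcrs = [], []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame))
        # Count sign changes between consecutive samples.
        crossings = sum(1 for a, b in zip(frame, frame[1:])
                        if (a >= 0) != (b >= 0))
        zcrs.append(crossings / (frame_len - 1))
    return energies, zcrs


def double_threshold_vad(samples, frame_len=400, hop=160,
                         high_ratio=0.25, low_ratio=0.05,
                         zcr_thresh=0.25, max_zcr_frames=25, min_frames=5):
    """Return a list of (start_sample, end_sample) voice fragments.

    All ratios and limits are illustrative assumptions, not values
    taken from the dataset description.
    """
    energies, zcrs = frame_features(samples, frame_len, hop)
    if not energies or max(energies) == 0:
        return []
    peak = max(energies)
    high, low = high_ratio * peak, low_ratio * peak

    spans = []  # (first_frame, last_frame), inclusive
    i, n = 0, len(energies)
    while i < n:
        if energies[i] < high:
            i += 1
            continue
        # 1) Core region: energy stays above the high threshold.
        start = end = i
        while end + 1 < n and energies[end + 1] >= high:
            end += 1
        # 2) Widen the region while energy stays above the low threshold.
        while start > 0 and energies[start - 1] >= low:
            start -= 1
        while end + 1 < n and energies[end + 1] >= low:
            end += 1
        # 3) Widen further over high-ZCR frames (weak unvoiced sounds),
        #    capped so that noisy silence cannot extend it indefinitely.
        steps = 0
        while start > 0 and steps < max_zcr_frames and zcrs[start - 1] >= zcr_thresh:
            start -= 1
            steps += 1
        steps = 0
        while end + 1 < n and steps < max_zcr_frames and zcrs[end + 1] >= zcr_thresh:
            end += 1
            steps += 1
        # Merge with the previous span if the extensions overlap it.
        if spans and start <= spans[-1][1]:
            spans[-1] = (spans[-1][0], end)
        else:
            spans.append((start, end))
        i = end + 1

    # Drop very short spans and convert frame indices to sample indices.
    return [(s * hop, e * hop + frame_len)
            for s, e in spans if e - s + 1 >= min_frames]
```

Calling `double_threshold_vad(samples)` on a list of float samples returns sample-index pairs that could be used to cut a sentence recording into fragments. In practice the thresholds would likely need tuning per recording condition, and the description indicates that the automatic output was followed by manual review.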
Provided by:
Tursun Kadir; Institute of Acoustics, Chinese Academy of Sciences
Created:
2020-08-14