A dataset of Mongolian, Tibetan and Uyghur speech fragments based on voice activity detection

Name: A dataset of Mongolian, Tibetan and Uyghur speech fragments based on voice activity detection
Creator: Tursun Kadir; Institute of Acoustics, Chinese Academy of Sciences
Published: 2020-08-14 00:00:00
License: No description available

Science Data Bank; updated 2020-08-14; indexed 2026-04-23

Download link:

https://www.scidb.cn/en/detail?dataSetId=743522838262054912

Description:

Starting from 2015 Mongolian, Tibetan, and Uyghur speech data from Chinese minority regions, we applied a double-threshold voice activity detection (VAD) method based on short-time energy and short-time zero-crossing rate to split each recorded sentence into multiple voice fragments. The resulting dataset contains 1657 Mongolian speech fragments, 666 Tibetan speech fragments, and 756 Uyghur speech fragments, with a total volume of about 111 MB. The fragments were segmented automatically by software and then audited and proofread multiple times by language experts, yielding high-quality Mongolian, Tibetan, and Uyghur voice fragment data that can be applied to minority-language speech recognition, voice activity detection, speech enhancement, speech synthesis, and language teaching.
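
For readers unfamiliar with the technique, the following is a minimal sketch of a generic double-threshold VAD using short-time energy and zero-crossing rate. It is not the authors' implementation: the function names, the frame length, the hop size, every threshold value, and the cap on zero-crossing extension are assumptions chosen for illustration, and the dataset description does not state the parameters actually used.

```python
import math


def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames; a trailing partial frame is dropped."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Sum of squared sample values in one frame."""
    return sum(x * x for x in frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def double_threshold_vad(samples, frame_len=256, hop=128,
                         energy_high=1.0, energy_low=0.1,
                         zcr_thresh=0.3, max_zcr_frames=10):
    """Return a list of (start_sample, end_sample) speech segments.

    All default parameter values are hypothetical, not taken from the dataset.
    """
    frames = frame_signal(samples, frame_len, hop)
    energy = [short_time_energy(f) for f in frames]
    zcr = [zero_crossing_rate(f) for f in frames]
    n = len(frames)
    segments = []  # (first_frame, last_frame) index pairs
    i = 0
    while i < n:
        # Step 1: a segment is seeded by a frame above the high energy threshold.
        if energy[i] < energy_high:
            i += 1
            continue
        start = end = i
        # Step 2: widen the segment while energy stays above the low threshold.
        while start > 0 and energy[start - 1] >= energy_low:
            start -= 1
        while end + 1 < n and energy[end + 1] >= energy_low:
            end += 1
        # Step 3: widen a few more frames where the zero-crossing rate is high,
        # which tends to recover weak unvoiced consonants at the edges.
        steps = 0
        while start > 0 and zcr[start - 1] >= zcr_thresh and steps < max_zcr_frames:
            start -= 1
            steps += 1
        steps = 0
        while end + 1 < n and zcr[end + 1] >= zcr_thresh and steps < max_zcr_frames:
            end += 1
            steps += 1
        # Merge with the previous segment if the extension made them overlap.
        if segments and start <= segments[-1][1]:
            segments[-1] = (segments[-1][0], end)
        else:
            segments.append((start, end))
        i = end + 1
    # Convert frame indices to sample offsets.
    return [(s * hop, e * hop + frame_len) for s, e in segments]
```

Calling `double_threshold_vad(samples)` on a list of floating-point samples returns sample-offset pairs that could be used to cut a sentence recording into fragments. In practice the thresholds are usually tuned to the recording conditions, for example from the energy of an initial stretch assumed to be silence.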

Provider:

Tursun Kadir; Institute of Acoustics, Chinese Academy of Sciences

Created:

2020-08-14