XBMU-AMDO31: A Speech Recognition Dataset for the Amdo Dialect of Tibetan
收藏科学数据银行2025-06-04 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=301d4cae68bd43ff9e3797ce3fd37c18
下载链接
链接失效反馈官方服务:
资源简介:
The speech dataset was collected in the Xiahe region of Gansu Province, China, encompassing 31 hours of recordings from 66 native Tibetan speakers, along with their corresponding transcriptions. The dataset comprises 33 males and 33 females. The speech dataset is divided into training, development, and test sets. The training set comprises 18,505 sentences recorded from 54 speakers; the development set includes 2,045 sentences from 6 speakers; and the test set contains 2,035 sentences from 6 speakers.
提供机构:
西北民族大学
创建时间:
2025-06-04



