five

XBMU-AMDO31: A Speech Recognition Dataset for the Amdo Dialect of Tibetan

收藏
DataCite Commons2025-06-27 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=301d4cae68bd43ff9e3797ce3fd37c18
下载链接
链接失效反馈
官方服务:
资源简介:
The speech dataset was collected in the Xiahe region of Gansu Province, China, encompassing 31 hours of recordings from 66 native Tibetan speakers, along with their corresponding transcriptions. The dataset comprises 33 males and 33 females. The speech dataset is divided into training, development, and test sets. The training set comprises 18,505 sentences recorded from 54 speakers; the development set includes 2,045 sentences from 6 speakers; and the test set contains 2,035 sentences from 6 speakers.
提供机构:
Science Data Bank
创建时间:
2025-06-27
二维码
社区交流群
二维码
科研交流群
商业服务