XBMU-AMDO31: A Speech Recognition Dataset for the Amdo Dialect of Tibetan

Name: XBMU-AMDO31: A Speech Recognition Dataset for the Amdo Dialect of Tibetan
Creator: Science Data Bank
Published: 2025-06-27 01:47:56
License: No description available

Source: DataCite Commons. Updated: 2025-06-27. Indexed: 2026-05-05.

Download link:

https://www.scidb.cn/detail?dataSetId=301d4cae68bd43ff9e3797ce3fd37c18

Resource description:

The speech dataset was collected in the Xiahe region of Gansu Province, China. It contains 31 hours of recordings from 66 native Tibetan speakers (33 male and 33 female), together with the corresponding transcriptions. The data is divided into training, development, and test sets: the training set comprises 18,505 sentences from 54 speakers, the development set 2,045 sentences from 6 speakers, and the test set 2,035 sentences from 6 speakers.
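
The split figures above can be sanity-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the dataset's tooling: the names `SPLITS`, `TOTAL_HOURS`, and `summarize` are hypothetical, the numbers are copied from the description, and the speaker total assumes no speaker appears in more than one split (which the 54 + 6 + 6 = 66 breakdown suggests but the description does not state outright).

```python
# Split sizes copied from the dataset description (hypothetical variable names).
SPLITS = {
    "train": {"sentences": 18505, "speakers": 54},
    "dev": {"sentences": 2045, "speakers": 6},
    "test": {"sentences": 2035, "speakers": 6},
}
TOTAL_HOURS = 31  # total recording time stated in the description


def summarize(splits, total_hours):
    """Return totals, per-split sentence shares, and mean sentence length."""
    total_sentences = sum(s["sentences"] for s in splits.values())
    # Assumes speakers are disjoint across splits, so the counts can be added.
    total_speakers = sum(s["speakers"] for s in splits.values())
    shares = {
        name: s["sentences"] / total_sentences for name, s in splits.items()
    }
    # Rough average only: 31 hours is itself a rounded figure.
    mean_seconds = total_hours * 3600 / total_sentences
    return {
        "total_sentences": total_sentences,
        "total_speakers": total_speakers,
        "shares": shares,
        "mean_seconds_per_sentence": mean_seconds,
    }


if __name__ == "__main__":
    summary = summarize(SPLITS, TOTAL_HOURS)
    print("sentences:", summary["total_sentences"])
    print("speakers:", summary["total_speakers"])
    for name, share in summary["shares"].items():
        print(f"{name}: {share:.1%} of sentences")
    print(f"mean length: {summary['mean_seconds_per_sentence']:.2f} s")
```

Under these assumptions the splits sum to 22,585 sentences and 66 speakers, which matches the stated speaker count, and the average sentence comes out at roughly five seconds of audio.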

Provider:

Science Data Bank

Created:

2025-06-27