XBMU-AMDO31: A Speech Recognition Dataset for the Amdo Dialect of Tibetan

Name: XBMU-AMDO31: A Speech Recognition Dataset for the Amdo Dialect of Tibetan
Creator: Northwest Minzu University (西北民族大学)
Published: 2025-06-04 00:00:00
License: No description available

Source: Science Data Bank (updated 2025-06-04, indexed 2026-04-23)

Download link:

https://www.scidb.cn/detail?dataSetId=301d4cae68bd43ff9e3797ce3fd37c18

Resource description:

The speech dataset was collected in the Xiahe region of Gansu Province, China, and contains 31 hours of recordings from 66 native Tibetan speakers (33 male, 33 female), together with the corresponding transcriptions. It is divided into training, development, and test sets: the training set comprises 18,505 sentences from 54 speakers, the development set 2,045 sentences from 6 speakers, and the test set 2,035 sentences from 6 speakers.
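
As a quick sanity check on the figures in the description, the split sizes can be totalled and compared against the stated speaker count. The sketch below is only illustrative: the dictionary layout and the `summarize` helper are hypothetical names, not part of the dataset's distribution, and it assumes the three splits share no speakers (the description does not say so explicitly, though 54 + 6 + 6 matches the stated 66). The 31-hour figure is treated as approximate.

```python
# Split sizes copied from the dataset description.
SPLITS = {
    "train": {"sentences": 18505, "speakers": 54},
    "dev": {"sentences": 2045, "speakers": 6},
    "test": {"sentences": 2035, "speakers": 6},
}

TOTAL_HOURS = 31  # stated total duration; treated here as approximate


def summarize(splits, total_hours):
    """Return totals, per-split sentence shares, and mean sentence length.

    Hypothetical helper for illustration. The speaker total assumes the
    splits are speaker-disjoint, which the description does not confirm.
    """
    total_sentences = sum(s["sentences"] for s in splits.values())
    total_speakers = sum(s["speakers"] for s in splits.values())
    shares = {
        name: s["sentences"] / total_sentences for name, s in splits.items()
    }
    mean_seconds = total_hours * 3600 / total_sentences
    return total_sentences, total_speakers, shares, mean_seconds


total_sentences, total_speakers, shares, mean_seconds = summarize(
    SPLITS, TOTAL_HOURS
)

print(f"sentences: {total_sentences}, speakers: {total_speakers}")
for name, share in shares.items():
    print(f"{name}: {share:.1%} of sentences")
print(f"mean sentence length: {mean_seconds:.2f} s")
```

Under these assumptions the splits sum to 22,585 sentences and 66 speakers, which agrees with the speaker count given in the description.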

Provided by: Northwest Minzu University (西北民族大学)

Created: 2025-06-04
