XBMU-bo-Lhasa31: A Speech Recognition Dataset for the Lhasa Dialect of Tibetan
Science Data Bank | Updated 2025-06-20 | Indexed 2026-04-23
Download link:
https://www.scidb.cn/detail?dataSetId=bdd9e8849a584d7d9152163022e58c6c
Resource description:
The dataset consists of audio files, text files, and description files. (1) wav is the audio folder, divided into 51 subfolders by speaker. It contains 24,289 speech samples with a total duration of 31.61 hours, an average duration of 4.68 seconds per sample, and a total size of 2.68 GB. (2) The text in the transcript file corresponds one-to-one to the audio files. All text is drawn from the news domain, and non-pronounced symbols in the text have been normalized. (3) readme.txt contains basic information about the dataset. (4) resource_lexicon.txt is the pronunciation lexicon file.
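The layout described above can be checked against the published figures (51 speakers, 24,289 samples, 31.61 hours) after download. The following is a minimal sketch, not part of the dataset's own tooling. It assumes the archive unpacks to a directory with one subfolder per speaker, that the audio files use the `.wav` extension, and that they are PCM WAV files readable by Python's standard `wave` module; none of these details is confirmed by the description. The function names `summarize_corpus` and `report` are hypothetical.

```python
import wave
from pathlib import Path


def summarize_corpus(wav_root):
    """Return {speaker_folder_name: (utterance_count, total_seconds)}.

    Assumes wav_root holds one subfolder per speaker (an assumption
    based on the dataset description, not a verified layout).
    """
    stats = {}
    for speaker_dir in sorted(Path(wav_root).iterdir()):
        if not speaker_dir.is_dir():
            continue
        count, seconds = 0, 0.0
        for wav_path in sorted(speaker_dir.glob("*.wav")):
            # Duration = number of frames / sample rate.
            with wave.open(str(wav_path), "rb") as wf:
                seconds += wf.getnframes() / wf.getframerate()
            count += 1
        stats[speaker_dir.name] = (count, seconds)
    return stats


def report(stats):
    """Aggregate per-speaker stats into corpus-level totals."""
    total_count = sum(count for count, _ in stats.values())
    total_seconds = sum(seconds for _, seconds in stats.values())
    mean_seconds = total_seconds / total_count if total_count else 0.0
    return {
        "speakers": len(stats),
        "utterances": total_count,
        "hours": total_seconds / 3600,
        "mean_seconds": mean_seconds,
    }
```

Calling `report(summarize_corpus("wav"))` on the unpacked audio folder (the path `"wav"` is an assumption) should yield values close to the published ones if the layout assumptions hold. As a sanity check on the published numbers themselves, 31.61 hours × 3600 / 24,289 samples is roughly 4.685 seconds, which agrees with the stated 4.68-second average.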
Provider:
Northwest University for Nationalities
Created:
2025-06-20



