yasalma/TatSC_ASR

Name: yasalma/TatSC_ASR
Creator: yasalma
Published: 2025-04-29 19:25:00
License: 暂无描述

Hugging Face2025-04-29 更新2025-07-05 收录

下载链接：

https://hf-mirror.com/datasets/yasalma/TatSC_ASR

下载链接

链接失效反馈

官方服务：

资源简介：

Tatar Speech Corpus ASR是一个包含塔塔尔语语音的语料库，适用于自动语音识别（ASR）任务。该数据集是第一个开源的塔塔尔语音语料库，包括众包和有声读物两部分，共有269.1小时的转录语音，包含271,914个语句。数据集包含音频文件、文本转录、音频时长、文件唯一标识符和说话者标识符等字段。

Tatar Speech Corpus ASR is a speech dataset for Tatar language aimed at automatic speech recognition (ASR) tasks. It is the first open-source Tatar speech corpus, including both crowdsourced and audiobooks data, with a total of 269.1 hours of transcribed speech containing 271,914 utterances. The dataset includes fields such as audio files, text transcriptions, audio duration, unique file identifiers, and speaker identifiers.

提供机构：

yasalma

5,000+

优质数据集

54 个

任务类型

进入经典数据集