Beehzod/stt_data

Name: Beehzod/stt_data
Creator: Beehzod
Published: 2024-07-16 07:34:32
License: 暂无描述

Hugging Face2024-07-16 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/Beehzod/stt_data

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是一个用于自动语音识别任务的乌兹别克语数据集，包含8个训练样本。每个样本包含文件名和转录文本，但音频字段为空。数据集的总大小为2270字节，下载大小为4030字节。

This dataset is designed for automatic speech recognition tasks, supporting Uzbek language. It includes three main features: file name, transcription, and audio. The dataset is divided into a training set, containing 8 samples with a total size of 2270 bytes. The download size of the dataset is 4030 bytes, and the dataset size is 2270 bytes.

提供机构：

Beehzod

原始信息汇总

数据集概述

语言

乌兹别克语 (uz)

许可证

Apache 2.0

任务类别

自动语音识别 (automatic-speech-recognition)

数据集信息

特征

file_name: 文件名，数据类型为字符串 (string)
transcription: 转录文本，数据类型为字符串 (string)
audio: 音频数据，数据类型为空 (null)

数据分割

train: 训练集，包含8个样本，占用2270字节

数据大小

下载大小: 4030字节
数据集大小: 2270字节

配置

config_name: default
- data_files:
  - split: train
  - path: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集