ESpeech/ESpeech-buldjat

Name: ESpeech/ESpeech-buldjat
Creator: ESpeech
Published: 2025-08-25 12:30:46
License: 暂无描述

Hugging Face2025-08-25 更新2025-09-13 收录

下载链接：

https://hf-mirror.com/datasets/ESpeech/ESpeech-buldjat

下载链接

链接失效反馈

官方服务：

资源简介：

Buldjat YouTube音频数据集包含从Buldjat YouTube频道提取的54小时的音频片段及其对应的元数据。每个音频文件代表频道视频内容的一个片段，以44.1kHz的采样率处理。数据集适用于文本到语音、自动语音识别和质量评估任务，包含俄语文本，音频格式为MP3，文件结构为分段的音频文件和JSON格式的元数据。

The Buldjat YouTube Audio Dataset contains 54 hours of processed audio segments extracted from the Buldjat YouTube channel with corresponding metadata. Each audio file represents a segment from the channels videos and content, processed at 44.1kHz sample rate. The dataset is suitable for text-to-speech, automatic speech recognition, and quality assessment tasks, containing Russian text, with audio format in MP3, and file structure of segmented audio files with JSON metadata.

提供机构：

ESpeech

5,000+

优质数据集

54 个

任务类型

进入经典数据集