vrclc/festvox-iiith-ml

Name: vrclc/festvox-iiith-ml
Creator: vrclc
Published: 2024-01-03 10:14:34
License: 暂无描述

Hugging Face2024-01-03 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/vrclc/festvox-iiith-ml

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含音频、语音ID、说话者ID和转录文本等特征。数据集分为训练集，包含1000个样本，总大小为187936686字节。数据集主要用于自动语音识别和文本到语音转换任务，语言为马拉雅拉姆语。数据集的名称是Festvox IIITH Malayalam。

This dataset includes features such as audio, voice ID, speaker ID, and transcribed text. The dataset is divided into a training set which contains 1000 samples with a total size of 187936686 bytes. It is primarily used for automatic speech recognition (ASR) and text-to-speech (TTS) tasks, with the language being Malayalam. The name of this dataset is Festvox IIITH Malayalam.

提供机构：

vrclc

原始信息汇总

数据集概述

数据特征

audio: 音频数据
speech_id: 字符串类型的语音标识
speaker_id: 字符串类型的说话人标识
transcript: 字符串类型的转录文本

数据分割

train: 训练集，包含1000个样本，总大小为187936686字节

数据集大小

下载大小: 180001519字节
数据集大小: 187936686字节

配置

default: 默认配置，包含训练集数据文件路径为data/train-*

任务类别

自动语音识别
文本到语音

语言

ml: 马拉雅拉姆语

数据集名称

Festvox IIITH Malayalam: 数据集的友好名称

数据集规模

n<1K: 数据集规模小于1000

5,000+

优质数据集

54 个

任务类型

进入经典数据集