mesolitica/tts-combine-annotated
Source: Hugging Face. Updated 2025-05-30. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/mesolitica/tts-combine-annotated
Description:
This is a Malay speech dataset annotated with the following features: transcription text, speaker name, speaker ID, gender, mean and standard deviation of utterance pitch, signal-to-noise ratio (SNR), C50, speech duration, and the speech quality metrics STOI, SI-SDR, and PESQ. The data is provided as a training split of 360,298 samples, approximately 187 MB in size. It covers 8 speakers, with a total duration of approximately 713 hours.
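
The summary figures above imply an average clip length of roughly 7.1 seconds (713 hours spread over 360,298 samples). The sketch below works that out and shows one way the quality annotations might be used to filter rows. The column names `snr` and `pesq` and the threshold values are assumptions made for illustration, not confirmed by this listing; the repository ID is taken from the download link, and `load_train` needs network access and the `datasets` package.

```python
def mean_clip_seconds(total_hours: float, num_samples: int) -> float:
    """Average clip length in seconds implied by the summary figures."""
    return total_hours * 3600 / num_samples


def keep_clean(row: dict, min_snr: float = 20.0, min_pesq: float = 3.0) -> bool:
    """Return True if a row passes simple quality thresholds.

    The keys "snr" and "pesq" are assumed column names, and the
    thresholds are arbitrary illustrative values.
    """
    return row["snr"] >= min_snr and row["pesq"] >= min_pesq


def load_train():
    """Load the train split from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the rest of the sketch runs without `datasets` installed.
    from datasets import load_dataset
    return load_dataset("mesolitica/tts-combine-annotated", split="train")


if __name__ == "__main__":
    # Figures taken from the description: 713 hours, 360,298 samples.
    print(round(mean_clip_seconds(713, 360_298), 2))
```

If the real column names differ, only the keys in `keep_clean` need to change; inspecting the loaded dataset's `column_names` attribute would confirm them.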
Provider:
mesolitica



