Pratik-B/jenny-tts-tags-6h-v1
收藏Hugging Face2025-01-31 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Pratik-B/jenny-tts-tags-6h-v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频文件的多种特征信息,如文件名、文本内容、标准化后的转录文本、音高平均值和标准差、信噪比、c50值、说话速率、音素信息、短时客观 intelligibility (STOI) 值、信号与失真加噪声比 (SI-SDR) 值以及感知评价质量 (PESQ) 值。数据集划分为训练集,共有4000个示例。
The dataset includes various features of audio files, such as file name, text content, normalized transcription, mean and standard deviation of pitch, signal-to-noise ratio, c50 value, speaking rate, phoneme information, Short-Time Objective Intelligibility (STOI) value, Signal-to-Distortion-Plus-Noise Ratio (SI-SDR) value, and Perceptual Evaluation of Speech Quality (PESQ) value. The dataset is split into a training set with a total of 4,000 examples.
提供机构:
Pratik-B



