ylacombe/parler-tts-mini-v1-fast_speaker_similarity
收藏Hugging Face2024-07-08 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/ylacombe/parler-tts-mini-v1-fast_speaker_similarity
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如原始文本、说话者ID、路径、章节ID、ID、信噪比、C50、语音持续时间、语速、音素、STOI、SI-SDR、PESQ、性别、音高标准差、音高均值、音高、噪声、混响、语音单调性、噪声SDR、语音质量PESQ、口音、文本描述、音频、文本、生成音频和相似度等。数据集分为训练集,包含340个样本,总大小为288841772字节,下载大小为186231508字节。
The dataset includes multiple features related to speech, such as original text, speaker ID, audio file path, chapter ID, SNR, C50, speech duration, speaking rate, phonemes, STOI, SI-SDR, PESQ, gender, pitch statistics, noise, reverberation, speech monotony, SDR noise, PESQ speech quality, accent, text description, audio file, generated audio, and similarity. These features are used to analyze and evaluate various attributes of speech. The dataset is divided into a training set with 340 samples.
提供机构:
ylacombe
原始信息汇总
数据集概述
数据集信息
特征
- text_original: 字符串类型
- speaker_id: 字符串类型
- path: 字符串类型
- chapter_id: 字符串类型
- id: 字符串类型
- snr: 浮点数类型 (float32)
- c50: 浮点数类型 (float32)
- speech_duration: 浮点数类型 (float32)
- speaking_rate: 字符串类型
- phonemes: 字符串类型
- stoi: 浮点数类型 (float32)
- si-sdr: 浮点数类型 (float32)
- pesq: 浮点数类型 (float32)
- gender: 字符串类型
- utterance_pitch_std: 浮点数类型 (float32)
- utterance_pitch_mean: 浮点数类型 (float32)
- pitch: 字符串类型
- noise: 字符串类型
- reverberation: 字符串类型
- speech_monotony: 字符串类型
- sdr_noise: 字符串类型
- pesq_speech_quality: 字符串类型
- accent: 字符串类型
- text_description: 字符串类型
- audio: 音频类型,采样率为16000
- text: 字符串类型
- generated_audio: 音频类型,采样率为16000
- similarity: 浮点数类型 (float64)
数据分割
- train: 包含340个样本,占用288841772.0字节
数据集大小
- 下载大小: 186231508字节
- 数据集大小: 288841772.0字节
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*
- data_files:



