Yub-0/jenny-tts-text-tags-6h-v1
收藏Hugging Face2026-04-02 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Yub-0/jenny-tts-text-tags-6h-v1
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: file_name
dtype: string
- name: text
dtype: string
- name: transcription_normalised
dtype: string
- name: utterance_pitch_mean
dtype: float32
- name: utterance_pitch_std
dtype: float32
- name: snr
dtype: float64
- name: c50
dtype: float64
- name: speaking_rate
dtype: string
- name: phonemes
dtype: string
- name: stoi
dtype: float64
- name: si-sdr
dtype: float64
- name: pesq
dtype: float64
- name: noise
dtype: string
- name: reverberation
dtype: string
- name: speech_monotony
dtype: string
- name: sdr_noise
dtype: string
- name: pesq_speech_quality
dtype: string
splits:
- name: train
num_bytes: 2063542
num_examples: 4000
download_size: 1000931
dataset_size: 2063542
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
数据集信息:
特征项:
- 名称:文件名(file_name),数据类型:字符串
- 名称:原始文本(text),数据类型:字符串
- 名称:归一化转录文本(transcription_normalised),数据类型:字符串
- 名称:语句基频均值(utterance_pitch_mean),数据类型:float32
- 名称:语句基频标准差(utterance_pitch_std),数据类型:float32
- 名称:信噪比(Signal-to-Noise Ratio, SNR)(snr),数据类型:float64
- 名称:C50语音清晰度指标(c50),数据类型:float64
- 名称:言语速率(speaking_rate),数据类型:字符串
- 名称:音素(phonemes),数据类型:字符串
- 名称:短时客观可懂度(Short-Time Objective Intelligibility, STOI)(stoi),数据类型:float64
- 名称:尺度不变信号失真比(Scale-Invariant Signal-to-Distortion Ratio, SI-SDR)(si-sdr),数据类型:float64
- 名称:语音质量感知评估(Perceptual Evaluation of Speech Quality, PESQ)(pesq),数据类型:float64
- 名称:噪声类型(noise),数据类型:字符串
- 名称:混响参数(reverberation),数据类型:字符串
- 名称:言语单调性(speech_monotony),数据类型:字符串
- 名称:噪声信号失真比(sdr_noise),数据类型:字符串
- 名称:PESQ语音质量评分(pesq_speech_quality),数据类型:字符串
划分集:
- 名称:训练集(train),字节占用:2063542,样本数量:4000
下载大小:1000931
数据集总大小:2063542
配置项:
- 配置名称:默认配置(default),数据文件:
- 划分集:训练集(train),路径:data/train-*
提供机构:
Yub-0



