ylacombe/libritts-r-text-tags-v4
收藏Hugging Face2024-06-10 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/ylacombe/libritts-r-text-tags-v4
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: clean
features:
- name: text
dtype: string
- name: text_original
dtype: string
- name: speaker_id
dtype: string
- name: path
dtype: string
- name: chapter_id
dtype: string
- name: id
dtype: string
- name: snr
dtype: float32
- name: c50
dtype: float32
- name: speech_duration
dtype: float64
- name: speaking_rate
dtype: string
- name: phonemes
dtype: string
- name: stoi
dtype: float64
- name: si-sdr
dtype: float64
- name: pesq
dtype: float64
- name: gender
dtype: string
- name: utterance_pitch_std
dtype: float64
- name: utterance_pitch_mean
dtype: float64
- name: pitch
dtype: string
- name: noise
dtype: string
- name: reverberation
dtype: string
- name: speech_monotony
dtype: string
- name: sdr_noise
dtype: string
- name: pesq_speech_quality
dtype: string
- name: accent
dtype: string
splits:
- name: dev.clean
num_bytes: 4380751
num_examples: 5736
- name: test.clean
num_bytes: 3858412
num_examples: 4837
- name: train.clean.100
num_bytes: 25619631
num_examples: 33232
- name: train.clean.360
num_bytes: 90774234
num_examples: 116426
download_size: 45439112
dataset_size: 124633028
- config_name: other
features:
- name: text
dtype: string
- name: text_original
dtype: string
- name: speaker_id
dtype: string
- name: path
dtype: string
- name: chapter_id
dtype: string
- name: id
dtype: string
- name: snr
dtype: float32
- name: c50
dtype: float32
- name: speech_duration
dtype: float64
- name: speaking_rate
dtype: string
- name: phonemes
dtype: string
- name: stoi
dtype: float64
- name: si-sdr
dtype: float64
- name: pesq
dtype: float64
- name: gender
dtype: string
- name: utterance_pitch_std
dtype: float64
- name: utterance_pitch_mean
dtype: float64
- name: pitch
dtype: string
- name: noise
dtype: string
- name: reverberation
dtype: string
- name: speech_monotony
dtype: string
- name: sdr_noise
dtype: string
- name: pesq_speech_quality
dtype: string
- name: accent
dtype: string
splits:
- name: dev.other
num_bytes: 3376254
num_examples: 4613
- name: test.other
num_bytes: 3670706
num_examples: 5120
- name: train.other.500
num_bytes: 154275204
num_examples: 205035
download_size: 57435880
dataset_size: 161322164
configs:
- config_name: clean
data_files:
- split: dev.clean
path: clean/dev.clean-*
- split: test.clean
path: clean/test.clean-*
- split: train.clean.100
path: clean/train.clean.100-*
- split: train.clean.360
path: clean/train.clean.360-*
- config_name: other
data_files:
- split: dev.other
path: other/dev.other-*
- split: test.other
path: other/test.other-*
- split: train.other.500
path: other/train.other.500-*
---
提供机构:
ylacombe
原始信息汇总
数据集概述
数据集配置
配置名称:clean
特征
text: 字符串text_original: 字符串speaker_id: 字符串path: 字符串chapter_id: 字符串id: 字符串snr: 浮点数 (float32)c50: 浮点数 (float32)speech_duration: 浮点数 (float64)speaking_rate: 字符串phonemes: 字符串stoi: 浮点数 (float64)si-sdr: 浮点数 (float64)pesq: 浮点数 (float64)gender: 字符串utterance_pitch_std: 浮点数 (float64)utterance_pitch_mean: 浮点数 (float64)pitch: 字符串noise: 字符串reverberation: 字符串speech_monotony: 字符串sdr_noise: 字符串pesq_speech_quality: 字符串accent: 字符串
数据分割
dev.clean:- 字节数: 4380751
- 样本数: 5736
test.clean:- 字节数: 3858412
- 样本数: 4837
train.clean.100:- 字节数: 25619631
- 样本数: 33232
train.clean.360:- 字节数: 90774234
- 样本数: 116426
数据文件
dev.clean:clean/dev.clean-*test.clean:clean/test.clean-*train.clean.100:clean/train.clean.100-*train.clean.360:clean/train.clean.360-*
下载大小
- 45439112 字节
数据集大小
- 124633028 字节
配置名称:other
特征
text: 字符串text_original: 字符串speaker_id: 字符串path: 字符串chapter_id: 字符串id: 字符串snr: 浮点数 (float32)c50: 浮点数 (float32)speech_duration: 浮点数 (float64)speaking_rate: 字符串phonemes: 字符串stoi: 浮点数 (float64)si-sdr: 浮点数 (float64)pesq: 浮点数 (float64)gender: 字符串utterance_pitch_std: 浮点数 (float64)utterance_pitch_mean: 浮点数 (float64)pitch: 字符串noise: 字符串reverberation: 字符串speech_monotony: 字符串sdr_noise: 字符串pesq_speech_quality: 字符串accent: 字符串
数据分割
dev.other:- 字节数: 3376254
- 样本数: 4613
test.other:- 字节数: 3670706
- 样本数: 5120
train.other.500:- 字节数: 154275204
- 样本数: 205035
数据文件
dev.other:other/dev.other-*test.other:other/test.other-*train.other.500:other/train.other.500-*
下载大小
- 57435880 字节
数据集大小
- 161322164 字节



