ylacombe/libritts-r-text-tags-v3
Source: Hugging Face. Updated 2024-05-29; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/ylacombe/libritts-r-text-tags-v3
Resource description:
---
dataset_info:
- config_name: clean
  features:
  - name: text
    dtype: string
  - name: text_original
    dtype: string
  - name: speaker_id
    dtype: string
  - name: path
    dtype: string
  - name: chapter_id
    dtype: string
  - name: id
    dtype: string
  - name: snr
    dtype: float32
  - name: c50
    dtype: float32
  - name: speech_duration
    dtype: float64
  - name: speaking_rate
    dtype: string
  - name: phonemes
    dtype: string
  - name: stoi
    dtype: float64
  - name: si-sdr
    dtype: float64
  - name: pesq
    dtype: float64
  - name: gender
    dtype: string
  - name: utterance_pitch_std
    dtype: float64
  - name: utterance_pitch_mean
    dtype: float64
  - name: pitch
    dtype: string
  - name: noise
    dtype: string
  - name: reverberation
    dtype: string
  - name: speech_monotony
    dtype: string
  - name: sdr_noise
    dtype: string
  - name: pesq_speech_quality
    dtype: string
  splits:
  - name: dev.clean
    num_bytes: 4282623
    num_examples: 5736
  - name: test.clean
    num_bytes: 3776641
    num_examples: 4837
  - name: train.clean.100
    num_bytes: 25054458
    num_examples: 33232
  - name: train.clean.360
    num_bytes: 88786172
    num_examples: 116426
  download_size: 45389304
  dataset_size: 121899894
- config_name: other
  features:
  - name: text
    dtype: string
  - name: text_original
    dtype: string
  - name: speaker_id
    dtype: string
  - name: path
    dtype: string
  - name: chapter_id
    dtype: string
  - name: id
    dtype: string
  - name: snr
    dtype: float32
  - name: c50
    dtype: float32
  - name: speech_duration
    dtype: float64
  - name: speaking_rate
    dtype: string
  - name: phonemes
    dtype: string
  - name: stoi
    dtype: float64
  - name: si-sdr
    dtype: float64
  - name: pesq
    dtype: float64
  - name: gender
    dtype: string
  - name: utterance_pitch_std
    dtype: float64
  - name: utterance_pitch_mean
    dtype: float64
  - name: pitch
    dtype: string
  - name: noise
    dtype: string
  - name: reverberation
    dtype: string
  - name: speech_monotony
    dtype: string
  - name: sdr_noise
    dtype: string
  - name: pesq_speech_quality
    dtype: string
  splits:
  - name: dev.other
    num_bytes: 3297275
    num_examples: 4613
  - name: test.other
    num_bytes: 3580934
    num_examples: 5120
  - name: train.other.500
    num_bytes: 150846841
    num_examples: 205035
  download_size: 57366901
  dataset_size: 157725050
configs:
- config_name: clean
  data_files:
  - split: dev.clean
    path: clean/dev.clean-*
  - split: test.clean
    path: clean/test.clean-*
  - split: train.clean.100
    path: clean/train.clean.100-*
  - split: train.clean.360
    path: clean/train.clean.360-*
- config_name: other
  data_files:
  - split: dev.other
    path: other/dev.other-*
  - split: test.other
    path: other/test.other-*
  - split: train.other.500
    path: other/train.other.500-*
---
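The metadata above names two configs, each with its own splits. A minimal sketch of how one split might be loaded with the Hugging Face `datasets` library follows. The helper `load_split` and the table `CONFIG_SPLITS` are hypothetical names introduced here for illustration; the config and split names are copied from the metadata, and the call assumes `datasets` is installed and the repository is reachable.

```python
# Config and split names copied from the card's `configs` section.
CONFIG_SPLITS = {
    "clean": ["dev.clean", "test.clean", "train.clean.100", "train.clean.360"],
    "other": ["dev.other", "test.other", "train.other.500"],
}

# Repository id taken from the page title.
REPO_ID = "ylacombe/libritts-r-text-tags-v3"


def load_split(config: str, split: str):
    """Load one split of one config (hypothetical helper).

    Requires the `datasets` package and network access.
    """
    if split not in CONFIG_SPLITS.get(config, []):
        raise ValueError(f"unknown config/split: {config!r}/{split!r}")
    # Imported lazily so the name table is usable without the library.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)


# Example call (downloads data, so it is left commented out):
# dev_clean = load_split("clean", "dev.clean")
# print(dev_clean.column_names)
```

Validating the config and split pair before calling `load_dataset` is a design choice of this sketch: it turns a mistyped name into an immediate local error rather than a failed download.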
Provider:
ylacombe
Summary of original information
Dataset overview
Config name: clean
Features
- text: string
- text_original: string
- speaker_id: string
- path: string
- chapter_id: string
- id: string
- snr: float32
- c50: float32
- speech_duration: float64
- speaking_rate: string
- phonemes: string
- stoi: float64
- si-sdr: float64
- pesq: float64
- gender: string
- utterance_pitch_std: float64
- utterance_pitch_mean: float64
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- sdr_noise: string
- pesq_speech_quality: string
Splits
- dev.clean: 4282623 bytes, 5736 examples
- test.clean: 3776641 bytes, 4837 examples
- train.clean.100: 25054458 bytes, 33232 examples
- train.clean.360: 88786172 bytes, 116426 examples
Download and dataset size
- Download size: 45389304 bytes
- Dataset size: 121899894 bytes
Config name: other
Features
- text: string
- text_original: string
- speaker_id: string
- path: string
- chapter_id: string
- id: string
- snr: float32
- c50: float32
- speech_duration: float64
- speaking_rate: string
- phonemes: string
- stoi: float64
- si-sdr: float64
- pesq: float64
- gender: string
- utterance_pitch_std: float64
- utterance_pitch_mean: float64
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- sdr_noise: string
- pesq_speech_quality: string
Splits
- dev.other: 3297275 bytes, 5120 examples listed for test.other aside, 4613 examples
- test.other: 3580934 bytes, 5120 examples
- train.other.500: 150846841 bytes, 205035 examples
Download and dataset size
- Download size: 57366901 bytes
- Dataset size: 157725050 bytes
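The per-split byte counts listed for each config add up exactly to that config's stated dataset size. The short sketch below reproduces that arithmetic and also totals the example counts. The `CARD` table and the `totals` function are hypothetical names introduced here; every number is copied from the listing above.

```python
# (num_bytes, num_examples) per split, copied from the card.
CARD = {
    "clean": {
        "splits": {
            "dev.clean": (4282623, 5736),
            "test.clean": (3776641, 4837),
            "train.clean.100": (25054458, 33232),
            "train.clean.360": (88786172, 116426),
        },
        "dataset_size": 121899894,
    },
    "other": {
        "splits": {
            "dev.other": (3297275, 4613),
            "test.other": (3580934, 5120),
            "train.other.500": (150846841, 205035),
        },
        "dataset_size": 157725050,
    },
}


def totals(config: str) -> tuple[int, int]:
    """Return (total bytes, total examples) summed over a config's splits."""
    splits = CARD[config]["splits"].values()
    return sum(b for b, _ in splits), sum(n for _, n in splits)


for name, info in CARD.items():
    total_bytes, total_examples = totals(name)
    # The summed split sizes should equal the stated dataset size.
    assert total_bytes == info["dataset_size"]
    print(f"{name}: {total_bytes} bytes, {total_examples} examples")
```

Note that the download sizes (45389304 and 57366901 bytes) are smaller than the dataset sizes, so they are not expected to match these sums.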



