ylacombe/libritts-r-descriptions-10k-v5
Source: Hugging Face. Updated 2024-06-10; indexed 2024-06-29.
Download link:
https://hf-mirror.com/datasets/ylacombe/libritts-r-descriptions-10k-v5
Resource description:
---
dataset_info:
- config_name: clean
  features:
  - name: text
    dtype: string
  - name: text_original
    dtype: string
  - name: speaker_id
    dtype: string
  - name: path
    dtype: string
  - name: chapter_id
    dtype: string
  - name: id
    dtype: string
  - name: snr
    dtype: float32
  - name: c50
    dtype: float32
  - name: speech_duration
    dtype: float64
  - name: speaking_rate
    dtype: string
  - name: phonemes
    dtype: string
  - name: stoi
    dtype: float64
  - name: si-sdr
    dtype: float64
  - name: pesq
    dtype: float64
  - name: gender
    dtype: string
  - name: utterance_pitch_std
    dtype: float64
  - name: utterance_pitch_mean
    dtype: float64
  - name: pitch
    dtype: string
  - name: noise
    dtype: string
  - name: reverberation
    dtype: string
  - name: speech_monotony
    dtype: string
  - name: sdr_noise
    dtype: string
  - name: pesq_speech_quality
    dtype: string
  - name: accent
    dtype: string
  - name: text_description
    dtype: string
  splits:
  - name: dev.clean
    num_bytes: 5613890
    num_examples: 5736
  - name: test.clean
    num_bytes: 4938734
    num_examples: 4837
  - name: train.clean.100
    num_bytes: 32807815
    num_examples: 33232
  - name: train.clean.360
    num_bytes: 116058846
    num_examples: 116426
  download_size: 54877782
  dataset_size: 159419285
- config_name: other
  features:
  - name: text
    dtype: string
  - name: text_original
    dtype: string
  - name: speaker_id
    dtype: string
  - name: path
    dtype: string
  - name: chapter_id
    dtype: string
  - name: id
    dtype: string
  - name: snr
    dtype: float32
  - name: c50
    dtype: float32
  - name: speech_duration
    dtype: float64
  - name: speaking_rate
    dtype: string
  - name: phonemes
    dtype: string
  - name: stoi
    dtype: float64
  - name: si-sdr
    dtype: float64
  - name: pesq
    dtype: float64
  - name: gender
    dtype: string
  - name: utterance_pitch_std
    dtype: float64
  - name: utterance_pitch_mean
    dtype: float64
  - name: pitch
    dtype: string
  - name: noise
    dtype: string
  - name: reverberation
    dtype: string
  - name: speech_monotony
    dtype: string
  - name: sdr_noise
    dtype: string
  - name: pesq_speech_quality
    dtype: string
  - name: accent
    dtype: string
  - name: text_description
    dtype: string
  splits:
  - name: dev.other
    num_bytes: 4367066
    num_examples: 4613
  - name: test.other
    num_bytes: 4767598
    num_examples: 5120
  - name: train.other.500
    num_bytes: 198714600
    num_examples: 205035
  download_size: 70247426
  dataset_size: 207849264
configs:
- config_name: clean
  data_files:
  - split: dev.clean
    path: clean/dev.clean-*
  - split: test.clean
    path: clean/test.clean-*
  - split: train.clean.100
    path: clean/train.clean.100-*
  - split: train.clean.360
    path: clean/train.clean.360-*
- config_name: other
  data_files:
  - split: dev.other
    path: other/dev.other-*
  - split: test.other
    path: other/test.other-*
  - split: train.other.500
    path: other/train.other.500-*
---
Provided by:
ylacombe
Summary of original information
Dataset overview
Dataset configurations
Configuration name: clean
- Feature fields:
  - text: string
  - text_original: string
  - speaker_id: string
  - path: string
  - chapter_id: string
  - id: string
  - snr: float32
  - c50: float32
  - speech_duration: float64
  - speaking_rate: string
  - phonemes: string
  - stoi: float64
  - si-sdr: float64
  - pesq: float64
  - gender: string
  - utterance_pitch_std: float64
  - utterance_pitch_mean: float64
  - pitch: string
  - noise: string
  - reverberation: string
  - speech_monotony: string
  - sdr_noise: string
  - pesq_speech_quality: string
  - accent: string
  - text_description: string
- Data splits:
  - dev.clean: 5736 examples, 5613890 bytes
  - test.clean: 4837 examples, 4938734 bytes
  - train.clean.100: 33232 examples, 32807815 bytes
  - train.clean.360: 116426 examples, 116058846 bytes
- Download size: 54877782 bytes
- Dataset size: 159419285 bytes
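The feature list above mixes raw numeric measurements with string-valued fields. As a minimal sketch (the helper name `columns_by_dtype` is hypothetical and not part of the dataset or any library), the schema can be grouped by declared dtype to see which columns are numeric:

```python
# Feature name -> dtype, copied from the "clean" configuration listed above.
FEATURES = {
    "text": "string",
    "text_original": "string",
    "speaker_id": "string",
    "path": "string",
    "chapter_id": "string",
    "id": "string",
    "snr": "float32",
    "c50": "float32",
    "speech_duration": "float64",
    "speaking_rate": "string",
    "phonemes": "string",
    "stoi": "float64",
    "si-sdr": "float64",
    "pesq": "float64",
    "gender": "string",
    "utterance_pitch_std": "float64",
    "utterance_pitch_mean": "float64",
    "pitch": "string",
    "noise": "string",
    "reverberation": "string",
    "speech_monotony": "string",
    "sdr_noise": "string",
    "pesq_speech_quality": "string",
    "accent": "string",
    "text_description": "string",
}


def columns_by_dtype(features):
    """Group column names by declared dtype, preserving listing order."""
    groups = {}
    for name, dtype in features.items():
        groups.setdefault(dtype, []).append(name)
    return groups


groups = columns_by_dtype(FEATURES)
numeric_columns = groups["float32"] + groups["float64"]
print("numeric columns:", numeric_columns)
print("string columns:", groups["string"])
```

Note that some names appear in both forms: `pesq` is a float64 column while `pesq_speech_quality` is a string column, and `utterance_pitch_mean` is numeric while `pitch` is a string.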
Configuration name: other
- Feature fields:
  - text: string
  - text_original: string
  - speaker_id: string
  - path: string
  - chapter_id: string
  - id: string
  - snr: float32
  - c50: float32
  - speech_duration: float64
  - speaking_rate: string
  - phonemes: string
  - stoi: float64
  - si-sdr: float64
  - pesq: float64
  - gender: string
  - utterance_pitch_std: float64
  - utterance_pitch_mean: float64
  - pitch: string
  - noise: string
  - reverberation: string
  - speech_monotony: string
  - sdr_noise: string
  - pesq_speech_quality: string
  - accent: string
  - text_description: string
- Data splits:
  - dev.other: 4613 examples, 4367066 bytes
  - test.other: 5120 examples, 4767598 bytes
  - train.other.500: 205035 examples, 198714600 bytes
- Download size: 70247426 bytes
- Dataset size: 207849264 bytes
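The split figures for both configurations can be cross-checked against the reported dataset sizes. The sketch below copies the numbers listed above into a plain dictionary (the names `CONFIGS` and `totals` are illustrative only) and sums examples and bytes per configuration; the per-split byte counts add up to the reported dataset size in both cases.

```python
# (num_examples, num_bytes) per split, copied from the listing above.
CONFIGS = {
    "clean": {
        "splits": {
            "dev.clean": (5736, 5613890),
            "test.clean": (4837, 4938734),
            "train.clean.100": (33232, 32807815),
            "train.clean.360": (116426, 116058846),
        },
        "download_size": 54877782,
        "dataset_size": 159419285,
    },
    "other": {
        "splits": {
            "dev.other": (4613, 4367066),
            "test.other": (5120, 4767598),
            "train.other.500": (205035, 198714600),
        },
        "download_size": 70247426,
        "dataset_size": 207849264,
    },
}


def totals(config_name):
    """Return (total examples, total bytes) summed over a configuration's splits."""
    splits = CONFIGS[config_name]["splits"].values()
    examples = sum(n for n, _ in splits)
    size = sum(b for _, b in splits)
    return examples, size


for name, info in CONFIGS.items():
    examples, size = totals(name)
    matches = size == info["dataset_size"]
    print(f"{name}: {examples} examples, {size} bytes, matches dataset_size: {matches}")
```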
Data file paths
Configuration name: clean
- dev.clean: clean/dev.clean-*
- test.clean: clean/test.clean-*
- train.clean.100: clean/train.clean.100-*
- train.clean.360: clean/train.clean.360-*
Configuration name: other
- dev.other: other/dev.other-*
- test.other: other/test.other-*
- train.other.500: other/train.other.500-*
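The paths above follow a `<config>/<split>-*` pattern, and each split is addressed by configuration name plus split name. A minimal loading sketch follows, assuming the Hugging Face `datasets` library is installed and the repository is still reachable on the Hub under the ID shown in the title; the helpers `shard_pattern` and `load_split` are hypothetical names introduced here for illustration.

```python
REPO_ID = "ylacombe/libritts-r-descriptions-10k-v5"

# Split names per configuration, as listed above.
SPLITS = {
    "clean": ["dev.clean", "test.clean", "train.clean.100", "train.clean.360"],
    "other": ["dev.other", "test.other", "train.other.500"],
}


def shard_pattern(config, split):
    """Rebuild the data-file glob for a split, following the listed pattern."""
    if split not in SPLITS.get(config, []):
        raise ValueError(f"unknown split {split!r} for config {config!r}")
    return f"{config}/{split}-*"


def load_split(config, split, streaming=True):
    """Load one split; streaming avoids downloading the whole configuration.

    Assumption: requires the `datasets` package and network access.
    """
    from datasets import load_dataset  # imported lazily so the helpers above work offline

    shard_pattern(config, split)  # validates the config/split pair
    return load_dataset(REPO_ID, config, split=split, streaming=streaming)


# Example use (not executed here, since it needs network access):
#   ds = load_split("clean", "dev.clean")
#   first = next(iter(ds))
#   print(first["text"], first["text_description"])
```

Streaming is chosen as the default in this sketch because the training splits are far larger than the dev and test splits; pass `streaming=False` to materialize a split locally.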



