ylacombe/libritts-r-descriptions-10k-v4
Source: Hugging Face. Updated 2024-06-04; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/ylacombe/libritts-r-descriptions-10k-v4
Resource description:
---
dataset_info:
- config_name: clean
  features:
  - name: text
    dtype: string
  - name: text_original
    dtype: string
  - name: speaker_id
    dtype: string
  - name: path
    dtype: string
  - name: chapter_id
    dtype: string
  - name: id
    dtype: string
  - name: snr
    dtype: float32
  - name: c50
    dtype: float32
  - name: speech_duration
    dtype: float64
  - name: speaking_rate
    dtype: string
  - name: phonemes
    dtype: string
  - name: stoi
    dtype: float64
  - name: si-sdr
    dtype: float64
  - name: pesq
    dtype: float64
  - name: gender
    dtype: string
  - name: utterance_pitch_std
    dtype: float64
  - name: utterance_pitch_mean
    dtype: float64
  - name: pitch
    dtype: string
  - name: noise
    dtype: string
  - name: reverberation
    dtype: string
  - name: speech_monotony
    dtype: string
  - name: sdr_noise
    dtype: string
  - name: pesq_speech_quality
    dtype: string
  - name: text_description
    dtype: string
  splits:
  - name: dev.clean
    num_bytes: 5417004
    num_examples: 5736
  - name: test.clean
    num_bytes: 4774246
    num_examples: 4837
  - name: train.clean.100
    num_bytes: 31732872
    num_examples: 33232
  - name: train.clean.360
    num_bytes: 112272730
    num_examples: 116426
  download_size: 56116938
  dataset_size: 154196852
- config_name: other
  features:
  - name: text
    dtype: string
  - name: text_original
    dtype: string
  - name: speaker_id
    dtype: string
  - name: path
    dtype: string
  - name: chapter_id
    dtype: string
  - name: id
    dtype: string
  - name: snr
    dtype: float32
  - name: c50
    dtype: float32
  - name: speech_duration
    dtype: float64
  - name: speaking_rate
    dtype: string
  - name: phonemes
    dtype: string
  - name: stoi
    dtype: float64
  - name: si-sdr
    dtype: float64
  - name: pesq
    dtype: float64
  - name: gender
    dtype: string
  - name: utterance_pitch_std
    dtype: float64
  - name: utterance_pitch_mean
    dtype: float64
  - name: pitch
    dtype: string
  - name: noise
    dtype: string
  - name: reverberation
    dtype: string
  - name: speech_monotony
    dtype: string
  - name: sdr_noise
    dtype: string
  - name: pesq_speech_quality
    dtype: string
  - name: text_description
    dtype: string
  splits:
  - name: dev.other
    num_bytes: 4225305
    num_examples: 4613
  - name: test.other
    num_bytes: 4615700
    num_examples: 5120
  - name: train.other.500
    num_bytes: 192550333
    num_examples: 205035
  download_size: 71829224
  dataset_size: 201391338
configs:
- config_name: clean
  data_files:
  - split: dev.clean
    path: clean/dev.clean-*
  - split: test.clean
    path: clean/test.clean-*
  - split: train.clean.100
    path: clean/train.clean.100-*
  - split: train.clean.360
    path: clean/train.clean.360-*
- config_name: other
  data_files:
  - split: dev.other
    path: other/dev.other-*
  - split: test.other
    path: other/test.other-*
  - split: train.other.500
    path: other/train.other.500-*
---
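The configuration and split names in the metadata above are what a loader would need. The following is a minimal sketch, assuming the repository is still reachable on the Hugging Face Hub under the same name and that the `datasets` library is installed. The helper names `shard_pattern` and `load_split` are hypothetical and exist only for this illustration.

```python
# Sketch only: REPO_ID and the split names are copied from the card metadata;
# the helper functions are illustrative and not part of any library.
REPO_ID = "ylacombe/libritts-r-descriptions-10k-v4"

SPLITS = {
    "clean": ["dev.clean", "test.clean", "train.clean.100", "train.clean.360"],
    "other": ["dev.other", "test.other", "train.other.500"],
}


def shard_pattern(config: str, split: str) -> str:
    """Return the data_files glob the card declares for a config/split pair."""
    if split not in SPLITS.get(config, []):
        raise ValueError(f"unknown config/split pair: {config!r} / {split!r}")
    return f"{config}/{split}-*"


def load_split(config: str, split: str):
    """Load one split from the Hub (needs network access and `datasets`)."""
    shard_pattern(config, split)  # validate the pair before any download
    from datasets import load_dataset  # deferred so the helpers above work offline

    return load_dataset(REPO_ID, config, split=split)


# Example (downloads data, so it is left commented out):
# ds = load_split("clean", "dev.clean")
# print(ds.column_names)
```

Validating the pair first avoids a network round trip for a mistyped name such as `dev.other` under the `clean` configuration.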
Provider:
ylacombe
Summary of original information
Dataset overview
Configuration name: clean
- Features:
  - text: string
  - text_original: string
  - speaker_id: string
  - path: string
  - chapter_id: string
  - id: string
  - snr: float (32-bit)
  - c50: float (32-bit)
  - speech_duration: float (64-bit)
  - speaking_rate: string
  - phonemes: string
  - stoi: float (64-bit)
  - si-sdr: float (64-bit)
  - pesq: float (64-bit)
  - gender: string
  - utterance_pitch_std: float (64-bit)
  - utterance_pitch_mean: float (64-bit)
  - pitch: string
  - noise: string
  - reverberation: string
  - speech_monotony: string
  - sdr_noise: string
  - pesq_speech_quality: string
  - text_description: string
- Splits:
  - dev.clean: 5736 examples, 5417004 bytes
  - test.clean: 4837 examples, 4774246 bytes
  - train.clean.100: 33232 examples, 31732872 bytes
  - train.clean.360: 116426 examples, 112272730 bytes
- Download size: 56116938 bytes
- Dataset size: 154196852 bytes
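The feature list is the same for both configurations in the metadata: 24 columns, of which 8 are floating point and 16 are strings. As a rough sketch, the schema can be written down as a plain dictionary and used to sanity-check a record. The function names below are hypothetical, and treating a `None` value as a problem is an assumption of this sketch, not something the card specifies.

```python
# Column names and dtypes copied from the card metadata (shared by both configs).
FEATURES = {
    "text": "string",
    "text_original": "string",
    "speaker_id": "string",
    "path": "string",
    "chapter_id": "string",
    "id": "string",
    "snr": "float32",
    "c50": "float32",
    "speech_duration": "float64",
    "speaking_rate": "string",
    "phonemes": "string",
    "stoi": "float64",
    "si-sdr": "float64",
    "pesq": "float64",
    "gender": "string",
    "utterance_pitch_std": "float64",
    "utterance_pitch_mean": "float64",
    "pitch": "string",
    "noise": "string",
    "reverberation": "string",
    "speech_monotony": "string",
    "sdr_noise": "string",
    "pesq_speech_quality": "string",
    "text_description": "string",
}


def numeric_columns() -> list:
    """Names of the float32/float64 columns, in declaration order."""
    return [name for name, dtype in FEATURES.items() if dtype.startswith("float")]


def check_record(record: dict) -> list:
    """Return human-readable problems; an empty list means the record fits.

    Assumption of this sketch: None (or any non-matching type) is a problem.
    """
    problems = []
    for name, dtype in FEATURES.items():
        if name not in record:
            problems.append(f"missing column: {name}")
            continue
        value = record[name]
        if dtype == "string":
            ok = isinstance(value, str)
        else:
            # Accept ints as numeric, but reject bool (a subclass of int).
            ok = isinstance(value, (int, float)) and not isinstance(value, bool)
        if not ok:
            problems.append(f"{name}: expected {dtype}, got {type(value).__name__}")
    return problems
```

Note that several columns with numeric-sounding names, such as `speaking_rate`, `pitch`, and `noise`, are declared as strings, so they should not be passed to numeric filters without inspection.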
Configuration name: other
- Features:
  - text: string
  - text_original: string
  - speaker_id: string
  - path: string
  - chapter_id: string
  - id: string
  - snr: float (32-bit)
  - c50: float (32-bit)
  - speech_duration: float (64-bit)
  - speaking_rate: string
  - phonemes: string
  - stoi: float (64-bit)
  - si-sdr: float (64-bit)
  - pesq: float (64-bit)
  - gender: string
  - utterance_pitch_std: float (64-bit)
  - utterance_pitch_mean: float (64-bit)
  - pitch: string
  - noise: string
  - reverberation: string
  - speech_monotony: string
  - sdr_noise: string
  - pesq_speech_quality: string
  - text_description: string
- Splits:
  - dev.other: 4613 examples, 4225305 bytes
  - test.other: 5120 examples, 4615700 bytes
  - train.other.500: 205035 examples, 192550333 bytes
- Download size: 71829224 bytes
- Dataset size: 201391338 bytes
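The per-split byte counts can be cross-checked against the reported dataset sizes with simple arithmetic. The sketch below uses only the numbers listed in the metadata; the `CONFIGS` dictionary and the `summarize` helper are illustrative names, not part of the dataset.

```python
# All figures are copied from the card metadata: split -> (num_bytes, num_examples).
CONFIGS = {
    "clean": {
        "splits": {
            "dev.clean": (5417004, 5736),
            "test.clean": (4774246, 4837),
            "train.clean.100": (31732872, 33232),
            "train.clean.360": (112272730, 116426),
        },
        "download_size": 56116938,
        "dataset_size": 154196852,
    },
    "other": {
        "splits": {
            "dev.other": (4225305, 4613),
            "test.other": (4615700, 5120),
            "train.other.500": (192550333, 205035),
        },
        "download_size": 71829224,
        "dataset_size": 201391338,
    },
}


def summarize(config: str) -> tuple:
    """Return (total_bytes, total_examples) summed over a config's splits."""
    splits = CONFIGS[config]["splits"].values()
    total_bytes = sum(num_bytes for num_bytes, _ in splits)
    total_examples = sum(num_examples for _, num_examples in splits)
    return total_bytes, total_examples


for name, info in CONFIGS.items():
    total_bytes, total_examples = summarize(name)
    # The split sizes should add up to the reported dataset_size.
    matches = total_bytes == info["dataset_size"]
    ratio = info["download_size"] / info["dataset_size"]
    print(f"{name}: {total_examples} examples, {total_bytes} bytes, "
          f"sizes consistent: {matches}, download/dataset ratio: {ratio:.2f}")
```

For both configurations the split sizes add up exactly to the reported dataset size, giving 160231 examples in `clean` and 214768 in `other`. The download size is smaller than the dataset size in both cases, which is consistent with the files being stored in a compressed form.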



