ylacombe/libritts-r-descriptions-10k-v5-without-accents
收藏Hugging Face2024-06-12 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/ylacombe/libritts-r-descriptions-10k-v5-without-accents
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: clean
features:
- name: text
dtype: string
- name: text_original
dtype: string
- name: speaker_id
dtype: string
- name: path
dtype: string
- name: chapter_id
dtype: string
- name: id
dtype: string
- name: snr
dtype: float32
- name: c50
dtype: float32
- name: speech_duration
dtype: float64
- name: speaking_rate
dtype: string
- name: phonemes
dtype: string
- name: stoi
dtype: float64
- name: si-sdr
dtype: float64
- name: pesq
dtype: float64
- name: gender
dtype: string
- name: utterance_pitch_std
dtype: float64
- name: utterance_pitch_mean
dtype: float64
- name: pitch
dtype: string
- name: noise
dtype: string
- name: reverberation
dtype: string
- name: speech_monotony
dtype: string
- name: sdr_noise
dtype: string
- name: pesq_speech_quality
dtype: string
- name: accent
dtype: string
- name: text_description
dtype: string
splits:
- name: dev.clean
num_bytes: 5524563
num_examples: 5736
- name: test.clean
num_bytes: 4860013
num_examples: 4837
- name: train.clean.100
num_bytes: 32301788
num_examples: 33232
- name: train.clean.360
num_bytes: 114287409
num_examples: 116426
download_size: 55023682
dataset_size: 156973773
- config_name: other
features:
- name: text
dtype: string
- name: text_original
dtype: string
- name: speaker_id
dtype: string
- name: path
dtype: string
- name: chapter_id
dtype: string
- name: id
dtype: string
- name: snr
dtype: float32
- name: c50
dtype: float32
- name: speech_duration
dtype: float64
- name: speaking_rate
dtype: string
- name: phonemes
dtype: string
- name: stoi
dtype: float64
- name: si-sdr
dtype: float64
- name: pesq
dtype: float64
- name: gender
dtype: string
- name: utterance_pitch_std
dtype: float64
- name: utterance_pitch_mean
dtype: float64
- name: pitch
dtype: string
- name: noise
dtype: string
- name: reverberation
dtype: string
- name: speech_monotony
dtype: string
- name: sdr_noise
dtype: string
- name: pesq_speech_quality
dtype: string
- name: accent
dtype: string
- name: text_description
dtype: string
splits:
- name: dev.other
num_bytes: 4311855
num_examples: 4613
- name: test.other
num_bytes: 4706703
num_examples: 5120
- name: train.other.500
num_bytes: 195931689
num_examples: 205035
download_size: 70342709
dataset_size: 204950247
configs:
- config_name: clean
data_files:
- split: dev.clean
path: clean/dev.clean-*
- split: test.clean
path: clean/test.clean-*
- split: train.clean.100
path: clean/train.clean.100-*
- split: train.clean.360
path: clean/train.clean.360-*
- config_name: other
data_files:
- split: dev.other
path: other/dev.other-*
- split: test.other
path: other/test.other-*
- split: train.other.500
path: other/train.other.500-*
---
数据集信息:
- 配置名称: clean
特征:
- 名称: text
数据类型: 字符串
- 名称: text_original
数据类型: 字符串
- 名称: speaker_id
数据类型: 字符串
- 名称: path
数据类型: 字符串
- 名称: chapter_id
数据类型: 字符串
- 名称: id
数据类型: 字符串
- 名称: snr
数据类型: float32
- 名称: c50
数据类型: float32
- 名称: speech_duration
数据类型: float64
- 名称: speaking_rate
数据类型: 字符串
- 名称: phonemes
数据类型: 字符串
- 名称: stoi
数据类型: float64
- 名称: si-sdr
数据类型: float64
- 名称: pesq
数据类型: float64
- 名称: gender
数据类型: 字符串
- 名称: utterance_pitch_std
数据类型: float64
- 名称: utterance_pitch_mean
数据类型: float64
- 名称: pitch
数据类型: 字符串
- 名称: noise
数据类型: 字符串
- 名称: reverberation
数据类型: 字符串
- 名称: speech_monotony
数据类型: 字符串
- 名称: sdr_noise
数据类型: 字符串
- 名称: pesq_speech_quality
数据类型: 字符串
- 名称: accent
数据类型: 字符串
- 名称: text_description
数据类型: 字符串
拆分集:
- 名称: dev.clean
字节数: 5524563
样本数: 5736
- 名称: test.clean
字节数: 4860013
样本数: 4837
- 名称: train.clean.100
字节数: 32301788
样本数: 33232
- 名称: train.clean.360
字节数: 114287409
样本数: 116426
下载大小: 55023682
数据集大小: 156973773
- 配置名称: other
特征:
- 名称: text
数据类型: 字符串
- 名称: text_original
数据类型: 字符串
- 名称: speaker_id
数据类型: 字符串
- 名称: path
数据类型: 字符串
- 名称: chapter_id
数据类型: 字符串
- 名称: id
数据类型: 字符串
- 名称: snr
数据类型: float32
- 名称: c50
数据类型: float32
- 名称: speech_duration
数据类型: float64
- 名称: speaking_rate
数据类型: 字符串
- 名称: phonemes
数据类型: 字符串
- 名称: stoi
数据类型: float64
- 名称: si-sdr
数据类型: float64
- 名称: pesq
数据类型: float64
- 名称: gender
数据类型: 字符串
- 名称: utterance_pitch_std
数据类型: float64
- 名称: utterance_pitch_mean
数据类型: float64
- 名称: pitch
数据类型: 字符串
- 名称: noise
数据类型: 字符串
- 名称: reverberation
数据类型: 字符串
- 名称: speech_monotony
数据类型: 字符串
- 名称: sdr_noise
数据类型: 字符串
- 名称: pesq_speech_quality
数据类型: 字符串
- 名称: accent
数据类型: 字符串
- 名称: text_description
数据类型: 字符串
拆分集:
- 名称: dev.other
字节数: 4311855
样本数: 4613
- 名称: test.other
字节数: 4706703
样本数: 5120
- 名称: train.other.500
字节数: 195931689
样本数: 205035
下载大小: 70342709
数据集大小: 204950247
配置:
- 配置名称: clean
数据文件:
- 拆分: dev.clean
路径: clean/dev.clean-*
- 拆分: test.clean
路径: clean/test.clean-*
- 拆分: train.clean.100
路径: clean/train.clean.100-*
- 拆分: train.clean.360
路径: clean/train.clean.360-*
- 配置名称: other
数据文件:
- 拆分: dev.other
路径: other/dev.other-*
- 拆分: test.other
路径: other/test.other-*
- 拆分: train.other.500
路径: other/train.other.500-*
提供机构:
ylacombe
原始信息汇总
数据集概述
配置信息
配置名称:clean
-
特征列表
- text: string
- text_original: string
- speaker_id: string
- path: string
- chapter_id: string
- id: string
- snr: float32
- c50: float32
- speech_duration: float64
- speaking_rate: string
- phonemes: string
- stoi: float64
- si-sdr: float64
- pesq: float64
- gender: string
- utterance_pitch_std: float64
- utterance_pitch_mean: float64
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- sdr_noise: string
- pesq_speech_quality: string
- accent: string
- text_description: string
-
数据分割
- dev.clean: 5736个样本, 5524563字节
- test.clean: 4837个样本, 4860013字节
- train.clean.100: 33232个样本, 32301788字节
- train.clean.360: 116426个样本, 114287409字节
-
下载大小: 55023682字节
-
数据集大小: 156973773字节
配置名称:other
-
特征列表
- text: string
- text_original: string
- speaker_id: string
- path: string
- chapter_id: string
- id: string
- snr: float32
- c50: float32
- speech_duration: float64
- speaking_rate: string
- phonemes: string
- stoi: float64
- si-sdr: float64
- pesq: float64
- gender: string
- utterance_pitch_std: float64
- utterance_pitch_mean: float64
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- sdr_noise: string
- pesq_speech_quality: string
- accent: string
- text_description: string
-
数据分割
- dev.other: 4613个样本, 4311855字节
- test.other: 5120个样本, 4706703字节
- train.other.500: 205035个样本, 195931689字节
-
下载大小: 70342709字节
-
数据集大小: 204950247字节
数据文件配置
配置名称:clean
- 数据文件路径
- dev.clean: clean/dev.clean-*
- test.clean: clean/test.clean-*
- train.clean.100: clean/train.clean.100-*
- train.clean.360: clean/train.clean.360-*
配置名称:other
- 数据文件路径
- dev.other: other/dev.other-*
- test.other: other/test.other-*
- train.other.500: other/train.other.500-*



