ylacombe/libritts-r-descriptions-10k-v3

Name: ylacombe/libritts-r-descriptions-10k-v3
Creator: ylacombe
Published: 2024-05-09 23:19:00
License: not specified

Source: Hugging Face. Updated: 2024-05-09. Indexed: 2024-06-12.

Download link:

https://hf-mirror.com/datasets/ylacombe/libritts-r-descriptions-10k-v3

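Summary:

This dataset has two configurations, clean and other, which serve different speech-processing needs. Each configuration provides features such as the text content, speaker ID, path, and chapter ID, along with speech statistics such as speaking rate, phonemes, and signal-to-noise ratio. The data is split into development, test, and training sets, and the file size and number of examples are recorded for each split. The dataset is suited to research and development in speech recognition, speech analysis, and related areas.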
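As a minimal sketch of how one split might be loaded, the snippet below assumes the Hugging Face `datasets` library is installed and that the repository is still reachable under the ID shown in the title. The helper `load_split` and the `CONFIG_SPLITS` table are hypothetical names introduced for illustration; the split names are the ones listed further down this page.

```python
# Split names per configuration, copied from the split lists on this page.
CONFIG_SPLITS = {
    "clean": ["dev.clean", "test.clean", "train.clean.100", "train.clean.360"],
    "other": ["dev.other", "test.other", "train.other.500"],
}

REPO_ID = "ylacombe/libritts-r-descriptions-10k-v3"


def load_split(config: str, split: str):
    """Load one split of one configuration (hypothetical helper)."""
    if config not in CONFIG_SPLITS:
        raise ValueError(f"unknown configuration: {config!r}")
    if split not in CONFIG_SPLITS[config]:
        raise ValueError(
            f"split {split!r} is not part of configuration {config!r}"
        )
    # Imported lazily so the checks above work without the library installed.
    from datasets import load_dataset

    # Assumption: the repository loads with the standard loader.
    return load_dataset(REPO_ID, config, split=split)
```

A call such as `load_split("clean", "dev.clean")` would need network access, while a mismatched pair such as `load_split("clean", "dev.other")` fails early with a `ValueError`.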

Provided by:

ylacombe

Summary of original information

Dataset overview

Configuration: clean

Features:
- text: string
- text_original: string
- speaker_id: string
- path: string
- chapter_id: string
- id: string
- speaking_rate: string
- phonemes: string
- snr: float32
- c50: float32
- utterance_pitch_mean: float32
- utterance_pitch_std: float32
- gender: string
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- text_description: string
Splits:
- dev.clean: 5736 examples, 4991816 bytes
- test.clean: 4837 examples, 4374386 bytes
- train.clean.100: 33232 examples, 29119087 bytes
- train.clean.360: 116426 examples, 103093413 bytes
Download size: 50291079 bytes
Dataset size: 141578702 bytes

Configuration: other

Features:
- text: string
- text_original: string
- speaker_id: string
- path: string
- chapter_id: string
- id: string
- utterance_pitch_mean: float32
- utterance_pitch_std: float32
- snr: float64
- c50: float64
- speaking_rate: string
- phonemes: string
- gender: string
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- text_description: string
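The two feature lists are nearly identical, so the differences are easy to miss. The sketch below compares the two schemas as plain dictionaries transcribed from this page. `schema_diff` is a hypothetical helper, not part of any library.

```python
# Columns typed as string in both configurations, per the lists on this page.
STRING_COLUMNS = [
    "text", "text_original", "speaker_id", "path", "chapter_id", "id",
    "speaking_rate", "phonemes", "gender", "pitch", "noise",
    "reverberation", "speech_monotony", "text_description",
]

CLEAN_FEATURES = {name: "string" for name in STRING_COLUMNS}
CLEAN_FEATURES.update({
    "snr": "float32",
    "c50": "float32",
    "utterance_pitch_mean": "float32",
    "utterance_pitch_std": "float32",
})

OTHER_FEATURES = {name: "string" for name in STRING_COLUMNS}
OTHER_FEATURES.update({
    "snr": "float64",
    "c50": "float64",
    "utterance_pitch_mean": "float32",
    "utterance_pitch_std": "float32",
})


def schema_diff(a: dict, b: dict) -> dict:
    """Map each column whose dtype differs (or is missing) to (dtype_a, dtype_b)."""
    names = sorted(set(a) | set(b))
    return {n: (a.get(n), b.get(n)) for n in names if a.get(n) != b.get(n)}


DIFF = schema_diff(CLEAN_FEATURES, OTHER_FEATURES)
print(DIFF)
# {'c50': ('float32', 'float64'), 'snr': ('float32', 'float64')}
```

Only `snr` and `c50` differ: float32 in clean, float64 in other. Anyone combining the two configurations would likely need to cast these two columns to a common type first.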
Splits:
- dev.other: 4613 examples, 3879596 bytes
- test.other: 5120 examples, 4224226 bytes
- train.other.500: 205035 examples, 177273395 bytes
Download size: 64263284 bytes
Dataset size: 185377217 bytes
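The per-split figures can be checked against the reported dataset sizes with a little arithmetic. The sketch below uses only numbers transcribed from this page. The names `SPLITS`, `REPORTED_DATASET_SIZE`, and `totals` are hypothetical.

```python
# (examples, bytes) per split, copied from the split lists on this page.
SPLITS = {
    "clean": {
        "dev.clean": (5736, 4991816),
        "test.clean": (4837, 4374386),
        "train.clean.100": (33232, 29119087),
        "train.clean.360": (116426, 103093413),
    },
    "other": {
        "dev.other": (4613, 3879596),
        "test.other": (5120, 4224226),
        "train.other.500": (205035, 177273395),
    },
}

# "Dataset size" values reported for each configuration.
REPORTED_DATASET_SIZE = {"clean": 141578702, "other": 185377217}


def totals(config: str) -> tuple:
    """Return (total examples, total bytes) summed over a configuration's splits."""
    examples = sum(n for n, _ in SPLITS[config].values())
    size = sum(b for _, b in SPLITS[config].values())
    return examples, size


for config in SPLITS:
    examples, size = totals(config)
    print(config, examples, size, size == REPORTED_DATASET_SIZE[config])
# clean 160231 141578702 True
# other 214768 185377217 True
```

In both configurations the split sizes add up exactly to the reported dataset size. The download sizes are smaller than the dataset sizes, which presumably reflects compressed storage.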
