ittailup/la-speech-and-text-generated-country
Source: Hugging Face. Updated 2024-05-26; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/ittailup/la-speech-and-text-generated-country
Resource description:
---
dataset_info:
  features:
  - name: speaker_id
    dtype: string
  - name: gender
    dtype: string
  - name: duration
    dtype: float32
  - name: text
    dtype: string
  - name: country
    dtype: string
  - name: utterance_pitch_mean
    dtype: float32
  - name: utterance_pitch_std
    dtype: float32
  - name: snr
    dtype: float64
  - name: c50
    dtype: float64
  - name: speaking_rate
    dtype: string
  - name: phonemes
    dtype: string
  - name: pitch
    dtype: string
  - name: noise
    dtype: string
  - name: reverberation
    dtype: string
  - name: speech_monotony
    dtype: string
  - name: text_description
    dtype: string
  - name: index
    dtype: int64
  splits:
  - name: test
    num_bytes: 170736
    num_examples: 300
  - name: train
    num_bytes: 41244520
    num_examples: 72146
  download_size: 12337600
  dataset_size: 41415256
configs:
- config_name: default
  data_files:
  - split: test
    path: data/test-*
  - split: train
    path: data/train-*
---
Each record carries 17 fields: speaker ID, gender, speech duration, text content, speaker country, mean and standard deviation of utterance pitch, signal-to-noise ratio (SNR), C50 (a room-acoustics clarity index), speaking rate, phonemes, pitch, noise, reverberation, speech monotony, a text description, and a sample index. The data is split into a test set of 300 examples and a training set of 72,146 examples. The download size is 12,337,600 bytes (about 12.3 MB), and the total dataset size is 41,415,256 bytes (about 41.4 MB).
Provider:
ittailup
Summary of original information
Dataset overview
Dataset features
- speaker_id: string
- gender: string
- duration: float (float32)
- text: string
- country: string
- utterance_pitch_mean: float (float32)
- utterance_pitch_std: float (float32)
- snr: float (float64)
- c50: float (float64)
- speaking_rate: string
- phonemes: string
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- text_description: string
- index: integer (int64)
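The feature list above can be turned into a small schema check. The sketch below is illustrative only: the field names and dtypes are copied from the dataset card, while the mapping from dtypes to Python types and the helper name `validate_record` are assumptions made for this example, not part of the dataset or of any library.

```python
# Field names and dtypes as listed on the dataset card.
SCHEMA = {
    "speaker_id": "string",
    "gender": "string",
    "duration": "float32",
    "text": "string",
    "country": "string",
    "utterance_pitch_mean": "float32",
    "utterance_pitch_std": "float32",
    "snr": "float64",
    "c50": "float64",
    "speaking_rate": "string",
    "phonemes": "string",
    "pitch": "string",
    "noise": "string",
    "reverberation": "string",
    "speech_monotony": "string",
    "text_description": "string",
    "index": "int64",
}

# Assumed mapping from card dtypes to plain Python types.
PY_TYPES = {"string": str, "float32": float, "float64": float, "int64": int}


def validate_record(record):
    """Hypothetical helper: return a list of problems (empty if the record fits)."""
    problems = []
    for name, dtype in SCHEMA.items():
        if name not in record:
            problems.append(f"missing field: {name}")
            continue
        value = record[name]
        expected = PY_TYPES[dtype]
        if isinstance(value, bool):
            ok = False  # bool is a subclass of int, so reject it explicitly
        elif expected is float:
            ok = isinstance(value, (int, float))  # ints are acceptable floats
        else:
            ok = isinstance(value, expected)
        if not ok:
            problems.append(f"{name}: expected {dtype}, got {type(value).__name__}")
    for name in record:
        if name not in SCHEMA:
            problems.append(f"unexpected field: {name}")
    return problems


# A placeholder record built from type defaults ("" / 0.0 / 0), not real data.
placeholder = {name: PY_TYPES[dtype]() for name, dtype in SCHEMA.items()}
print(validate_record(placeholder))
print(validate_record(dict(placeholder, snr="high")))
```

Note that several fields one might expect to be numeric (speaking_rate, pitch, noise, reverberation, speech_monotony) are declared as strings on the card, so the check treats them as text.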
Dataset splits
- test:
  - Size: 170736 bytes
  - Examples: 300
- train:
  - Size: 41244520 bytes
  - Examples: 72146
Dataset size
- Download size: 12337600 bytes
- Total dataset size: 41415256 bytes
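The split and size figures above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch using only the numbers from the card. Reading the gap between download size and total size as a compression ratio is an assumption about how the files are stored, not something the card states.

```python
# Numbers copied from the dataset card.
SPLITS = {
    "test": {"num_bytes": 170736, "num_examples": 300},
    "train": {"num_bytes": 41244520, "num_examples": 72146},
}
DOWNLOAD_SIZE = 12337600
DATASET_SIZE = 41415256

total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The per-split byte counts should add up to the reported dataset size.
print("split bytes sum to dataset_size:", total_bytes == DATASET_SIZE)

# Ratio of total size to download size (assumed to reflect on-disk compression).
size_ratio = DATASET_SIZE / DOWNLOAD_SIZE
print(f"dataset_size / download_size: {size_ratio:.2f}")

# Share of examples held out for testing, and average bytes per example.
test_fraction = SPLITS["test"]["num_examples"] / total_examples
print(f"test fraction: {test_fraction:.4%}")
for name, s in SPLITS.items():
    print(f"{name}: {s['num_bytes'] / s['num_examples']:.1f} bytes per example")
```

The test split holds well under one percent of the examples, so metrics computed on it will be noisier than the train-set size might suggest.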
Configuration
- config_name: default
- data_files:
  - test: path data/test-*
  - train: path data/train-*
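The `data_files` entries are glob patterns that assign files to splits. The sketch below illustrates the idea with Python's standard `fnmatch` module. The shard file names are hypothetical (the card does not list them), `split_for` is a helper invented for this example, and the Hugging Face tooling may resolve patterns with different rules.

```python
from fnmatch import fnmatchcase

# Patterns copied from the dataset card's default config.
DATA_FILES = {
    "test": "data/test-*",
    "train": "data/train-*",
}


def split_for(path):
    """Hypothetical helper: return the split whose pattern matches path, else None."""
    for split, pattern in DATA_FILES.items():
        if fnmatchcase(path, pattern):
            return split
    return None


# Hypothetical file names, used only to exercise the patterns.
for candidate in [
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]:
    print(candidate, "->", split_for(candidate))
```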



