westbrook/gigaspeech-tiny-0-train
Source: Hugging Face. Updated 2024-07-21; indexed 2024-07-22.
Download link:
https://hf-mirror.com/datasets/westbrook/gigaspeech-tiny-0-train
Overview:
The dataset provides features for speech and text analysis. Each example includes a segment ID, speaker, text, audio (sampling rate 16000 Hz), begin and end times, audio ID, title, URL, audio source (audiobook, podcast, or YouTube), category (e.g., People and Blogs, Business, Nonprofits and Activism), and the original full path. It also carries signal and quality measures (utterance pitch mean, utterance pitch standard deviation, SNR, C50, speaking rate, phonemes, STOI, SI-SDR, PESQ), speaker and style attributes (age, accent, brightness, emotion, gender, smoothness, pitch, noise, reverberation, speech monotony), and five text descriptions. The dataset has a single training split containing 2 examples, 146510 bytes in size.
Provider:
westbrook
Original information summary
Dataset overview
Dataset features
- segment_id: string
- speaker: string
- text: string
- audio: audio, sampling rate 16000 Hz
- begin_time: float
- end_time: float
- audio_id: string
- title: string
- url: string
- source: class label, with the following classes:
  - 0: audiobook
  - 1: podcast
  - 2: youtube
- category: class label, with the following classes:
  - 0: People and Blogs
  - 1: Business
  - 2: Nonprofits and Activism
  - 3: Crime
  - 4: History
  - 5: Pets and Animals
  - 6: News and Politics
  - 7: Travel and Events
  - 8: Kids and Family
  - 9: Leisure
  - 10: N/A
  - 11: Comedy
  - 12: News and Politics
  - 13: Sports
  - 14: Arts
  - 15: Science and Technology
  - 16: Autos and Vehicles
  - 17: Science and Technology
  - 18: People and Blogs
  - 19: Music
  - 20: Society and Culture
  - 21: Education
  - 22: Howto and Style
  - 23: Film and Animation
  - 24: Gaming
  - 25: Entertainment
  - 26: Travel and Events
  - 27: Health and Fitness
  - 28: audiobook
- original_full_path: string
- utterance_pitch_mean: float
- utterance_pitch_std: float
- snr: float
- c50: float
- speaking_rate: string
- phonemes: string
- stoi: float
- si-sdr: float
- pesq: float
- age_ori: string
- age_value: float
- age: string
- accent_ori: string
- accent_value: float
- accent: string
- brightness_ori: string
- brightness_value: float
- brightness: string
- emotion_ori: string
- emotion_value: float
- emotion: string
- gender_ori: string
- gender_value: float
- gender: string
- smoothness_ori: string
- smoothness_value: float
- smoothness: string
- pitch: string
- noise: string
- reverberation: string
- speech_monotony: string
- text_description1: string
- text_description2: string
- text_description3: string
- text_description4: string
- text_description5: string
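The `source` and `category` features are stored as integer class labels, and several category names appear under more than one id. The sketch below shows one way to decode those ids and to derive a segment duration from `begin_time` and `end_time`, using only the standard library. The helper names and the sample record are hypothetical illustrations, not values from the dataset; with the Hugging Face `datasets` library, the `ClassLabel` feature offers `int2str` for the same decoding.

```python
# Class names copied from the feature list; list index == integer label id.
SOURCE_NAMES = ["audiobook", "podcast", "youtube"]
CATEGORY_NAMES = [
    "People and Blogs", "Business", "Nonprofits and Activism", "Crime",
    "History", "Pets and Animals", "News and Politics", "Travel and Events",
    "Kids and Family", "Leisure", "N/A", "Comedy", "News and Politics",
    "Sports", "Arts", "Science and Technology", "Autos and Vehicles",
    "Science and Technology", "People and Blogs", "Music",
    "Society and Culture", "Education", "Howto and Style",
    "Film and Animation", "Gaming", "Entertainment", "Travel and Events",
    "Health and Fitness", "audiobook",
]


def decode_label(names, label_id):
    """Return the class name for an integer label id (hypothetical helper)."""
    if not 0 <= label_id < len(names):
        raise ValueError(f"label id {label_id} is out of range")
    return names[label_id]


def ids_for_name(names, name):
    """Return every id that maps to `name`; some names occur more than once."""
    return [i for i, n in enumerate(names) if n == name]


def segment_duration(begin_time, end_time):
    """Duration of a segment, in the same unit as begin_time and end_time."""
    if end_time < begin_time:
        raise ValueError("end_time must not precede begin_time")
    return end_time - begin_time


# Hypothetical record with made-up values, for illustration only.
record = {"source": 1, "category": 12, "begin_time": 12.5, "end_time": 15.0}

print(decode_label(SOURCE_NAMES, record["source"]))
print(decode_label(CATEGORY_NAMES, record["category"]))
print(ids_for_name(CATEGORY_NAMES, "News and Politics"))
print(segment_duration(record["begin_time"], record["end_time"]))
```

Because names repeat, mapping a name back to a single id is ambiguous; code that groups by category may want to group by the decoded name rather than by the raw id.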
Dataset splits
- train: 2 examples, 146510 bytes
Dataset size
- Download size: 176947 bytes
- Dataset size: 146510 bytes
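As a quick sanity check on the figures above, the following sketch computes the average size per example and the ratio of download size to dataset size. The constants are the numbers listed on this page; the helper names are hypothetical.

```python
# Figures taken from the split and size listings on this page.
NUM_EXAMPLES = 2
DATASET_SIZE_BYTES = 146510
DOWNLOAD_SIZE_BYTES = 176947


def bytes_per_example(total_bytes, num_examples):
    """Average number of bytes per example (hypothetical helper)."""
    if num_examples <= 0:
        raise ValueError("num_examples must be positive")
    return total_bytes / num_examples


def to_kib(num_bytes):
    """Convert bytes to kibibytes (1 KiB = 1024 bytes)."""
    return num_bytes / 1024


avg_bytes = bytes_per_example(DATASET_SIZE_BYTES, NUM_EXAMPLES)
ratio = DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES

print(f"average per example: {avg_bytes:.0f} bytes ({to_kib(avg_bytes):.1f} KiB)")
print(f"download / dataset size: {ratio:.3f}")
```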
Configuration
- config_name: default
- data_files:
  - split: train
    path: data/train-*
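The configuration above describes a single `default` config with one `train` split. A minimal sketch of loading it with the Hugging Face `datasets` library follows. It assumes `datasets` is installed and the Hub (or a mirror) is reachable; the wrapper function `load_train_split` is a hypothetical convenience, not part of the dataset or the library.

```python
REPO_ID = "westbrook/gigaspeech-tiny-0-train"
CONFIG_NAME = "default"
SPLIT = "train"


def load_train_split(repo_id=REPO_ID):
    """Load the train split of the default config.

    Assumes `pip install datasets` and network access. The import is done
    lazily so that this module can be imported without the library.
    """
    from datasets import load_dataset

    return load_dataset(repo_id, name=CONFIG_NAME, split=SPLIT)


# Example usage (requires network access, so it is left commented out):
# ds = load_train_split()
# print(ds.num_rows)                      # number of examples in the split
# print(ds.features["source"].names)      # class names of the `source` label
# print(ds[0]["text"])                    # transcript of the first example
```

Reading the `audio` column additionally requires an audio decoding backend to be installed; text and label columns can be read without one.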



