westbrook/gigaspeech-tiny-0-train

Name: westbrook/gigaspeech-tiny-0-train
Creator: westbrook
Published: 2024-07-21 05:33:52
License: No description available

Hugging Face | Updated 2024-07-21 | Indexed 2024-07-22

Download link:

https://hf-mirror.com/datasets/westbrook/gigaspeech-tiny-0-train

Resource overview:

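The dataset contains multiple features, such as segment_id, speaker, text, and audio, covering audio, text, timestamps, source, and category information. The feature descriptions are detailed and include the audio sampling rate, timestamps, source type, and category labels. It also contains features related to audio quality, such as signal-to-noise ratio and speech clarity. The dataset has only a training split, which is 146510 bytes in size and contains 2 examples.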

The dataset includes multiple features, primarily for speech and text analysis. Features include segment ID, speaker, text content, audio (sampling rate 16000), begin and end times, audio ID, title, URL, audio source (e.g., audiobook, podcast, YouTube), category (e.g., People and Blogs, Business, Nonprofits and Activism, etc.), original full path, utterance pitch mean, utterance pitch std, SNR, C50, speaking rate, phonemes, STOI, SI-SDR, PESQ, age, accent, brightness, emotion, gender, smoothness, pitch, noise, reverberation, speech monotony, and multiple text descriptions. The dataset is split into a training set with 2 examples.

Provider:

westbrook

Summary of original information

Dataset overview

Dataset features

segment_id: string
speaker: string
text: string
audio: audio, sampling rate 16000 Hz
begin_time: float
end_time: float
audio_id: string
title: string
url: string
source: class label, with the following classes:
- 0: audiobook
- 1: podcast
- 2: youtube
category: class label, with the following classes:
- 0: People and Blogs
- 1: Business
- 2: Nonprofits and Activism
- 3: Crime
- 4: History
- 5: Pets and Animals
- 6: News and Politics
- 7: Travel and Events
- 8: Kids and Family
- 9: Leisure
- 10: N/A
- 11: Comedy
- 12: News and Politics
- 13: Sports
- 14: Arts
- 15: Science and Technology
- 16: Autos and Vehicles
- 17: Science and Technology
- 18: People and Blogs
- 19: Music
- 20: Society and Culture
- 21: Education
- 22: Howto and Style
- 23: Film and Animation
- 24: Gaming
- 25: Entertainment
- 26: Travel and Events
- 27: Health and Fitness
- 28: audiobook
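
The integer ids in the two class-label lists above can be mapped back to names with a plain lookup. The following is a minimal sketch in standard Python, not the dataset author's code: the names `SOURCE_NAMES`, `CATEGORY_NAMES`, `label_name`, and `ids_by_name` are hypothetical, and the lists are copied from this listing as shown. Because several category names appear under more than one id here, the reverse lookup returns a list of ids rather than a single id.

```python
from collections import defaultdict

# Label lists copied from this listing; a label's position is its integer id.
SOURCE_NAMES = ["audiobook", "podcast", "youtube"]
CATEGORY_NAMES = [
    "People and Blogs", "Business", "Nonprofits and Activism", "Crime",
    "History", "Pets and Animals", "News and Politics", "Travel and Events",
    "Kids and Family", "Leisure", "N/A", "Comedy", "News and Politics",
    "Sports", "Arts", "Science and Technology", "Autos and Vehicles",
    "Science and Technology", "People and Blogs", "Music",
    "Society and Culture", "Education", "Howto and Style",
    "Film and Animation", "Gaming", "Entertainment", "Travel and Events",
    "Health and Fitness", "audiobook",
]


def label_name(names, label_id):
    """Return the label name for an integer id, rejecting out-of-range ids."""
    if not 0 <= label_id < len(names):
        raise ValueError(f"label id {label_id} is outside 0..{len(names) - 1}")
    return names[label_id]


def ids_by_name(names):
    """Build a name -> list of ids index; repeated names keep every id."""
    index = defaultdict(list)
    for label_id, name in enumerate(names):
        index[name].append(label_id)
    return dict(index)


if __name__ == "__main__":
    print(label_name(SOURCE_NAMES, 1))
    print(ids_by_name(CATEGORY_NAMES)["News and Politics"])
```

When the data is loaded through the Hugging Face `datasets` library, the class-label feature normally offers an equivalent `int2str` method, so a hand-written table like this is mainly useful for inspecting exported rows.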
original_full_path: string
utterance_pitch_mean: float
utterance_pitch_std: float
snr: float
c50: float
speaking_rate: string
phonemes: string
stoi: float
si-sdr: float
pesq: float
age_ori: string
age_value: float
age: string
accent_ori: string
accent_value: float
accent: string
brightness_ori: string
brightness_value: float
brightness: string
emotion_ori: string
emotion_value: float
emotion: string
gender_ori: string
gender_value: float
gender: string
smoothness_ori: string
smoothness_value: float
smoothness: string
pitch: string
noise: string
reverberation: string
speech_monotony: string
text_description1: string
text_description2: string
text_description3: string
text_description4: string
text_description5: string
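
As an illustration of how the timing fields relate to the audio feature, the sketch below derives a segment's duration from `begin_time` and `end_time` and estimates how many samples that span covers at the listed 16000 Hz sampling rate. It assumes the two time fields are expressed in seconds, which this listing does not state; the function names and the example values are hypothetical.

```python
SAMPLE_RATE_HZ = 16000  # sampling rate given for the audio feature


def segment_duration(begin_time, end_time):
    """Duration of a segment, assuming both times are in seconds."""
    if end_time < begin_time:
        raise ValueError("end_time must not be earlier than begin_time")
    return end_time - begin_time


def expected_num_samples(begin_time, end_time, sample_rate=SAMPLE_RATE_HZ):
    """Approximate number of audio samples spanned by the segment."""
    return round(segment_duration(begin_time, end_time) * sample_rate)


if __name__ == "__main__":
    # Hypothetical row using only the two timing fields.
    row = {"begin_time": 1.0, "end_time": 3.5}
    print(segment_duration(row["begin_time"], row["end_time"]))
    print(expected_num_samples(row["begin_time"], row["end_time"]))
```

Comparing this estimate with the length of the decoded audio array is one simple way to check that a row's timestamps and waveform agree.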

Dataset splits

train: 2 examples, 146510.0 bytes

Dataset size

Download size: 176947 bytes
Dataset size: 146510.0 bytes
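
For a rough sense of scale, the figures above can be combined into a per-example size and a download-to-dataset ratio. This is a small arithmetic sketch using only the numbers in this listing; the variable names are hypothetical.

```python
NUM_EXAMPLES = 2
DATASET_SIZE_BYTES = 146510
DOWNLOAD_SIZE_BYTES = 176947

# Average size of one example in the prepared dataset.
avg_bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

# How much larger the downloaded files are than the reported dataset size.
extra_download_bytes = DOWNLOAD_SIZE_BYTES - DATASET_SIZE_BYTES
download_ratio = DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES

if __name__ == "__main__":
    print(avg_bytes_per_example)
    print(extra_download_bytes)
    print(round(download_ratio, 3))
```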

Configuration

config_name: default
data_files:
  - split: train
    path: data/train-*
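
The configuration above maps the train split to every file matching the glob `data/train-*`. The sketch below shows how such a pattern selects files and how the split might be loaded with the Hugging Face `datasets` library. It is a minimal illustration under stated assumptions: the helper names and the example file names are hypothetical, the actual file names in the repository are not given in this listing, and `load_train_split` needs the `datasets` package plus network access.

```python
from fnmatch import fnmatch

REPO_ID = "westbrook/gigaspeech-tiny-0-train"
TRAIN_PATTERN = "data/train-*"  # pattern from the default configuration


def files_for_train(repo_files, pattern=TRAIN_PATTERN):
    """Return the repository files that the train pattern would select."""
    return [name for name in repo_files if fnmatch(name, pattern)]


def load_train_split():
    """Load the train split; imported lazily so the helper above has no dependency."""
    from datasets import load_dataset

    # Assumption: to go through the mirror linked in this listing, the
    # HF_ENDPOINT environment variable is typically set to https://hf-mirror.com
    # before the library is imported.
    return load_dataset(REPO_ID, split="train")


if __name__ == "__main__":
    # Hypothetical file listing, for illustration only.
    example_files = ["README.md", "data/train-00000-of-00001.parquet"]
    print(files_for_train(example_files))
```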
