AdoCleanCode/giga_train_corr_v1
收藏Hugging Face2025-12-11 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/AdoCleanCode/giga_train_corr_v1
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: speaker_id
dtype: string
- name: flac_filename
dtype: string
- name: transcription_full
dtype: string
- name: removed_words
dtype: string
- name: transcription_without_removed
dtype: string
- name: phonemes_full
dtype: string
- name: phonemes_removed
dtype: string
- name: phonemes_annotated
dtype: string
- name: xcodec2_tokens
dtype: string
- name: sequence
dtype: string
- name: match
dtype: bool
- name: removed_start_time
dtype: float64
- name: removed_end_time
dtype: float64
splits:
- name: batch_000
num_bytes: 66395102
num_examples: 20000
download_size: 14894083
dataset_size: 66395102
configs:
- config_name: default
data_files:
- split: batch_000
path: data/batch_000-*
---
提供机构:
AdoCleanCode



