ctaguchi/ikema_youtube_asr_full_with_long
收藏Hugging Face2026-03-27 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/ctaguchi/ikema_youtube_asr_full_with_long
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
- name: transcription
dtype: string
- name: romaji
dtype: string
- name: phoneme
dtype: string
- name: start
dtype: int64
- name: end
dtype: int64
- name: title
dtype: string
- name: recording_id
dtype: string
- name: url
dtype: string
splits:
- name: train
num_bytes: 1867735804.015
num_examples: 9005
- name: dev
num_bytes: 396260497.986
num_examples: 1554
- name: test
num_bytes: 360261296.668
num_examples: 1532
- name: longtrain
num_bytes: 2142749559.929
num_examples: 3389
- name: longdev
num_bytes: 488404304.0
num_examples: 387
- name: longtest
num_bytes: 477371843.0
num_examples: 358
download_size: 6107559856
dataset_size: 5732783305.598
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: dev
path: data/dev-*
- split: test
path: data/test-*
- split: longtrain
path: data/longtrain-*
- split: longdev
path: data/longdev-*
- split: longtest
path: data/longtest-*
---
提供机构:
ctaguchi



