lalok/gyeongsan_address_ko_8k
收藏Hugging Face2024-07-16 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/lalok/gyeongsan_address_ko_8k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频和对应的转录文本,音频的采样率为16000Hz。数据集分为训练集、测试集和验证集,分别包含710600、88826和88825个样本。总下载大小为113758223552字节,数据集总大小为118977496880.349字节。
The dataset contains audio and corresponding transcripts, with the audio sampled at 16000Hz. The dataset is divided into training, test, and validation sets, containing 710600, 88826, and 88825 samples respectively. The total download size is 113758223552 bytes, and the total dataset size is 118977496880.349 bytes.
提供机构:
lalok
原始信息汇总
数据集概述
特征
- audio:
- 采样率: 16000
- transcripts:
- 数据类型: string
数据分割
- train:
- 字节数: 95269502900.9691
- 样本数: 710600
- test:
- 字节数: 11702946536.133759
- 样本数: 88826
- valid:
- 字节数: 12005047443.246136
- 样本数: 88825
数据大小
- 下载大小: 113758223552
- 数据集大小: 118977496880.349
配置
- config_name: default
- data_files:
- train: data/train-*
- test: data/test-*
- valid: data/valid-*
- data_files:



