hungphongtrn/vietmed_asr_v3

Name: hungphongtrn/vietmed_asr_v3
Creator: hungphongtrn
Published: 2024-05-31 14:04:33
License: 暂无描述

Hugging Face2024-05-31 更新2024-07-06 收录

下载链接：

https://hf-mirror.com/datasets/hungphongtrn/vietmed_asr_v3

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: hyp sequence: string - name: id dtype: string - name: gt sequence: string - name: onehot_gt sequence: string splits: - name: w2v2Viet_Paramshare_LossRem_WERtest_29_0 num_bytes: 1962748 num_examples: 3435 - name: XLSR53Viet_Paramshare_LossRem_WERtest28_8 num_bytes: 1965552 num_examples: 3435 - name: XLSR53Viet num_bytes: 1963894 num_examples: 3435 download_size: 991089 dataset_size: 5892194 configs: - config_name: default data_files: - split: w2v2Viet_Paramshare_LossRem_WERtest_29_0 path: data/w2v2Viet_Paramshare_LossRem_WERtest_29_0-* - split: XLSR53Viet_Paramshare_LossRem_WERtest28_8 path: data/XLSR53Viet_Paramshare_LossRem_WERtest28_8-* - split: XLSR53Viet path: data/XLSR53Viet-* ---

The dataset is primarily used for speech recognition tasks, featuring four main characteristics: hyp (hypothetical string sequence), id (identifier string), gt (ground truth string sequence), and onehot_gt (one-hot encoded ground truth string sequence). The dataset is divided into three parts: w2v2Viet_Paramshare_LossRem_WERtest_29_0, XLSR53Viet_Paramshare_LossRem_WERtest28_8, and XLSR53Viet, each with specific byte counts and example quantities. The total download size of the dataset is 991089 bytes, and the total dataset size is 5892194 bytes. The configuration file defines the default configuration and specifies the path for each data file.

提供机构：

hungphongtrn

5,000+

优质数据集

54 个

任务类型

进入经典数据集