ekacare/vistaar_small_asr_eval
Source: Hugging Face. Updated 2025-07-25; indexed 2025-08-09.
Download link:
https://hf-mirror.com/datasets/ekacare/vistaar_small_asr_eval
Description:
The Vistaar Small ASR Eval dataset is a multilingual automatic speech recognition (ASR) evaluation dataset containing 9,486 audio samples across 12 Indian languages, with a total duration of approximately 18.6 hours. It is a subset of the larger Vistaar dataset published by AI4Bharat and is intended for evaluating and benchmarking ASR models on diverse Indian-language speech. It comprises six distinct subsets, each contributing its own characteristics to the overall evaluation.
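The listing does not say which metric the dataset is meant to be scored with. Word error rate (WER) is a common choice for ASR evaluation, so the following is a minimal sketch of it, assuming whitespace tokenization and no text normalization. The function name `word_error_rate` is hypothetical and not part of the dataset or any library.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are separated by whitespace and compared exactly,
    with no casing, punctuation, or script normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against a four-word reference.
    print(word_error_rate("a b c d", "a x c"))
```

In practice, scores for Indian-language text can depend heavily on the normalization applied before comparison, so any numbers produced with a sketch like this are only comparable to results computed the same way.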
Provider:
ekacare



