AdityK2409/vistaar_small_asr_eval

Name: AdityK2409/vistaar_small_asr_eval
Creator: AdityK2409
Published: 2025-10-23 11:31:57
License: 暂无描述

Hugging Face2025-10-23 更新2025-10-25 收录

下载链接：

https://hf-mirror.com/datasets/AdityK2409/vistaar_small_asr_eval

下载链接

链接失效反馈

官方服务：

资源简介：

Vistaar Small ASR Eval数据集是一个包含9486个音频样本的多语言自动语音识别评估数据集，涵盖12种印度语言。该数据集是从AI4Bharat发布的更大Vistaar数据集中特别为评估ASR模型性能在多样化的印度语言语音数据上而设计的子集。目前，Vistaar只能通过GitHub访问，我们通过Huggingface重新分发这个数据集的子集，以便更容易使用，并遵循相同的MIT许可证。

The Vistaar Small ASR Eval dataset is a multilingual automatic speech recognition evaluation dataset containing 9,486 audio samples across 12 Indian languages. This dataset is a subset of the larger Vistaar dataset published by AI4Bharat, designed specifically for evaluating ASR model performance on diverse Indian language speech data. A smaller evaluation dataset was created for the use-cases where a quick benchmarking of models is needed. Currently, Vistaar can be accessed through github only, we are redistributing this subset of the dataset through Huggingface for easier usage under the same MIT Liscence.

提供机构：

AdityK2409

5,000+

优质数据集

54 个

任务类型

进入经典数据集