UCSC-VLAA/PARADE_audio

Name: UCSC-VLAA/PARADE_audio
Creator: UCSC-VLAA
Published: 2025-09-07 00:18:00
License: 暂无描述

Hugging Face2025-09-07 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/UCSC-VLAA/PARADE_audio

下载链接

链接失效反馈

官方服务：

资源简介：

AHELM（音频语言模型全面评估）数据集是一个用于评估音频语言模型在音频感知、知识、推理、情感检测、偏见、公平性、多语言性、鲁棒性、毒性和安全性等10个关键方面的性能的基准。该数据集包含了多个子数据集，其中包括PARADE和CoRe-Bench，分别用于评估模型避免刻板印象和对话音频推理的能力。

AHELM (A Holistic Evaluation of Audio-Language Models) dataset is a benchmark designed to assess the performance of audio-language models across 10 key aspects: audio perception, knowledge, reasoning, emotion detection, bias, fairness, multilinguality, robustness, toxicity, and safety. The dataset consists of multiple sub-datasets, including PARADE and CoRe-Bench, which evaluate the models ability to avoid stereotypes and reasoning over conversational audio through inferential multi-turn question answering, respectively.

提供机构：

UCSC-VLAA

5,000+

优质数据集

54 个

任务类型

进入经典数据集