nvidia/AudioSkills
收藏Hugging Face2025-08-05 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/nvidia/AudioSkills
下载链接
链接失效反馈官方服务:
资源简介:
AudioSkills-XL是一个大规模的音频问答(AQA)数据集,旨在开发(大型)音频语言模型,使其能够在短时间内对音频片段进行专家级的推理和解决问题。它扩展了原始的AudioSkills集合,新增了约450万个新的问答对,总计约1000万个多样化示例。该数据集按每个音频的来源数据集划分成子集。数据集的授权和使用条款由NVIDIA OneWay非商业许可管理。
AudioSkills-XL is a large-scale audio question-answering (AQA) dataset designed to develop (large) audio-language models capable of expert-level reasoning and problem-solving over short audio clips (≤30 seconds). It expands upon the original AudioSkills collection by adding approximately 4.5 million new QA pairs, resulting in a total of ~10 million diverse examples. The dataset is partitioned into subsets based on each audios source dataset. The use of AudioSkills-XL is governed by the NVIDIA OneWay Noncommercial License.
提供机构:
nvidia



