AudioSet Strong Ensemble Logits
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14626112
下载链接
链接失效反馈官方服务:
资源简介:
This upload contains one HDF5 file that stores ensemble predictions on AudioSet Strong audio files. It is supplementary material for the ICASSP'25 paper Effective Pre-Training of Audio Transformers for Sound Event Detection. The corresponding code can be found in this GitHub repository.
The HDF5 file contains filenames (Youtube IDs) matched with ensembled logits of multiple transformer models. The corresponding keys are "filenames" and "strong_logits". Ensemble Logits for one file are of shape 447 x 250 (number of classes x timeframes at 40 ms resolution). Ensemble Logits are stored in float16 format to save space. Check out the GitHub repository for information on how to use the ensemble logits.
创建时间:
2025-01-10



