DCASE 2024 Task 9: Language-Queried Audio Source Separation | Pre-trained Weights for the Baseline System
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10887459
下载链接
链接失效反馈官方服务:
资源简介:
== Descriptions ==
We trained the AudioSep [1] model using the development set (Clotho and augmented FSD50K datasets) for 200k steps with a batch size of 16 using one Nvidia A100 GPU (around 1 day). Model details can be found in the AudioSep paper.
Pre-trained weights for the baseline system:
audiosep_16k,baseline,step=200000.ckpt
Baseline codebase:
GitHub: https://github.com/Audio-AGI/dcase2024_task9_baseline
== Reference ==
[1] Liu X, Kong Q, Zhao Y, et al. Separate anything you describe. arXiv:2308.05037, 2023.
== Contact ==
Xubo Liu, xubo.liu@surrey.ac.uk
创建时间:
2024-03-27



