DCASE 2024 Task 9: Language-Queried Audio Source Separation | Pre-trained Weights for the Baseline System

Source: NIAID Data Ecosystem (indexed 2026-05-01)

Download link:

https://zenodo.org/record/10887459

Resource description:

== Descriptions ==
We trained the AudioSep [1] model using the development set (Clotho and augmented FSD50K datasets) for 200k steps with a batch size of 16 using one Nvidia A100 GPU (around 1 day). Model details can be found in the AudioSep paper.

Pre-trained weights for the baseline system: audiosep_16k,baseline,step=200000.ckpt
Baseline codebase (GitHub): https://github.com/Audio-AGI/dcase2024_task9_baseline

== Reference ==
[1] Liu X, Kong Q, Zhao Y, et al. Separate anything you describe. arXiv:2308.05037, 2023.

== Contact ==
Xubo Liu, xubo.liu@surrey.ac.uk
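
The training budget quoted in the description (200k steps, batch size 16, around 1 day on one GPU) can be turned into rough throughput figures. The sketch below is only back-of-the-envelope arithmetic: it assumes "around 1 day" means exactly 24 hours, and the function and variable names are illustrative, not taken from the baseline codebase.

```python
# Figures quoted in the description.
TRAIN_STEPS = 200_000
BATCH_SIZE = 16

# Assumption: "around 1 day" is treated as exactly 24 hours of wall-clock time.
ASSUMED_TRAIN_SECONDS = 24 * 60 * 60


def training_budget(steps, batch_size, seconds):
    """Return (examples seen, average steps per second, average examples per second)."""
    examples_seen = steps * batch_size
    steps_per_second = steps / seconds
    examples_per_second = examples_seen / seconds
    return examples_seen, steps_per_second, examples_per_second


examples, steps_s, examples_s = training_budget(
    TRAIN_STEPS, BATCH_SIZE, ASSUMED_TRAIN_SECONDS
)
print(f"training examples seen: {examples:,}")
print(f"average steps per second: {steps_s:.2f}")
print(f"average examples per second: {examples_s:.1f}")
```

Under the 24-hour assumption this works out to 3,200,000 training examples at roughly 2.3 steps per second. The actual wall-clock time is only given approximately, so the rates are estimates.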

Created:

2024-03-27
