Avian Language Models: Rethinking Bioacoustic Pipelines with Self-Supervised Learning. In Data Science & Engineering Master of Advanced Study (DSE MAS) Capstone Projects
收藏DataCite Commons2026-04-17 更新2026-05-06 收录
下载链接:
https://library.ucsd.edu/dc/object/bb8580508d
下载链接
链接失效反馈官方服务:
资源简介:
For our capstone project in the MAS Data Science and Engineering program, we focused on automated bird species identification from ambient soundscapes, a task traditionally constrained by the laborious process of manual audio annotation. Tracking bird populations can serve as a leading indicator to problems in the environment, such as pollution or food chain imbalances. As many bird species are small and spend most of their time hidden, counting populations through sound rather than sight is much easier and more accurate. The data used in this project consists of audio files containing bird vocalizations collected using focal and omnidirectional soundscape recording devices. The data is available publicly through the BirdSet and Cornell Birdcall Identification (CBI) datasets. We conducted controlled ablation studies to isolate the effects of LLMs within the NatureLM model, and compared to the same architecture without the LLM component, as well as our own encoder-only model based on CLAP.
提供机构:
UC San Diego Library Digital Collections
创建时间:
2025-07-03



