EarthSpeciesProject/NatureLM-audio-training
收藏Hugging Face2025-06-03 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/EarthSpeciesProject/NatureLM-audio-training
下载链接
链接失效反馈官方服务:
资源简介:
NatureLM-audio-training是一个大型的、多样化的音频-语言数据集,用于训练能够对自然语言查询提供自然语言回答的生物声学模型。数据集包含超过2600万对音频-文本样本,来源于动物叫声、昆虫、人类语音、音乐和环境声音等多种不同的来源。该数据集支持多种生物声学相关的任务,如分类、检测、字幕生成等。
NatureLM-audio-training is a large and diverse audio-language dataset designed for training bioacoustic models that can generate a natural language answer to a natural language query on a reference bioacoustic audio recording. The dataset consists of over 26 million audio-text pairs derived from diverse sources including animal vocalizations, insects, human speech, music, and environmental sounds, supporting tasks such as classification, detection, and captioning.
提供机构:
EarthSpeciesProject



