FSC22 Dataset

Name: FSC22 Dataset
Creator: IEEE DataPort
Published: 2022-09-23 03:42:33
License: 暂无描述

DataCite Commons2022-09-23 更新2025-04-16 收录

下载链接：

https://ieee-dataport.org/documents/fsc22-dataset

下载链接

链接失效反馈

官方服务：

资源简介：

DescriptionForest environmental sound classification is one use case of ESC which has been widely experimenting to identify illegal activities inside a forest. With the unavailability of public datasets specific to forest sounds, there is a requirement for a benchmark forest environment sound dataset. With this motivation, the FSC22 was created as a public benchmark dataset, using the audio samples collected from FreeSound org. This dataset includes 2025 labeled sound clips of 5s long. All the audio samples are distributed between six major parent-level classes; Mechanical sounds, Animal sounds, Environmental Sounds, Vehicle Sounds, Forest Threat Sounds, and Human Sounds. Further, each class is divided into subclasses that capture specific sounds which fall under the main category. Overall the dataset taxonomy consists of 34 classes as shown below. For the first phase of the dataset creation, 75 audio samples for every 27 classes were collected.  We expect that this dataset will help research communities with their research work governing Forest Acoustic monitoring and classification domain. SourcesThis dataset contains 2025 audio clips originating from the online audio database FreeSound Org(https://freesound.org/). FreeSound is a free platform that consists of thousands of audio recordings. Collection methodology After finalizing the taxonomy of the dataset, data was collected from the FreeSound platform for each class.  For each of the selected class labels, we queried for audio samples which contain the considered label in the title or the description, using the API endpoint for text search. This was done using an automated python script. Then all the gathered data was filtered and validated through a manual process. First, all the queried results were checked, and removed the irrelevant records. Then filtered audio samples were sent through further processing to select the most accurate audios by downloading each audio clip and listening to them. Following, 75 audio clips with a duration, nearly equal to 5 seconds were selected per class. In the end, the filtered and validated data were normalized and 75 audio clips with a fixed length of 5s were finalized per class.

提供机构：

IEEE DataPort

创建时间：

2022-09-23

5,000+

优质数据集

54 个

任务类型

进入经典数据集