five

FSC22 Dataset

收藏
DataCite Commons2022-09-23 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/fsc22-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
DescriptionForest environmental sound classification is one use case of ESC which has been widely experimenting to identify illegal activities inside a forest. With the unavailability of public datasets specific to forest sounds, there is a requirement for a benchmark forest environment sound dataset. With this motivation, the FSC22 was created as a public benchmark dataset, using the audio samples collected from FreeSound org. This dataset includes 2025 labeled sound clips of 5s long. All the audio samples are distributed between six major parent-level classes; Mechanical sounds, Animal sounds, Environmental Sounds, Vehicle Sounds, Forest Threat Sounds, and Human Sounds. Further, each class is divided into subclasses that capture specific sounds which fall under the main category. Overall the dataset taxonomy consists of 34 classes as shown below. For the first phase of the dataset creation, 75 audio samples for every 27 classes were collected.  We expect that this dataset will help research communities with their research work governing Forest Acoustic monitoring and classification domain. SourcesThis dataset contains 2025 audio clips originating from the online audio database FreeSound Org(https://freesound.org/). FreeSound is a free platform that consists of thousands of audio recordings. Collection methodology After finalizing the taxonomy of the dataset, data was collected from the FreeSound platform for each class.  For each of the selected class labels, we queried for audio samples which contain the considered label in the title or the description, using the API endpoint for text search. This was done using an automated python script. Then all the gathered data was filtered and validated through a manual process. First, all the queried results were checked, and removed the irrelevant records. Then filtered audio samples were sent through further processing to select the most accurate audios by downloading each audio clip and listening to them. Following, 75 audio clips with a duration, nearly equal to 5 seconds were selected per class. In the end, the filtered and validated data were normalized and 75 audio clips with a fixed length of 5s were finalized per class.
提供机构:
IEEE DataPort
创建时间:
2022-09-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作