five

AeroSonicDB 3K: Audio Dataset of Low-Flying Aircraft

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/12775559
下载链接
链接失效反馈
官方服务:
资源简介:
AEROSONICDB 3K: AUDIO DATASET OF LOW-FLYING AIRCRAFT Version 2.0-beta (July 2024)   This repository is currently restricted, however you can gain access by "requesting access" on this page with your Zenodo account, or by emailing Blake Downward at aerosonicdb@gmail.com, or by submitting the following Google Form.   Created by Blake Downward Publication When using this data in an academic work, please reference the dataset DOI and version. Please also reference the following paper which describes the methodology for collecting the dataset and presents baseline model results. Downward, B., & Nordby, J. (2023). The AeroSonicDB (YPAD-0523) Dataset for Acoustic Detection and Classification of Aircraft. ArXiv, abs/2311.06368. Description AeroSonicDB 3K is a machine labelled audio dataset of over 3,000 low-flying aircraft events. Samples in this dataset were recorded at locations in close proximity to a flight path approaching or departing Adelaide International Airport’s (ICAO code: YPAD) primary runway, 05/23. Recordings are triggered and labelled based on radio (ADS-B) messages received from the aircraft overhead, then human verified and trimmed to the first and final moments the target aircraft is audible in the sample. Major changes from version 1 This iteration of AeroSonicDB was developed with the aim to improve accessibility and usability for the end-user, streamline the sample collection and annotation procedures, and minimise memory consumption without losing crucial information. A summary of the major changes include: Sample rate for audio files is now 16,000 Hz (previously 22,050 Hz). Samples have been trimmed to the audio event onset and offset moments - the entire duration of the clip can be assumed to be of the same class. “altitude” has been ommited as a sample feature. “engfamily” and “shortdesc” have benn ommitted as aircraft features. Samples are split across 5 temporaly disjointed folds. (previously had a sixth fold to act as a hold-out test set) Introduction of a PDM microphone embedded in a microcontroller for capturing audio (approximately 20% of all samples) A machine learning model trained on the original dataset was deployed to assist with annotation. Approximately 60% of newly acquired silence/background samples were annotated by progressive iterations of this model. 582 unique aircraft recorded (up 280 from 302 in version 1) 3,058 recordings of aircraft audio events for a total of 38 hours (up from 625 recordings and 8.9 hours) 32 hours of silence/background audio (up from 3.5 hours) Audio data SAMPLE RATE: 16,000 HzDATA TYPE: ‘int16’/‘wav’ Class distribution (hours of audio) Binary Classes no aircraft/background (32) aircraft (38) Sub-classes no aircraft/background (32) Piston aircraft (1.2) Turboprop aircraft (5.3) Turbofan aircraft (31) Rotorcraft/helicopter (0.5)   Data splits/folds Recordings have been split into 5 folds, giving researchers a common split for cross-validation and ensuring comparable results. Folds are temporaly disjoint to avoid data leakage. Metadata files The entire dataset is referenced with labels in the “sample_meta.csv” file. Each row contains a reference to a unique recording, as well as meta information such as; class, duration and fold. The “aircraft_meta.csv” a file can be used to reference aircraft specific features - such as; manufacturer, engine type, ICAO type designator etc. (see “Columns/Labels” below for all features).   Audio-file naming convention Audio samples are in WAV format, with some additional metadata stored in the filename. Basic Convention “Aircraft ID + Date + Time + Location ID + Microphone ID” “XXXXXX_YYYY-MM-DD_hh-mm-ss_X_X” Sample with aircraft {hex_id} _ {date} _ {time} _ {location_id} _ {microphone_id} . {file_ext} 7C7CD0_2023-05-09_12-42-55_2_1.wav Sample without aircraft “Silence” files are denoted with six (6) leading zeros rather than an aircraft hex code. All relevant metadata for “silence” samples are contained in the audio filename, and again in the accompanying “sample_meta.csv” 000000 _ {date} _ {time} _ {location_id} _ {microphone_id} . {file_ext} 000000_2023-05-09_12-30_2_1.wav Columns/Labels (sample_meta.csv) filename: The filename of the audio recording duration: Length of the audio event in seconds. date: Date of the recording fold: Digit from 0 to 4 splitting the dataset 5 ways class: Class label for the recording (eg. 0 = No aircraft/background, 1 = Aircraft audible) hex_id: Unique ICAO 24-bit address for the aircraft recorded   Columns/Labels (aircraft_meta.csv) hex_id: Unique ICAO 24-bit address for the aircraft recorded Manu: Aircraft manufacturer (eg. Boeing, Pilatus, Airbus) Model: Aircraft model (eg. 737-800, A320-232, DHC-8-315) engnum: Number of engines Engmanu: Engine manufacturer (eg. Pratt & Whitney, CFM International, Rolls Royce) Engtype: Type of engine (eg. Piston, Turboprop, Turbofan, Turboshaft) Engmodel: Engine model (eg. TRENT XWB, CFM56-7B24E, PT6E-67XP) Fueltype: Fuel type used in the engine (eg. Gasoline, Kerosene) Airframe: Describes the mechanical structure of the aircraft (eg. Power Driven Aeroplane, Rotorcraft) Propmanu: Propeller manufacturer (eg. Hartzell Propellers, Hamilton Standard, “Aircraft Not Fitted With Propeller”) Propmodel: Propeller model (eg. HC-E5A-3A/NC10245B, 14SF-15, “Not Applicable”) MTOW: Maximum take off weight (MTOW) in kilograms ICAOtypedesig: ICAO type designator for make and model of aircraft (eg. PC12, C185, B738)   Conditions of use Dataset created by Blake Downward. The AeroSonicDB 3K dataset is offered free of charge for non-commercial use under the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license. https://creativecommons.org/licenses/by-nc/4.0/ Acknowledgements Special thanks to Jon Nordby of Soundsensing AS - his contributions were pivotal in maximising the potential of this dataset for open-source release. Feedback Please send suggestions, feedback and comments to: Blake Downward: aerosonicdb@gmail.com Change log 2.0-beta: All major changes as described above 1.1.2: Added “environment_mappings_raw.csv”. No change to audio from Version 1.1 1.1.1: Minor change to “sample_meta.csv” - replaced “6” with “test” in the “fold” column 1.1: Replaced truncated aircraft samples with the original full-length files and annotated the beginning and end of each audio event. Added ‘ignore’ statements to aircraft event boundaries in the environmental class mappings file. 1.0: Environmental audio and mappings added 0.3: locations.json file added, README updated 0.2: location information added to README
创建时间:
2024-08-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作