Songdo Traffic: High Accuracy Georeferenced Vehicle Trajectories from a Large-Scale Study in a Smart City

NIAID Data Ecosystem2026-05-02 收录

下载链接：

https://zenodo.org/record/13828383

下载链接

链接失效反馈

官方服务：

资源简介：

Overview The Songdo Traffic dataset delivers precisely georeferenced vehicle trajectories captured through high-altitude bird's-eye view (BeV) drone footage over Songdo International Business District, South Korea. Comprising approximately 700,000 unique trajectories, this resource represents one of the most extensive aerial traffic datasets publicly available, distinguishing itself through exceptional temporal resolution that captures vehicle movements at 29.97 points per second, enabling unprecedented granularity for advanced urban mobility analysis. ⚠️ Important: If you use this dataset in your work, please cite the following reference [1]: Robert Fonod, Haechan Cho, Hwasoo Yeo, Nikolas Geroliminis (2025). Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery, arXiv preprint arXiv:2411.02136. (Note: This manuscript shall be replaced by the published version once available.) Dataset Composition The dataset consists of four primary components: Trajectory Data: 80 ZIP archives containing high-resolution vehicle trajectories with georeferenced positions, speeds and acceleration profiles, and other metadata. Orthophoto Cut-Outs: High-resolution (8000×8000 pixel) orthophoto images for each monitored intersection, used for georeferencing and visualization. Road and Lane Segmentations: CSV files defining lane polygons within road sections, facilitating mapping of vehicle positions to road segments and lanes. Sample Videos: A selection of 4K UHD drone video samples capturing intersection footage during the experiment. Data Collection The dataset was collected as part of a collaborative multi-drone experiment conducted by KAIST and EPFL in Songdo, South Korea, from October 4–7, 2022. A fleet of 10 drones monitored 20 busy intersections, executing advanced flight plans to optimize coverage. 4K (3840×2160) RGB video footage was recorded at 29.97 FPS from altitudes of 140–150 meters. Each drone flew 10 sessions per day, covering peak morning and afternoon periods. The experiment resulted in 12TB of 4K raw video data. More details on the experimental setup and data processing pipeline are available in [1]. Data Processing The trajectories were extracted using geo-trax, an advanced deep learning framework designed for high-altitude UAV-based traffic monitoring. This state-of-the-art pipeline integrates vehicle detection, tracking, trajectory stabilization, and georeferencing to extract high-accuracy traffic data from drone footage. Key Processing Steps: Vehicle Detection & Tracking: Vehicles were detected and tracked across frames using a deep learning-based detector and motion-model-based tracking algorithm. Trajectory Stabilization: A novel track stabilization method was applied using detected vehicle bounding boxes as exclusion masks in image registration. Georeferencing & Coordinate Transformation: Each trajectory was transformed into global (WGS84), local Cartesian, and orthophoto coordinate systems. Vehicle Metadata Estimation: In addition to time-stamped vehicle trajectories, various metadata attributes were also extracted, including vehicle dimensions and type, speed, acceleration, class, lane number, road section, and visibility status. More details on the extraction methodology are available in [1]. File Structure & Formats 1. Trajectory Data (Daily Intersection ZIPs, 16.2 MB ~ 360.2 MB) The trajectory data is organized into 80 ZIP files, each containing traffic data for a specific intersection and day of the experiment. File Naming Convention: YYYY-MM-DD_intersectionID.zip YYYY-MM-DD represents the date of data collection (2022-10-04 to 2022-10-07). intersectionID is a unique identifier for one of the 20 intersections where data was collected (A, B, C, E, …, U). The letter D is reserved to denote "Drone". Each ZIP file contains 10 CSV files, each corresponding to an individual flight session: YYYY-MM-DD_intersectionID.zip │── YYYY-MM-DD_intersectionID_AM1.csv ├── … │── YYYY-MM-DD_intersectionID_AM5.csv │── YYYY-MM-DD_intersectionID_PM1.csv ├── … └── YYYY-MM-DD_intersectionID_PM5.csv Here, AM1-AM5 and PM1-PM5 denote morning and afternoon flight sessions, respectively. For example, 2022-10-04_S_AM1.csv contains all extracted trajectories from the first morning session of the first day at the intersection 'S'. CSV File Example Structure: Each CSV file contains high-frequency trajectory data, formatted as follows (d.p. = decimal place): Dataset Column Name Format / Units Data Type Explanation Vehicle_ID 1, 2, … Integer Unique vehicle identifier within each CSV file Local_Time hh:mm:ss.sss String Local Korean time (GMT+9) in ISO 8601 format Drone_ID 1, 2, …, 10 Integer Unique identifier for the drone capturing the data Ortho_X, Ortho_Y px (1 d.p.) Float Vehicle center coordinates in the orthophoto cut-out image Local_X, Local_Y m (2 d.p.) Float KGD2002 / Central Belt 2010 planar coordinates (EPSG:5186) Latitude, Longitude ° DD (7 d.p.) Float WGS84 geographic coordinates in decimal degrees (EPSG:4326) Vehicle_Length*, Vehicle_Width* m (2 d.p.) Float Estimated physical dimensions of the vehicle Vehicle_Class Categorical (0–3) Integer Vehicle type: 0 (car/van), 1 (bus), 2 (truck), 3 (motorcycle) Vehicle_Speed* km/h (1 d.p.) Float Estimated speed computed from trajectory data using Gaussian smoothing Vehicle_Acceleration* m/s² (2 d.p.) Float Estimated acceleration derived from smoothed speed values Road_Section* N_G String Road section identifier (N = node, G = lane group) Lane_Number* 1, 2, … Integer Lane position (1 = leftmost lane in the direction of travel) Visibility 0/1 Boolean 1 = fully visible, 0 = partially visible in the camera frame * These columns may be empty under certain conditions, see [1] for more details. 2. Orthophoto Cut-Outs (orthophotos.zip, 1.8 GB) For each intersection, we provide the high-resolution orthophoto cut-outs that were used for georeferencing. These 8000×8000 pixel PNG images cover specific areas, allowing users to overlay orthophoto trajectories within the road network. orthophotos/ │── A.png │── B.png │── … └── U.png For more details on the orthophoto generation process, refer to [1]. 3. Orthophoto Segmentations (segmentations.zip, 24.9 KB) We provide the road and lane segmentations for each orthophoto cut-out, stored as CSV files where each row defines a lane polygon within a road section. Each section (N_G) groups lanes moving in the same direction, with lanes numbered sequentially from the innermost outward. The CSV files are structured as follows: segmentations/ │── A.csv │── B.csv │── … └── U.csv Each file contains the following columns: Section: Road section ID (N_G format). Lane: Lane number within the section. tlx, tly, blx, bly, brx, bry, trx, try: Polygon corner coordinates. These segmentations enabled trajectory points to be mapped to specific lanes and sections in our trajectory dataset. Vehicles outside segmented areas (e.g., intersection centers) remain unlabeled. Perspective distortions may also cause misalignments for taller vehicles. 4. Sample Videos (sample_videos.zip, 26.8 GB) The dataset includes 29 video samples, each capturing the first 60 seconds of drone hovering over its designated intersection during the final session (PM5) on October 7, 2022. These high-resolution 4K videos provide additional context for trajectory analysis and visualization, complementing the orthophoto cut-outs and segmentations. sample_videos/ │── A_D1_2022-10-07_PM5_60s.mp4 │── A_D2_2022-10-07_PM5_60s.mp4 │── B_D1_2022-10-07_PM5_60s.mp4 │── … └── U_D10_2022-10-07_PM5_60s.mp4 Additional Files README.md – Dataset documentation (this file) LICENSE.txt – Creative Commons Attribution 4.0 License Known Dataset Artifacts and Limitations While this dataset is designed for high accuracy, users should be aware of the following known artifacts and limitations: Trajectory Fragmentation: Trajectories may be fragmented for motorcycles in complex road infrastructure scenarios (pedestrian crossings, bicycle lanes, traffic signals) and for certain underrepresented truck variants. Additional fragmentations occurred when drones experienced technical issues during hovering, necessitating mid-recording splits that naturally resulted in divided trajectories. Vehicle ID Ambiguities: The largest Vehicle_ID in a CSV file does not necessarily indicate the total number of unique vehicles. Kinematic Estimation Limitations: Speed and acceleration values are derived from raw tracking data and may be affected by minor errors due to detection inaccuracies, stabilization artifacts, and applied interpolation and smoothing techniques. Vehicle Dimension Estimation: Estimates may be unreliable for stationary or non-axially moving vehicles and can be affected by bounding box overestimations capturing protruding vehicle parts or shadows. Lane and Section Assignment Inaccuracies: Perspective effects may cause vehicles with significant heights, such as trucks or buses, to be misassigned to incorrect lanes or sections in the orthophoto. Occasional pedestrian pair misclassifications: Rarely, two pedestrians walking side by side may be briefly mistaken for a motorcycle, but such instances are short-lived and typically removed by the short trajectory filter. For a comprehensive discussion of dataset limitations and validation procedures, refer to [1]. Citation & Attribution Preferred Citation: If you use Songdo Traffic for any purpose, whether in academic research, commercial applications, open-source projects, or benchmarking efforts, please cite our accompanying manuscript [1]: Robert Fonod, Haechan Cho, Hwasoo Yeo, Nikolas Geroliminis (2025). Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery, arXiv preprint arXiv:2411.02136. (Note: This manuscript shall be replaced by the published version once available.) Note: Although Zenodo automatically provides a formal dataset citation (shown below), we kindly request that you reference the manuscript as the primary source of this work. Dataset Citation (for archival purposes): Robert Fonod, Haechan Cho, Hwasoo Yeo, Nikolas Geroliminis (2025). Songdo Traffic: High Accuracy Georeferenced Vehicle Trajectories from a Large-Scale Study in a Smart City (v1). Zenodo. DOI: 10.5281/zenodo.13828384.

创建时间：

2025-03-17