Floating Car Data Collection for Processing and Benchmarking
收藏NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/2250118
下载链接
链接失效反馈官方服务:
资源简介:
The dataset is outcome of a paper "Floating Car Data Map-matching Utilizing the Dijkstra Algorithm" accepted for 3rd International Conference on Data Management, Analytics & Innovation held in Kuala Lumpur, Malaysia in 2019.
The floating car data (FCD representing movement of cars with their position in time) is produced by the traffic simulator software (further referred to as Simulator) published in [1] and can be used as an input for data processing and benchmarking. The dataset contains FCD of various quality levels based on the routing graph of the Czech Republic derived from Open Street Map openstreetmap.org.
Should the dataset be exploited in scientific or other way, any acknowledgement or references to our paper [1] and dataset are welcomed and highly appreciated.
Archive contents
The archive contains following folders.
city_oneway and city_roadtrip - FCD from the city of Brno, Czech Republic where FCD is based on Origin-Destination in case of oneway and Origin-Destination-Origin in case of a road trip
intercity_oneway and intercity_roadtrip - FCD from cities of Brno, Ostrava, Olomouc and Zlin, all Czech Republic where FCD is based on Origin-Destination in case of oneway and Origin-Destination-Origin in case of a road trip
Content explanation
All four of mentioned folders contain raw FCD as they come from our Simulator, post-processed FCD enriching Simulator FCD, and obfuscated raw FCD (of both low and high obfuscation level). In the both obfuscated data sets, each measured point was moved in a random direction a number of meters given by drawing a number from a Gaussian distribution. We utilized two Gaussian distributions, one for the roads outside the city (N(0,10) for the lower and N(0,20) for the higher obfuscation level) and one for the roads inside the city (N(0,15) and N(0,30) respectively). Then some predefined number of randomly chosen points were removed (3% in our case). This approach should roughly represent real conditions encountered by FCD data as described by El Abbous and Samanta [2].
In case of post-processed road trip data, there is one extra dataset with "cache" suffix representing the very same dataset limited to a 5-minute session memoization. This folder also contains a picture of processed FCD represented on a map.
Data format
Standard UTF-8 encoded CSV files, separated by a semicolon with the following columns:
RAW
Header
session_id;timestamp;lat;lon;speed;bearing;segment_id
Data
session_id: (Type: unsigned INT) - session (car) identifier
timestamp: (Type: datetime) - timestamp in UTC
lat: (Type: unsigned long) - latitude as used in Google maps
lon: (Type: unsigned long) - longitude as used in Google maps
speed: (Type: unsigned INT) - actual speed in kmh
bearing: (Type: unsigned INT) - actual bearing in angles 0-360
segment_id: (Type: unsigned long) - unique edge identifier
POST-PROCESSED
Header
gid;car_id;point_time;lat;lon;segment_id;speed_kmh;speed_avg_kmh;distance_delta_m;distance_total_m;speedup_ratio;duration;segment_changed;duration_segment;moved;duration_move;good;duration_good;bearing;interpolated
Data
gid: (Type: unsigned long) - global identifier of a record
car_id: (Type: unsigned INT) - session (car) identifier
point_time: (Type: datetime) - timestamp with timezone
lat: (Type: unsigned long) - latitude as used in Google maps
lon: (Type: unsigned long) - longitude as used in Google maps
segment_id: (Type: unsigned long) - unique edge identifier
speed: (Type: unsigned INT) - actual speed in kmh
speed_avg_kmh: (Type: unsigned long) - actual average speed of a car in kmh
distance_delta_m: (Type: unsigned long) - actual distance delta in metres
distance_total_m: (Type: unsigned long) - actual total distance of a car in metres
speedup_ratio: (Type: unsigned long) - actual speed-up ratio of a car
duration: (Type: time) - actual duration of a car
segment_changed: (Type: boolean) - signals if actual segment of a car differs from the previous one
duration_segment: (Type: time) - actual duration on a segment of a car
moved: (Type: boolean) - signals if actual position of a car differs from the previous one
duration_move:(Type: time) - actual duration of a car since moving
good: signals if actual record values satisfies all data constraints (all true as derived from Simulator)
duration_good: actual duration of a car since when all constraints conditions satisfied
bearing: (Type: unsigned INT) - actual bearing in angles 0-360
interpolated: (Type: boolean) - signals if actual segment identifier is calculated (all false as derived from Simulator)
References
[1] V. Ptošek, J. Ševčík, J. Martinovič, K. Slaninová, L. Rapant, and R. Cmar, Real-time traffic simulator for self-adaptive navigation system validation, Proceedings of EMSS-HMS: Modeling & Simulation in Logistics, Traffic & Transportation, 2018.
[2] A. El Abbous and N. Samanta. A modeling of GPS error distri-butions, In proceedings of 2017 European Navigation Conference (ENC), 2017.
创建时间:
2020-01-24



