five

Multi-Season Formula 1 Lap Dataset with Race-Control-Based Safety Car Labels (2022-2025)

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/djr8rnjtjp
下载链接
链接失效反馈
官方服务:
资源简介:
Containing lap-level race data for the 2022-2025 Formula 1 seasons, the year-race-session data were collected using FastF1, public interface that provides access to timing and race information publicly available similarly on the web. Include only official race sessions! The data for each event were downloaded and then all of the events were merged into a single structured dataset. After eliminating invalid or incomplete laps and cleaning out missing values, the final dataset contains 89,980 driver lap observations. Each line represents one driver on one laps. The dataset includes such contextual and performance variables as lap number, race progression, tire compound, tire life, driver position, team ID, environmental variables including (air temperature, track temperature, humidity, rainfall) etc. We applied certain basic feature engineering processes in order to make the data model-friendly. Safety Car (SC) and Virtual Safety Car (VSC) labels were constructed by parsing official race control messages and aligning them with lap-level records. This makes it possible to model rare events at the level of the lap. The dataset also supports forward-looking targets (for instance, detection of a Safety Car within the next few laps) for probabilistic risk forecasting. The dataset's structure allows experiments with event-level validation. This means that entire Grand Prix events can be separated during training and testing to avoid leakage between laps in the same event. The data set is intended for academic research, it is a set of processing and structured derivative of public interface data FastF1 that was obtained via timing.
创建时间:
2026-02-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作