five

San Francisco Bay Area Coastal Flood Prediction Dataset for Deep Learning

收藏
DataONE2026-01-04 更新2026-01-24 收录
下载链接:
https://search.dataone.org/view/sha256:a52b2cdff13a310e953073dec61b976ec05fb72058ad13f52c2ed95d5f9d2ce3
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset provides a comprehensive collection of coastal flood simulation outputs for the San Francisco (SF) Bay Area, designed for the development and evaluation of deep learning models for climate adaptation and flood risk management. The data was generated as part of the research presented in the paper \"Climate Adaptation-Aware Flood Prediction for Coastal Cities Using Deep Learning\" to train and validate the CASPIAN-v2 model. The flood scenarios were simulated using the high-fidelity hydrodynamic model Delft3D. The dataset covers various hypothetical future scenarios, incorporating different Sea-Level Rise (SLR) projections and numerous shoreline protection strategies. The SF Bay Area coastline is segmented into 30 distinct Operational Landscape Units (OLUs), allowing for detailed and varied protection configurations. The dataset is organized into three types, each serving a specific purpose in the machine learning workflow: Main Set: This set contains the primary training dataset. It consists of 285 unique shoreline protection scenarios under a 1.0 m SLR condition. This set is intended for training the core model. Holdout Set: This set is for validation and testing, containing 46 challenging shoreline protection scenarios that were not seen during training, also under the 1.0 m SLR condition. It is designed to evaluate the model's performance on complex, unseen configurations within the same climate scenario. Generalizability Set: This set is curated to test the model's ability to generalize to different climate conditions. It includes 32 unique shoreline protection scenarios for a 0.5 m SLR level and another 32 scenarios for a 1.5 m SLR level. File Information and Format All data files are provided in the NumPy (.npy) format with a spatial resolution of 1024x1024 pixels. The raw simulation data (originally in CSV format) has been mapped onto this standardized grid. The filenames follow a systematic convention that encodes the simulation scenario and the dataset split: {binary_string}_{slr_level}m.{type}_{dataset}.npy {binary_string}: A 30-digit binary string representing the protection status of the 30 OLUs. A 1 indicates the OLU is protected (e.g., by a seawall), and a 0 indicates it is unprotected. {slr_level}: The sea-level rise used for the simulation (e.g., 0.5, 1.0, 1.5). {type}: Indicates whether the file is an input or output for the model. input: The input matrix for the deep learning model. Each grid cell value represents the classification of that point based on its proximity to the nearest protected (0.5) or unprotected (1.0) shoreline. output: The corresponding ground truth flood map. Each cell contains the Peak Water Level (PWL) in meters, representing the maximum flood depth at that location for the given scenario. Negative PWL values from the simulation have been set to zero. {dataset}: Indicates which experimental subset the file belongs to: Main: Primary training set containing 285 scenarios under a 1.0 m SLR condition. Holdout: Validation set of 46 challenging, unseen scenarios under a 1.0 m SLR condition. Generalizability: Test set for out-of-distribution climate conditions, comprising scenarios under 0.5 m and 1.5 m SLR levels. Usage This dataset is primarily intended for researchers and practitioners in machine learning, hydrology, and climate science to develop and benchmark predictive models for coastal flooding. A Python script VisualizeNPY.py for visualizing the .npy files is also included to facilitate data exploration. Related Publication For a complete understanding of the data generation methodology, model architecture (CASPIAN-v2), and experimental results, please refer to the following publication: Hassan, B., Karapetyan, A., Chow, A. C. H., and Madanat, S.: Climate Adaptation-Aware Flood Prediction for Coastal Cities Using Deep Learning, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2025-838, 2025.
创建时间:
2026-01-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作