five

Performance and Generalizability Impacts of Incorporating Geolocation into Deep Learning for Dynamic PM2.5 Estimation

收藏
Figshare2025-05-23 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Performance_and_Generalizability_Impacts_of_Incorporating_Geolocation_into_Deep_Learning_for_Dynamic_PM2_5_Estimation/29139464/1
下载链接
链接失效反馈
官方服务:
资源简介:
Deep learning models have demonstrated success in geospatial applications, yet quantifying the role of geolocation information in enhancing model performance and geographic generalizability remains underexplored. A new generation of location encoders have emerged with the goal of capturing attributes present at any given location for downstream use in predictive modeling. Being a nascent area of research, their evaluation has remained largely limited to static tasks such as species distributions or average temperature mapping. In this paper, we discuss and quantify the impact of incorporating geolocation into deep learning for a real-world application domain that is characteristically dynamic (with fast temporal change) and spatially heterogeneous at high resolutions: estimating surface-level daily PM<sub>2.5</sub> levels using remotely sensed and ground-level data. We build on a recently published deep learning-based PM<sub>2.5</sub> estimation model that achieves state-of-the-art performance on data observed in the contiguous United States. We examine three approaches for incorporating geolocation: excluding geolocation as a baseline, using raw geographic coordinates, and leveraging pretrained location encoders. We evaluate each approach under within-region (WR) and out-of-region (OoR) evaluation scenarios. Aggregate performance metrics indicate that while naïve incorporation of raw geographic coordinates improves within-region performance by retaining the interpolative value of geographic location, it can hinder generalizability across regions. In contrast, pretrained location encoders like GeoCLIP enhance predictive performance and geographic generalizability for both WR and OoR scenarios. However, our qualitative analysis reveals artifact patterns caused by high-degree basis functions and sparse upstream samples in certain areas, and our ablation results indicate varying performance among location encoders such as SatCLIP vs. GeoCLIP. To the best of our knowledge, this is a first integration and systematic evaluation of location encoders in a complex, temporally dynamic estimation scenario. In addition to guiding better model development for air pollution estimation and location encoders, this study provides insights for effective incorporation of location into deep learning for geospatial predictive tasks.
提供机构:
Wang, Zhongying; Crooks, James L.; Karimzadeh, Morteza
创建时间:
2025-05-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作