Introducing Spatial Heterogeneity via Regionalization Methods in Machine Learning Models for Geographical Prediction: A Spatially Conscious Paradigm
- Autor(en)
- Lukas Bögl, Ourania Kounadi
- Abstrakt
This study addresses the challenge of incorporating spatial heterogeneity in predictive modeling by introducing regionalization methods in the preprocessing step of the modeling workflow. Spatial heterogeneity, where the mean of attribute values varies across spatial units, poses difficulties for traditional models. To tackle this, we propose a novel approach called Regionalization Random Forest (RegRF), which combines Random Forest with regionalization techniques to enhance predictive performance. Regionalization combines multiple spatial objects into homogeneous regions, which are incorporated into predictive models, allowing models to capture local variations. This research investigates three key questions: (1) How does the predictive performance of RegRF vary when constructed using different regionalization techniques? (2) How does RegRF compare to benchmark methods, including both spatial statistical approaches and spatially conscious machine learning models like Geographically Weighted Random Forest (GW-RF)? Five regionalization methods—WARD, AZP, Kmeans, SKATER, and Max-p—are tested on datasets of varying sizes. Results show that RegRF significantly improves performance over "non-spatial" Random Forest models with minimal additional computation time. While RegRF performs competitively with Geographically Weighted Regression, it requires much less computational effort. GW-RF was not outperformed on smaller datasets but failed to complete for larger datasets. These findings suggest that RegRF can enhance machine learning models by accounting for spatial phenomena, with potential for further optimization.
- Organisation(en)
- Institut für Geographie und Regionalforschung
- Journal
- Europan Journal of Geography
- Band
- 15
- Seiten
- 244-255
- Anzahl der Seiten
- 12
- ISSN
- 1792-1341
- DOI
- https://doi.org/10.48088/ejg.l.boe.15.4.244.255
- Publikationsdatum
- 10-2024
- Peer-reviewed
- Ja
- ÖFOS 2012
- 507003 Geoinformatik, 507001 Angewandte Geographie, 102019 Machine Learning, 102035 Data Science
- Schlagwörter
- ASJC Scopus Sachgebiete
- Demography, Geography, Planning and Development, Cultural Studies, Urban Studies
- Link zum Portal
- https://ucrisportal.univie.ac.at/de/publications/de392c13-04d8-4dc0-87e4-b2a17458bb39