JavaScript is Disabled
Your browser's JavaScript functionality is disabled. It has to be enabled to use this function of ConfTool.
Here you can find information on how to enable JavaScript
If you have any problems, please contact the organizers at isprs2026@icsevents.com.

Conference Agenda

Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).

Agenda Overview

Date: Saturday, 04-July-2026

8:30am - 5:00pm

TuT1: 3D Reconstruction from Multi-View Satellite Imagery: From Classic to Modern Methods
Location: 713A

8:30am - 5:00pm

TuT2: A Full Immersion in 3D Underwater Mapping
Location: 713B

8:30am - 5:00pm

TuT3: Geospatial Deep Learning in Practice
Location: 714A

8:30am - 5:00pm

TuT4: Hybrid and Precise Camera Pose Estimation in MicMacV2
Location: 714B

8:30am - 5:00pm

TuT5: InSAR Time Series Analysis with SARvey and InSAR Explorer
Location: 715A

8:30am - 5:00pm

TuT6: Open Point-to-point Correspondences for Loose or Tight Integration in Kinematic Laser Scanning
Location: 715B

8:30am - 5:00pm

TuT8: Quantum Computing for Earth Observation
Location: 716A

8:30am - 5:00pm

TuT9: Open Web-GIS for Disaster Response and Campus Routing: From Architecture to Deployment
Location: 716B

Date: Sunday, 05-July-2026

8:30am - 12:00pm

TuT10: Open-source Scientific Software py4dgeo for Change Analysis in 3D/4D Point Clouds
Location: 713A

8:30am - 12:00pm

TuT11: Photogrammetric Mapping by Drones: Theory and Practice
Location: 713B

8:30am - 12:00pm

TuT12: Advanced Topographic Time Series Data Management Using the Topo4d Extension of the Spatiotemporal Asset Catalog (STAC) for Curation, Analysis, and Visualization of 4D Point Clouds
Location: 714A

8:30am - 12:00pm

TuT13: Digital Twinning with UAV and Backpack Mobile Mapping Systems
Location: 714B

8:30am - 12:00pm

TuT14: FastFlood: Rapidly Using Earth Observation Data for Flood Forecasts
Location: 715A

8:30am - 12:00pm

TuT15: Getting Started with CNES Open-Source 3D Tools in Python
Location: 715B

8:30am - 12:00pm

TuT16: Metrics That Make a Difference: How to analyze change and error
Location: 716A

8:30am - 12:00pm

TuT17: Towards Geospatial Embeddings: Investigating Accurate and Accessible Deep Geospatial Feature Representations
Location: 716B

8:30am - 12:00pm

TuT18: Urban Scene Modeling
Location: 717A

8:30am - 3:00pm

General Assembly 1
Location: 701A

12:00pm - 1:15pm

ICWG I/IV: Robotics for Mapping and Machine Intelligence
Location: 713A

12:00pm - 12:15pm

A category-specific prompt strategy for semantic 3D indoor mapping using RGB-D camera

Jiwei Hou¹, Vivien Volland², Samer Karam¹, Dorota Iwaszczuk¹

¹Remote Sensing and Image Analysis, Department of Civil and Environmental Engineering, Technical University of Darmstadt, Germany; ²Geodetic Measurement Systems and Sensor Technology, Department of Civil and Environmental Engineering, Technical University of Darmstadt, Germany

Semantic 3D indoor mapping often depends on supervised learning and large annotated datasets, limiting scalability across diverse environments. This work introduces a category-specific prompt strategy for semantic 3D mapping using RGB-D cameras, integrating RGB-D SLAM with the Segment Anything Model 2 (SAM2) to enable annotation-efficient reconstruction. Keyframes and trajectories extracted from SLAM provide spatial references, while SAM2 performs zero-shot segmentation guided by a Category-Specific Prompt Strategy (CPSS), which segments structural and functional elements (e.g., floors, doors, staircases) by category to reduce prompt interference and manual effort. The segmented keyframes are then fused with depth and pose data to produce instance-level semantic point clouds. Experiments on custom RGB-D sequences and selected ScanNet scenes demonstrate centimeter-level geometric accuracy and strong semantic consistency, with mIoU values up to 0.89 on the custom dataset and 0.98 on ScanNet. The resulting semantic point clouds are clean, structured, and require minimal post-processing, showing that the proposed strategy provides an efficient and scalable solution for semantic 3D indoor mapping without retraining or environment-specific supervision.

12:15pm - 12:30pm

3L-Planner: Lightweight LiDAR mapping and real-time local planning for ground robot autonomous navigation

Wenlei Fan, Zhenqi Zheng, Mingyang Zhou, You Li

State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, China

Mobile robots are widely used in unmanned surveying, warehouse logistics, and emergency response. However, achieving safe, reliable, and efficient autonomous navigation in unknown environments remains challenging, where accurate environment representation and feasible trajectory planning are crucial. This paper presents an autonomous navigation method integrating lightweight LiDAR mapping with real-time local planning for ground robots. At the perception level, an incremental single-frame point cloud update is used to accumulate and project locally traversable space, producing a lightweight obstacle map that preserves geometric accuracy while reducing planning computation. At the planning level, A* is employed to generate reference control points, and uniform B-spline curves are used to optimize the trajectory while enforcing kinematic feasibility and smoothness. At the control level, nonlinear model predictive control (NMPC) ensures accurate trajectory tracking by producing control commands that satisfy velocity and acceleration constraints. The framework also supports low-cost evaluation in simulation. Experiments in simulated forests, simulated indoor corridors, and real-world gardens and hallways show average navigation speeds of 2.24 m/s, 0.76 m/s, 0.43 m/s, and 0.38 m/s, respectively. Results demonstrate that the proposed method generates smooth, feasible, and safe trajectories and completes autonomous navigation and mapping tasks across diverse environments.

12:30pm - 12:45pm

CMCL-PR: Cross-Modal Camera-to-LiDAR Place Recognition with Cross-Attention Contrastive Learning

Qipeng Li, Yuan Zhuang

Wuhan University, China, People's Republic of

Place recognition is a crucial task for both robots and autonomous vehicles, facilitating positioning and loop closure within pre-built global maps. Although single-modal sensor-based methods have shown satisfactory performance, cross-modal place recognition—retrieving low-cost camera images from global point cloud databases—remains a significant challenge. In this paper, we propose a contrastive learning-based lightweight cross-modal place recognition framework (CMCL-PR) to retrieve a single image from a global offline point cloud map. We introduce a perspective projection based field-of-view(FoV) transformation module that converts point clouds into a modality analogous to images; Then, we design a dual branch intra-modal encoder structure based on shared Transformer, which extracts and aligns image and point cloud features separately, effectively unifying the feature distribution between modals; Besides, a cross-attention mechanism module guided by inter-modal consistency was constructed, which utilizes the contribution of scene context information within different modalities to generate the discriminating cross-modal descriptors. Finally, during the contrastive learning, cross-modal feature was enhanced, and a multi loss function was constructed, including cross-modal contrastive learning loss, intra-modal consistency loss, and matching supervision loss. We assess the effectiveness and generalizability of our method using three publicly available datasets: KITTI, KITTI-360, and Oxford RobotCar. The project page and code will be released at https://github.com/qp-li/CMCL-PR.

12:00pm - 1:15pm

WG II/7A: Underwater Data Acquisition and Processing
Location: 713B

12:00pm - 12:15pm

Explicit vs implicit Modelling of Refraction in underwater Structure-from-Motion – A practical Guide

Fabio Menna¹, Robin Rofallski², Erica Nocerino³

¹Department of Chemical, Physical, Mathematical and Natural Sciences, University of Sassari, Sassari, Italy; ²Institute for Applied Photogrammetry and Geoinformatics, Jade University of Applied Sciences, Oldenburg, Germany; ³Department of Humanities and Social Sciences, University of Sassari, Sassari, Italy

The presence of refraction-induced systematic errors has always been a cause of concern in the field of underwater photogrammetry.

This work extends previous studies from the authors with new simulations specifically aimed at practical applications underwater using popular sensor devices and configurations, such as GoPro action cameras fitted with standard flat port housing that are very common among marine ecologists and archaeologists.

We aim at investigating whether approaches used by regular photogrammetry above water can be applied underwater without a significant accuracy loss for the application of interest.

Due to the complexity of collecting ground truth data, simulations are used. We utilize the POSER framework (https://github.com/GEOSS-UNISS/POSER) developed within the 2024 ISPRS Education and Capacity Building Initiatives (ECBI).

We investigate the benefits and cons of the refractive vs the implicit modelling approaches with respect to estimability of camera calibration (refractive) parameters, need for pre-calibration setups with approaches from literature, availability of ground control points, and assessing the accuracy of both approaches against ground truth simulated data.

The accuracy is reported as discrepancies in the reconstructed 3D models, exterior orientation and camera calibration parameters.

12:15pm - 12:30pm

Investigating the Potential of SfM, MVS, and Monocular Depth Estimation for Water Surface Reconstruction

Anatol Günthner¹, Markus Brezovsky², Frederik Schulte³, Lukas Winiwarter³, Gottfried Mandlburger², Boris Jutzi¹

¹Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany; ²Department of Geodesy and Geoinformation, TU Wien, Vienna, Austria; ³Unit of Geometry and Surveying, University of Innsbruck, Innsbruck, Austria

Reconstructing the water surface in refractive domains such as rivers and lakes is challenging, since light bending at the air-water interface alters the apparent geometry and breaks the straight-ray assumption of conventional image-based 3D reconstruction. Accurate water surface models are therefore a key prerequisite for many refraction-aware applications. This contribution investigates the potential of three passive image-based methods, Structure from Motion (SfM), Multi-View Stereo (MVS), and Monocular Depth Estimation (MDE), to derive a geometrically consistent water surface model from UAV imagery of the Pielach River study site in Austria. The dataset represents a demanding scenario with clear, fast-ﬂowing water and low texture, which causes strong refraction and poor feature stability. Quantitative comparisons against LiDAR-derived reference surfaces show that SfM yields sparse and inconsistent points, MVS reconstructs the riverbed instead of the water surface, and MDE exhibits scale and offset inconsistencies despite explicit calibration using SfM reprojections. Completeness remains below 45 % for all methods with mean vertical deviations in the decimetre-to-metre range. The results indicate that current image-based approaches are insufﬁcient for reliable water-surface reconstruction in such settings, reinforcing the need for an explicitly derived surface model as input to refraction-aware modeling, for example in bathymetric reconstruction and future refractive neural rendering methods, rather than relying on implicitly learned water surfaces.

12:30pm - 12:45pm

Complementary Usage of RTI and SfM-MVS for Inspecting Reflective Weld Seams under Water

Oliver Kahmen, Thomas Luhmann

Jade University of Applied Sciences, Institute for Applied Photogrammetry and Geoinformatics (IAPG), Ofener Str. 16/19, 26121 Oldenburg, Germany

This article studies the complementary usage of RTI and SfM-MVS visual inspection (visual testing) of welds under water. Two compact low-cost lighting domes of different designs were developed and deployed with a monochromatic camera at close range.

The lighting domes generate homogenous and direct illumination, respectively, tailoring the requirements of SfM and RTI. The camera is housed in a cylindrical tube, equipped with a dome port interface.

The 3D reconstruction in combination with RTI models could augment existing testing strategies and provide digital, gapless documentation. Experiments were conducted in laboratory in air, clear and turbid water questioning were the capabilities and limits are for given setup with respect to visual testing of welds.

Under the correct lightning, in air the techniques perform on a high accuracy level and are well suited for inspecting welds digital and interactively. Underwater the results differ in dependence of the degree of turbidity and prove to be sensitive for configurational parameters leaving space for improvements of acquisition and processing workflows. However, even in turbid water the 3D reconstructions and RTI models could be calculated enabling novel possibilities for weld inspection.

12:00pm - 1:15pm

WG II/8A: Environmental & Infrastructure Monitoring
Location: 714A

12:00pm - 12:15pm

Detection of hygroscopic dead tree branch movement using permanent laser scanning

Vanessa Guthier^1,2, Hannah Weiser^1,2, Ronald Tabernig^1,2, Mariana Campos³, Eetu Puttonen³, Bernhard Höfle^1,2

¹3DGeo Research Group, Institute of Geography, Heidelberg University, Germany; ²Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Germany; ³Department of Remote Sensing and Photogrammetry, Finnish Geospatial Research Institute, National Land Survey of Finland, Espoo, Finland

We present the first quantitative evidence of systematic daily dead tree branch movement under field conditions, which has been previously only observed qualitatively. Using a boreal forest Permanent Laser Scanning (PLS) dataset, we investigated whether such movement can be detected and characterized in 3D laser scans. We link branch anatomy and geometry with environmental drivers, leading to the hypothesis that branch bending is proportional to wood moisture content. Hourly point clouds collected over 3.5 days from 17 dead branches (16 coniferous), attached mainly to living trees, were analyzed under calm weather conditions. We developed a novel workflow that tracks movement via non-rigid registration, calculates the angle between each branch and the vertical, and quantifies branch movement uncertainties across epochs. After accounting for time lags, these movements were related to the modeled wood moisture content using a linear mixed model. Clear and consistent daily movement was detected in all branches, with a mean amplitude of 21 cm and an average delay of 3.5 hours relative to moisture content changes. All branches moved downward during the day and upward at night, except one deciduous branch displaying the opposite pattern, consistent with our theoretical framework. The linear mixed model revealed a significant positive linear relationship between branch movement and wood moisture content. Our findings confirm that daily dead tree branch movement is primarily hygroscopic and demonstrate its effective detection using operational PLS. These insights open new possibilities for monitoring tree vitality using hypertemporal 3D sensing.

12:15pm - 12:30pm

3D Reconstruction of deciduous Trees using low-cost UAV- and Crane-based Photogrammetry for Monitoring Shoot Elongation across entire Canopies

Stephan Nebiker¹, Micha Tschanz¹, Nando Amport¹, Frederik Baumgarten²

¹FHNW University of Applied Sciences and Arts Northwestern Switzerland, Switzerland; ²University of Basel

Tree growth determines how much CO2 is sequestered from the atmosphere and temporarily stored in woody biomass. At the same time tree growth is affected by increasing temperatures, more frequent drought periods, late frosts and other extreme events associated with climate change. While continuous measurements of radial (secondary) tree growth using dendrometers are well established, monitoring of shoot elongation (primary growth) has largely been neglected because suitable measurement techniques are lacking. As a result, the effects of climate change on primary tree growth remain insufficiently understood. This work aims at reconstructing native deciduous trees in 3D as a basis for measuring and monitoring shoot elongation over entire tree canopies. Here we explored the use of low-cost UAV photogrammetry and of a multi-camera CraneCam system under real-world conditions. Data were collected in two study areas over an entire growing season. We present sensor evaluations, photogrammetric data acquisition and processing strategies. A special focus is placed on the analysis of the resulting photogrammetric 3D point clouds in terms of accuracy, resolution and completeness. Results demonstrate 3D point accuracies of 5-6 mm for entire trees using consumer-grade UAVs weighing less than 250 g and a 3D reconstruction completeness between 92% and 98% depending on the UAV type. The paper introduces a novel 3D-printed ground-truth branch to evaluate the capability to reconstructing fine-detail structures such as thin tree shoots. Finally, we discuss operational challenges and initial experiments towards a skeletonization of entire trees based on photogrammetric point clouds.

12:30pm - 12:45pm

In-Situ Gaussian Splatting-generated 3D Thermal Mesh Visualization for Urban Trees in Augmented Reality

Chaimaa Delasse^1,2, Dario Billi^2,3, I Gede Mahendra Darmawiguna⁴, Kadek Ananta Satriadi⁴, Arnadi Murtiyoso², Hélène Macher², Vincent Lecomte², Rafika Hajji¹, Tania Landes²

¹College of Geomatic Sciences and Surveying Engineering, Institute of Agronomy and Veterinary Medicine (IAV), Rabat 6202, Morocco; ²University of Strasbourg, INSA Strasbourg, CNRS, ICube UMR 7357 Laboratory, 67000 Strasbourg, France; ³Department of Civil and Industrial Engineering, ASTRO Laboratory, University of Pisa, Largo Lucio Lazzarino, 56122 Pisa, Italy; ⁴Faculty of Information Technology, Monash University, Australia

Urban trees provide critical cooling benefits and regulate microclimates in cities, yet their three-dimensional thermal behaviour remains difficult to visualize and communicate to stakeholders. Traditional approaches rely on 2D thermal imagery analyzed on desktop screens, lacking spatial context and the capability for temporal monitoring across phenological stages. This work presents a pipeline combining Gaussian Splatting with augmented reality to enable immersive, on-site visualization of urban tree thermal patterns.

We captured thermal infrared (TIR) images of silver linden trees (Tilia tomentosa) at the University of Strasbourg using a FLIR T560 thermal camera. The TIR images were preprocessed and then processed using MILo (Mesh-In-the-Loop Gaussian Splatting), generating 3D thermal meshes with approximately 3 million vertices. The reconstruction is validated against reference thermal point clouds acquired with a terrestrial laser scanner equipped with an integrated thermal camera, assessing both geometric completeness and thermal attribute fidelity.

The thermal meshes are deployed in a mobile augmented reality application, allowing users to visualize temperature distributions directly overlaid on physical trees in the field. This work demonstrates the first application of 3D Gaussian Splatting to thermal vegetation modelling, providing an engaging educational tool to communicate urban trees' cooling role while offering researchers a platform for detailed thermal analysis and forest health monitoring.

12:45pm - 1:00pm

Challenges in automated 4D Point Cloud Generation for Glacier Calving Monitoring at high temporal Resolution

Laura Camila Duran Vergara¹, Xabier Blanch Gorriz², Bindusara Nagathihalli Lokesh¹, Anette Eltner¹

¹Technische Universität Dresden, Germany; ²Universitat Politècnica de Catalunya, Spain

To robustly support glacier calving monitoring at high temporal resolution and enable future AI-based calving forecasts, this study presents an optimized Multi-Epoch Multi-Imagery (MEMI) strategy for automated 4D point cloud model generation. To date, the dataset comprises over 160,000 images acquired since December 2024 by an autonomous multi-camera system operating at 30 min intervals at Glacier Perito Moreno (GPM), Argentina. Despite high scene variability and harsh environmental conditions, the proposed MEMI workflow effectively addresses constraints imposed by continuous glacier motion and image degradation. The enhanced strategy aims to generate precise dense clouds with high alignment accuracy and computational efficiency, forming the basis for subsequent analysis of glacier front evolution. To achieve this, various parameter configurations are evaluated, including AI-based image masking and adaptive, optimized alignment-adjustment settings. Results from a representative eight-day subset show that variations in the tie point computation strategy lead to measurable differences in alignment-adjustment efficiency, with the best configuration being about 11 % faster than the least efficient one. By contrast, adaptive alignment-adjustment consistently improves alignment accuracy. Moreover, masking enhances both image quality checking and reconstruction quality, and, albeit modestly, improves pre-failure deformation analysis. Furthermore, daily seasonal responses to alignment are observed, as accuracy varies with solar illumination relative to the camera positions. Applying the optimal configuration to 260 MEMI projects in under 42 h produced 518 high-precision dense clouds and detected calving retreat magnitudes of up to 17.5 m, demonstrating the robustness and scalability of the proposed MEMI strategy for high-temporal-resolution 4D point cloud generation.

12:00pm - 1:15pm

WG III/7D: Remote Sensing of the Hydrosphere and Cryosphere
Location: 714B

12:00pm - 12:15pm

Machine Learning-based Retrieval of Turbidity in Gorgan Bay, Southeastern Caspian Sea, using Sentinel-2 Multispectral Imagery

Salar Khani, Ata A. Kakroodi, Sara Attarchi

University of Tehran, Iran, Islamic Republic of

Gorgan Bay in the southeast of the Caspian Sea faces significant issues with water volume reduction and water quality deterioration. The turbidity levels of this water body have increased recently owing to the ongoing decline in the Caspian Sea level and the increase in human activity. In this study, to monitor water quality of the bay, various machine learning models were used to retrieve turbidity levels from Sentinel-2 satellite imagery. In situ turbidity measurements acquired throughout the bay were correlated with Sentinel-2 reflectance data. A statistical evaluation was conducted to ascertain the prospective band combinations for estimating turbidity. Four regression methods, including Multiple Linear Regression (MLR), Random Forest (RF), Classification and Regression Trees (CART), and Gradient Tree Boost (GTB), were implemented to estimate turbidity levels using six different input scenarios. These models were tested on unseen test data, and it was found that the CART model with RMSE = 7.89 FTU, R² = 0.93, and Nash-Sutcliffe efficiency (NSE) = 0.74 exhibited superior performance. The generated turbidity maps across the bay showed sediment plumes next to southeastern river mouths, indicating increased turbidity levels in these areas compared to the rest of the bay, revealing intra-bay variability due to tidal and discharge dynamics. The applied methodology demonstrated superior performance compared to conventional empirical models in turbid coastal environments. The results indicated that the machine learning approaches coupled with satellite data provides water resource managers with a cost-effective and real-time tool for coastal water quality monitoring.

12:15pm - 12:30pm

Use of Remote Sensing and In Situ Monitoring to Evaluate Turbidity in an Open-Pit Mining Lake

Luisa da Cunha Vieira¹, Danilo Marques de Magalhães², Hugo de Oliveira Fagundes³

¹Geotechnical Project Management, BVP Geotecnia e Hidrotecnia, Belo Horizonte, Brazil; ²Dept. of Geography and Environmental Planning, São Paulo State University, Rio Claro, Brazil; ³Water Resources Department, Campinas State University, Campinas, Brazil

The formation of pit lakes in decommissioned open-pit mines has raised concerns regarding long-term water quality. Turbidity, a key indicator of suspended particulate matter, influences water clarity and aquatic ecological processes. This study estimates surface turbidity in the Águas Claras Mine (MAC) pit lake in Nova Lima, Brazil, using satellite imagery and in situ data to generate a continuous time series and assess compliance with thresholds established by current Brazilian environmental legislation (CONAMA Resolution No. 357/2005). Landsat 5 and 8 imagery were used to derive a spectral turbidity index. Based on the temporal overlap between satellite and field data, a linear regression model (R² = 0.77) was developed and applied to extend the turbidity time series. The results indicate that turbidity values remained below the legal limits for Class 1 freshwater. Higher turbidity levels were observed during the initial filling phase, associated with exposed slopes, as well as episodic increases during the rainy season due to sediment runoff. Over time, progressive revegetation and minimal anthropogenic disturbance contributed to the stabilization of water quality conditions. The integration of in situ measurements and remote sensing proved to be an effective approach for monitoring water quality in post-mining environments, supporting both environmental liability assessment and closure management.

12:30pm - 12:45pm

A Bio-Optical Model Modified for Estimating Red Tide Intensity

Wonkook Kim¹, Seungil Baek¹, Sooyoon Koh¹, Keunyong Kim²

¹Pusan National University, Korea, Republic of (South Korea); ²Korea Institute of Ocean Science and Technology

Harmful algal blooms caused by Margalefidinium polykrikoides have intensified in Korean coastal waters, yet existing bio-optical models are not able to reproduce the species-specific spectral features required for quantitative bloom assessment. This study develops a dedicated semi-analytical bio-optical model by integrating multi-year field measurements collected from six campaigns between 2013 and 2022, including hyperspectral above-water radiometry, laboratory absorption spectra, and chlorophyll-a (Chla) observations.

The model formulation follows a standard absorption–backscattering reflectance framework, in which total absorption is decomposed into water, phytoplankton, NAP, and CDOM components, while phytoplankton backscattering is parameterized using two optimized species-dependent parameters. An iterative inversion procedure identifies the optimal backscattering structure by minimizing the spectral mismatch between modeled and measured hyperspectral Rrs. In addition, an empirical red-edge term is introduced to capture the distinct fluorescence-associated peak near ~700 nm that characterizes high-biomass M. polykrikoides waters.

The resulting model accurately reconstructs observed Rrs across low to high Chla conditions, reproducing key features such as strong blue absorption, the secondary blue rebound, and the pronounced red-edge peak. Comparisons with GIOP and Karenia-based models show substantially improved performance, particularly under extreme bloom conditions.

This work provides the first validated species-specific bio-optical parameterization for M. polykrikoides and offers a practical pathway for satellite-based HAB monitoring using upcoming hyperspectral missions such as PACE and GLIMR. The framework is extendable to additional HAB species and supports future development of physics-based, species-resolved coastal water-quality retrievals.

12:45pm - 1:00pm

Comparative assessment of shallow water bathymetry derived from satellite imagery and aerial photogrammetric data in karimunjawa cays, indonesia

Wahyudi Nugraha^1,2, Gabriella Alodia³

¹Institut Teknologi Bandung, Faculty of Earth Sciences and Technology, Geodesy and Geomatics Engineering Postgraduate Programme, Bandung, Indonesia; ²Geospatial Information Agency (BIG), Cibinong, Indonesia; ³Institut Teknologi Bandung, Faculty of Earth Sciences and Technology, Hydrography Research Group, Bandung, Indonesia

Coastal zones are highly vulnerable to multiple hazards, including tsunami, shoreline erosion, coral reef degradation, and escalating impacts of sea-level rise. These concerns illustrate the urgent need for accurate and high-resolution geospatial data in coastal areas are required to support coastal risk assessment and management. A seamless and accurate coastal digital elevation model (DEM) is a foundational dataset to support these needs.

However, the development of a seamless land-sea elevation surface remains challenging. The intertidal zone often forms a critical data gap between land DEMs and bathymetry grids. To address these limitations, the use of multi-sensor geospatial data has grown considerably in coastal science and hydrography, such as structure-from-motion (SfM) photogrammetry and satellite-derived bathymetry (SDB). Assimilating SfM and SDB offers a viable pathway for constructing seamless coastal DEM.

Therefore, understanding the quality of SfM and SDB data before integration is a critical step. This study addresses this gap by evaluating the vertical accuracy, effective spatial resolution, and internal consistency of SfM-derived coastal topography and SDB-derived shallow-water bathymetry in a challenging coastal environment commonly found in Indonesian waters, i.e., coral reef islands. In this study, the area of interest is Karimunjawa and Kemujan Islands, located in the Java Sea, Indonesia, approximately 80-90 km from the mainland.

Based on our preliminary results, SfM provides high spatial detail depths but introduces short wavelength oscillation while SDB shows smoother depth gradients. In addition, several depths derived from SfM and SDB indicate different vertical reference levels.

12:00pm - 1:15pm

SpS1: Roundtable Digital Twins for Conservation of World Heritage Sites
Location: 715A

12:00pm - 12:15pm

3D Point Cloud–Based Digital Twin Reconstruction and VR Immersive Visualization of Borobudur’s Hidden Foot Reliefs

Satoshi Takatori¹, Jiajun Gao¹, Liang Li¹, - Fadjar Ibnu Thufail², - Brahmantara³, Satoshi Tanaka¹

¹Ritsumeikan University, Japan; ²National Research and Innovation Agency, Indonesia; ³Indonesian Heritage Agency, Indonesia

This presentation introduces a digital twin and immersive VR system that reconstructs the “Hidden Foot” of Borobudur Temple in Central Java, Indonesia—an encased architectural layer containing 160 narrative bas-relief panels that cannot be directly observed today. The system integrates multi-source 3D point cloud data acquired via UAV and ground-level imaging with deep-learning–based 3D reconstructions of reliefs from historical monocular photographs and semantic segmentation results from previous work.

A custom passage model simulates the narrow gap between the original relief-bearing wall and the surrounding protective masonry, allowing users wearing a head-mounted display to virtually enter and walk through this otherwise inaccessible space. Within the VR environment, users can switch between photorealistic relief rendering and color-coded semantic overlays to support both immersive appreciation and analytical interpretation. A torch-based Level of Detail (LoD) strategy, driven by dynamic illumination, maintains high visual fidelity near the user while reducing rendering load for distant geometry, ensuring stable frame rates suitable for comfortable VR exploration.

A small user study indicates high ratings for visual realism, immersion, and the educational clarity of semantic overlays and gaze-triggered text annotations, highlighting the potential of this approach for research, documentation, and public engagement with hidden cultural heritage.

12:15pm - 12:30pm

Enhancing 3D Point Cloud Visualization through Adaptive Transparency with Light Sources and Normal Vectors

Hatsuki Nojiri¹, Satoshi Takatori¹, Liang Li¹, Motoaki Adachi², R. Michael Feener³, Satoshi Tanaka¹

¹College of Information Science and Engineering, Ritsumeikan University, Japan; ²Shrewd Design Co., Ltd., Japan; ³Center for Southeast Asian Studies, Kyoto University, Japan

Three-dimensional (3D) scanning is widely used to preserve cultural heritage as large-scale point clouds. While these datasets contain rich geometric information, transparent visualization of such massive point clouds often suffers from visual clutter and reduced clarity, particularly when both external and internal structures are involved. Previous work resolved the problem of normal orientations, laying the foundation for robust shading in transparent visualization. Building on this foundation, this paper introduces a novel method of adaptive opacity control for region highlighting, which interprets shading as a distribution of opacity. By adjusting the lighting direction, effective opacity can be locally controlled without modifying the original point cloud data. This mechanism enables selective highlighting of user-specified regions, enhances the visibility of complex structures, while also allowing interactive dynamic shading by continuously changing the lighting direction. The effectiveness of the proposed method is demonstrated using culturally significant heritage point clouds, including UNESCO World Heritage sites, where intricate internal structures can be more clearly analyzed. Beyond cultural heritage, the proposed method is also applicable to modern architectural and other large-scale 3D scanned objects with similarly complex forms.

12:30pm - 12:45pm

Digital Twin in Heritage Buildings and Sites: a Comparative Literature Review of Integrated Technologies, Devices, and Applications (2020–2025)

Rana Tootoonchi, Mona Hess

University of Bamberg, Germany

The concept of Digital Twin has attracted growing interest within research communities, including heritage conservation, in recent years. It combines detailed geometric documentation, real-time monitoring, and semantic information to create dynamic digital replicas of historic buildings. This paper presents the results of a scoping review of 204 peer-reviewed studies published between 2020 and 2025. The aim is to identify the main technologies, devices, and methods used to develop a Digital Twin for heritage buildings. The review reveals that terrestrial laser scanning (TLS), UAV photogrammetry, BIM, and IoT sensor networks form the core technological base. It also highlights the growing use of artificial intelligence for automated defect detection, predictive maintenance, and semantic processing. Based on the reviewed literature, the paper introduces a six-stage workflow for building a heritage Digital Twin, covering baseline documentation, static reality capture, semantic modelling, sensor integration, data fusion, and operational use. The findings show a clear shift from static 3D documentation toward dynamic, data-rich systems that support continuous monitoring and more informed decision-making. However, the review also identifies major challenges, including limited interoperability, complex data integration, incomplete AI validation, and long-term digital preservation issues. Overall, the study outlines the current state of Digital Twin technologies in architectural heritage and identifies key areas that require further research to support reliable and sustainable applications.

12:45pm - 1:00pm

Coupling Hyperspectral and 3D Data for the preventive Conservation of Palace-museums

Julie Fromager¹, Vincent Gauthier¹, Loïc Martinez¹, Danilo Forleo², Stéphane Serfaty¹

¹SATIE UMR CNRS 8029; ²Musée national des châteaux de Versailles et de Trianon

In the current context of energy and climate transition, the preventive conservation of historic buildings is particularly important due to their impact on architecture and works of art. Establishing the correlation between environmental variables and the condition of artworks in situ requires comprehensive and individualized monitoring, allowing for an understanding of cause-and-effect mechanisms. To address this challenge, the EPICO method provides a systematic framework for assessing deterioration risks in palace-museums through multi-scale monitoring and correlation between environmental parameters and object condition. The aim of the proposed topic is to enhance this decision-making tool with the creation of digital twins. These digital twins being fed with three-dimensional hyperspectral and LiDAR mapping of spaces and objects.

12:00pm - 1:15pm

WG IV/8A: Digital Twins for Mobility and Navigation
Location: 715B

12:00pm - 12:15pm

Vision-Language Models for Urban Digital Twins

Amirhossein Nourbakhshrezaei, Saeed Abbasi, Mojgan Jadidi

Civil Engineering Department, Lassonde School of Engineering, York University, Canada

Urban digital twins are virtual city replicas that can greatly support urban planning by simulating infrastructure and mobility scenarios. However, keeping a digital twin up-to-date with fine-grained, real-world urban conditions is challenging. This paper proposes a novel system that leverages multi-modal AI models to bridge the gap between physical urban data collection and a 3D city digital twin. In our approach, ordinary smartphones carried in vehicles act as mobile sensors, continuously capturing multi-modal data (road images, GPS coordinates, and speed). Advanced vision-language models then analyze the data to automatically extract information from the traffic infrastructure and detect road anomalies. The extracted information such as the locations of traffic signs, traffic signals, road surface cracks, and potential blind spots at intersections is geo-tagged and streamed into language-vision models to interpret data and stream human readable insights into the digital twin model. The case study is the digital twin of the city of Toronto. By aggregating data from many drivers and analyzing it (in post-processing for high accuracy), the digital twin evolves into a living model of the urban environment. This enriched and dynamic twin provides urban planners with up-to-date insights on traffic signage, road conditions, and other relevant road infrastructure elements, enabling proactive maintenance and informed decision-making for city planning.

12:15pm - 12:30pm

GeoSceneGraph: Geometric Scene Graph Diffusion Model for Text-guided 3D Indoor Scene Synthesis

Antonio Ruiz^1,2, Tao Wu¹, Andrew Melnik², Qing Cheng¹, Xuqin Wang¹, Lu Liu¹, Yongliang Wang¹, Yanfeng Zhang¹, Helge Ritter²

¹Huawei Technologies; ²Center for Cognitive Interaction Technology (CITEC), Bielefeld University

Methods that synthesize indoor 3D scenes from text prompts have wide-ranging applications in film production, interior design, video games, virtual reality, and synthetic data generation for training embodied agents. Existing approaches typically either train generative models from scratch or leverage vision-language models (VLMs). While VLMs achieve strong performance, particularly for complex or open-ended prompts, smaller task-specific models remain necessary for deployment on resource-constrained devices such as extended reality (XR) glasses or mobile phones. However, many generative approaches that train from scratch overlook the inherent graph structure of indoor scenes, which can limit scene coherence and realism. Conversely, methods that incorporate scene graphs either demand a user-provided semantic graph, which is generally inconvenient and restrictive, or rely on ground-truth relationship annotations, limiting their capacity to capture more varied object interactions. To address these challenges, we introduce GeoSceneGraph, a method that synthesizes 3D scenes from text prompts by leveraging the graph structure and geometric symmetries of 3D scenes, without relying on predefined relationship classes. Despite not using ground-truth relationships, GeoSceneGraph achieves performance comparable to methods that do. Our model is built on Equivariant Graph Neural Networks (EGNNs), but existing EGNN approaches are typically limited to low-dimensional conditioning and are not designed to handle complex modalities such as text. We propose a simple and effective strategy for conditioning EGNNs on text features, and we validate our design through ablation studies.

12:30pm - 12:45pm

A Multi-Dimensional Digital Twin Framework for the Low-Altitude Economy

Yan Li, Chenming Ye, Wenxuan Shi, Wenqing Zhang, Yuyang Zhang, Teng Hu, Zhizhong Kang

China University of Geosciences (Beijing), China, People's Republic of

The Low-Altitude Economy (LAE), driven by the widespread deployment of UAVs and eVTOL aircraft, demands a high-fidelity Digital Twin that extends far beyond static geographic representation. This study presents a critical review of 39 peer-reviewed papers to propose a three-layer mapping framework — Geospatial Infrastructure Layer, Environmental Sensing Layer, and Interaction Layer — and evaluates the Technology Readiness Level (TRL) of each sub-domain. The Geospatial Infrastructure Layer encompasses terrain models, ground facilities, airspace structures, and semantic navigation landmarks. The Environmental Sensing Layer covers electromagnetic modeling, target sensing and countermeasure, and micro-meteorological mapping. The Interaction Layer addresses network trust, data security, swarm coordination, and platform reliability. Our TRL assessment reveals that Environmental Sensing is the most mature layer (mean TRL 4.4, 8 field-validated papers), while cross-layer integration remains the weakest link (mean TRL 3.5, zero field-validated demonstrations). We identify standardization of low-altitude spatial data products, AI-enabled predictive mapping, crowdsourced Digital Twin updating, and closed-loop cross-layer integration as the four priority research directions.

12:45pm - 1:00pm

Lightweight indoor Pedestrian Localization via multi-step State-extended Fusion of Wi-Fi weighted Fingerprinting and PDR

Renjie Yuan, Zhiyong Wang, Kunlin Yu, Lang Hu, Yonghan Liao, Yilang Lin, Junjie Lu

South China University of Technology, China, People's Republic of

Indoor pedestrian localization on smartphones requires a lightweight yet robust fusion framework capable of handling unstable wireless signals and drift-prone inertial motion. In this work, we propose an efficient positioning system that integrates weighted Wi-Fi fingerprinting with Pedestrian Dead Reckoning (PDR) through a Multi-Step Extended Kalman Filter (EKF). Unlike conventional single-step filtering, the EKF employed here is formulated from the perspective of factor-graph–based probabilistic estimation, where Wi-Fi observations and PDR increments naturally act as complementary measurement and motion factors. Building on this unified view, the proposed Multi-Step EKF retains several consecutive states within its estimation window, effectively approximating short-horizon smoothing while maintaining the computational footprint required for real-time execution on consumer smartphones.

To enhance observation stability, an inverse-distance weighted fingerprinting module mitigates RSSI fluctuations and gracefully handles missing values. Meanwhile, PDR inputs are refined through diagnostic analysis and incorporated as motion constraints within the fusion process. Global optimization of noise parameters is performed via dual annealing, further improving the reliability of state updates.

Experiments conducted on an open indoor dataset demonstrate that the proposed method achieves a reduction of approximately 31% in mean localization error compared with a standard single-step EKF baseline. The results confirm that enforcing short-term temporal consistency through a multi-step state representation significantly suppresses drift accumulation and enhances robustness under dynamic indoor environments. Overall, the proposed framework offers a theoretically grounded, computationally efficient, and practically deployable solution for smartphone-based indoor positioning.

12:00pm - 1:15pm

ThS26: Earth Observation Foundation Models: Scalable, Multimodal AI for Environmental Intelligence
Location: 716A

12:00pm - 12:15pm

From Orthophotos to Insights: AI-Powered Forest Monitoring for Digital Forest Twin

Nina Krueger¹, Sandra Uhlig², Taimur Khan³

¹M.O.S.S. Computer Grafik Systeme GmbH, Germany; ²Landesamt für Geobasisinformation Sachsen (GeoSN); ³Helmholtz Center for Environmental Research (UFZ)

This project, a collaboration between the Landesamt für Geobasisinformation Sachsen (GeoSN) and M.O.S.S. Computer Grafik Systeme GmbH, pioneers the development of a Digital Twin Forest prototype for Saxony. The initiative leverages high-resolution aerial orthophotos (DOP) and advanced AI methods to generate detailed, current forest information. The core methodology centers on the “DeepTrees” workflow, a convolutional neural network (CNN)–based approach developed by the Helmholtz Center for Environmental Research (UFZ). This workflow processes DOP imagery at 10–20 cm resolution to segment individual tree crowns and extract key forest indicators, including crown area, crown radius, and tree density.

The process unfolds in three main stages: (1) preprocessing and model adaptation using transfer learning, (2) inference and postprocessing for accurate tree segmentation, and (3) integration into GeoSN’s data infrastructure. This integration utilizes OGC-compliant services and moGI-based data management, enabling automated processing, configuration, and visualization.

Results from the prototype confirm the feasibility of precise, large-scale tree crown segmentation from aerial imagery. The system also demonstrates the potential to derive temporal and structural forest information from recurring DOP datasets. These outputs can be directly incorporated into operational geospatial systems, supporting climate adaptation, forest management, and policy-making.

In conclusion, the Project showcases how explainable, interoperable AI workflows can strengthen national geodata infrastructures and serve as a model for future federated, AI-driven digital forest twins across Germany.

12:15pm - 12:30pm

Scalable Framework for Peatland Aboveground Biomass Mapping Using Multi-source Satellite Data and Machine Learning

Mohammadali Hemati¹, Masoud Mahdianpari^1,2, Hodjat Shiri³, Fariba Mohammadimanesh⁴

¹Department of Electrical and Computer Engineering, Faculty of Engineering and Applied Sciences, Memorial University of Newfoundland; ²C-CORE; ³Civil Engineering Department, Faculty of Engineering and Applied Sciences, Memorial University of Newfoundland; ⁴Canada Centre for Remote Sensing, Natural Resources Canada

This study presents a scalable framework for mapping aboveground biomass and moisture content in peatlands using intensive field sampling, multi-sensor satellite imagery, and advanced machine learning. Field data collected from diverse bog and fen sites in Western Newfoundland are integrated with Sentinel-1/-2 synthetic aperture radar and optical data, complemented by 3 m PlanetScope imagery for site-level detail. Ensemble learning models, particularly XGBoost, yield high biomass mapping accuracy, with regional maps revealing major biogeographical gradients and fine-scale site mosaics. Feature importance analysis highlights the role of red-edge and SAR bands in prediction. The results demonstrate that free satellite archives and machine learning can overcome limitations of costly airborne campaigns, supporting operational carbon monitoring and ecological management in northern peatlands. This approach establishes a foundation for wide-area wetland monitoring and future expansion using emerging remote sensing technologies.

12:30pm - 12:45pm

A self-supervised method for soil moisture estimation using multisensor data over forests

Rouhollah Esmaeilisarteshniz¹, Ramata Magagi¹, Samuel Foucher¹, Aaron Berg², Andreas Colliander³

¹Centre d'applications et de recherches en télédétection (CARTEL), Université de Sherbrooke, Québec, Canada; ²Department of Geography, Environment and Geomatics, University of Guelph, Ontario, Canada; ³Finnish Meteorological Institute, Helsinki 00560, Finland

Surface soil moisture (SM) plays a significant role in environmental and hydrological processes, particularly runoff and evapotranspiration. Within forest ecosystems, changes in SM can lead to significant ecological impacts, including paludification and greater susceptibility to forest fires. Microwave remote sensing facilitates large-scale monitoring of SM. Moreover, machine learning (ML) have demonstrated strong potential for capturing the nonlinear relationships between SM and satellite data. In general, supervised ML techniques achieve higher success rates when trained on larger ground measurements. However, obtaining extensive ground measurements of SM over vast areas such as forests is challenging, expensive, and time-consuming. To address this limitation, this study proposes a self-supervised method based on pre-task learning to estimate SM over forested areas using multisensor data. The core idea of the self-supervised approach is to leverage the knowledge gained during pre-task learning from multisensor data and transfer it to the SM estimation task, thereby improving the model’s generalization ability to SM estimation. The self-supervised learning method achieved an overall coefficient of determination (r²) of 0.74 and an RMSE of 0.04 m³/m³ on the testing dataset By focusing on each forest site, the model obtained r² = 0.75 with RMSE = 0.04 m³/m³ at Millbrook, r² = 0.63 with RMSE = 0.04 m³/m³ at Massachusetts, and r² = 0.74 with RMSE = 0.03 m³/m³ at Saskatchewan. The results highlight the potential of multisensor data for SM estimation in forested areas. Our method, which utilizes self-training on the input data, reduces dependence on ground SM measurements and enhances generalization capability.

12:45pm - 1:00pm

Zero-shot multi-class semantic segmentation of remote sensing images using SAM 2 with prior database information

Paula Lilian Lippmann, Mareike Dorozynski, Franz Rottensteiner, Christian Heipke

Institute of Photogrammetry and GeoInformation - Leibniz University Hannover, Germany

Land cover data need to be updated regularly. Typically, remote sensing images (RSI) play a central role in this process. A first step is RSI semantic segmentation. Today, this task is mainly solved by deep learning. Especially vision foundation models (VFM) have gained increasing importance in this context. Having been trained on large datasets, VFM for segmentation can yield good results on data from various domains without further training. We present a new method for using the VFM Segment Anything Model 2 (SAM 2) for multi-class semantic segmentation of Sentinel-2 images that does not require training data. Our method is based on a prompt engineering approach, using SAM 2 in its pre-trained form. The different prompt types are generated on the basis of existing topographic data. We also propose a post-processing step for merging the output of SAM 2 to obtain a multi-class label image. The results of our experiments show that our method achieves an overall accuracy (OA) of up to 93% at pixel-level using mask prompts. Experiments with other Sentinel-2 3-channel composite images do not show significantly different results compared to R-G-B images. Incorporating data from different time steps, intended to be used for map updating, shows good results. But the small amount of changed areas indicate limitations. In general, the proposed method is suitable for further

research into semantic segmentation tasks with little or no training data, as well as for the process of updating databases.

12:00pm - 1:15pm

ThS10: Resilient Localization, Mapping, and Perception in Adverse Conditions using Modern Civilian Radars
Location: 716B

12:00pm - 12:15pm

Radar-centric sensor fusion for robust indoor SLAM in complex environments

Binliang Wang, Jianzhu Huai, Yulong Han

Wuhan University, China, People's Republic of

This paper presents a radar-centric multi-sensor fusion framework, RLIO, designed for robust indoor SLAM in perceptually challenging environments such as underground garages and smoke-filled areas. Unlike conventional LiDAR-based methods that degrade under poor visibility, RLIO tightly integrates 4D imaging radar, 3D LiDAR, and an IMU within an iterated extended Kalman filter. The system introduces three key modules: a motion-prior-driven radar velocimetry algorithm for stable velocity estimation, a velocity-prior-enhanced scan-to-map registration for drift reduction in degenerate geometries, and an adaptive fusion strategy that dynamically adjusts sensor weights based on real-time degradation detection. Experimental results from both handheld and UGV platforms demonstrate that RLIO achieves accurate localization and high-quality mapping even when LiDAR performance deteriorates due to smoke or repetitive structures, highlighting its potential for reliable all-weather autonomous navigation and mapping in complex indoor and outdoor environments.

12:15pm - 12:30pm

RAMBA: 4D radar mapping by bundle adjustment

Jianzhu Huai, Yiwen Chen, Binliang Wang

Wuhan University, China, People's Republic of

4D radars have attracted increasing interest for robotic perception because they remain effective in adverse conditions such as darkness, dust, smoke, rain, and fog. Compared with conventional automotive radars that mainly provide planar coordinates and relative Doppler velocity, modern 4D radars also sense elevation, which makes them more suitable for geometric odometry and mapping.

In this paper, we propose RAMBA, an offline 4D radar mapping framework based on bundle adjustment. Starting from initial poses and radar frames produced by a radar--inertial odometry front-end, we refine the radar frame states to improve global mapping consistency, measured by covariance-weighted point-to-point distances. In essence, our method extends pairwise generalized iterative closest point (GICP) to the multi-frame setting. Candidate correspondences are formed within voxels of a voxel grid built from all selected frames, and each residual is weighted by the sum of the two point covariances. The geometric constraints are jointly optimized with IMU preintegration and radar ego-velocity constraints.

To reduce false associations caused by drift and revisits, RAMBA enforces temporal consistency when forming correspondences and explicitly allows constraints around loop closures. We evaluate the method on the ColoRadar and SNAIL Radar datasets. The proposed refinement consistently improves map quality and usually improves trajectory accuracy over the initial radar--inertial odometry and pose graph optimization. To the best of our knowledge, this is the first geometric offline bundle-adjustment framework for consistent 4D radar mapping.

12:30pm - 12:45pm

Deep point matching for 4D radar odometry

Yi Liu¹, Jianzhu Huai^1,2, Lin Zhao¹

¹Dept. of Electrical and Computer Engineering, National University of Singapore; ²State Key Lab of Info Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 129 Luoyu Road, Hongshan District, Wuhan, Hubei

Four-dimensional (4D) imaging radar offers robustness in adverse weather and lighting, but its point clouds remain sparse, noisy, and affected by ghost reflections, making geometric scan matching unstable. This work integrates two existing deep correspondence models—Radar Transformer and RPM-Net—into a radar–inertial odometry pipeline without retraining. Both networks run asynchronously in dedicated ROS nodes: radar and submap point clouds are cropped, transformed, and sent to the matchers, which return either hard or soft correspondences for the first IEKF iteration of each frame. When neural outputs are delayed, the system automatically falls back to geometric matching.

Returned matches are fused with a voxelized IEKF backend that computes Mahalanobis-weighted residuals. RPM-Net further supplies soft targets and confidence weights, enhancing point-to-point constraints. Experiments on ColoRadar indoor and outdoor sequences show that learning-based correspondences can reduce drift in weakly structured scenes while maintaining robustness when geometry is reliable.

12:45pm - 1:00pm

LiDAR–Radar–IMU fusion for multi-robot SLAM in adverse environments

Yiwen Chen, Jianzhu Huai

Wuhan University, State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing

This paper presents MSMR SLAM, a multi sensor and multi robot SLAM framework that integrates LiDAR, 4D radar, and IMU to achieve robust localization and consistent mapping in large scale and degraded environments. The front end includes radar inertial odometry and LiDAR inertial odometry, and evaluates the reliability of LiDAR observations using point cloud sparsity and effective range. An adaptive fusion module combines the two odometry estimates to maintain stable state estimation, while radar assisted dynamic point removal improves the reliability of geometric constraints. The back end constructs a unified factor graph that incorporates multi sensor odometry constraints, loop closure factors, and inter robot association factors. A LiDAR centered and radar assisted matching strategy enhances cross robot data association, and radar based loop closures improve global consistency when LiDAR measurements degrade. The system maintains a dense LiDAR map together with a complementary radar map, enabling hybrid mapping that remains reliable in perceptually challenging regions. Experiments on campus datasets, including smoke filled scenarios, demonstrate that MSMR SLAM achieves high precision multi robot localization and globally consistent mapping. Compared with single robot baselines, the proposed framework provides improved accuracy and robustness, and the integration of LiDAR and radar yields more complete and stable map reconstruction in complex environments.

1:00pm - 1:15pm

Heuristic-Guided Extrinsic Calibration for 4D Radar-Camera Systems Using Dynamic Objects

Hang Zhen, Ronghe Jin, Zhi Gao

Wuhan University, China, People's Republic of

1. We introduce a novel framework for 4D radar-camera extrinsic calibration that utilizes commonly available dynamic objects as natural correspondences.

2. We develop a heuristic-guided strategy to reliably associate radar points with image detections and estimate the extrinsic parameters without deep learning.

12:00pm - 1:15pm

ThS16: Earth Embeddings: Investigating Accurate and Accessible Deep Geospatial Feature Representations
Location: 717A

12:00pm - 12:15pm

Beyond AlphaEarth: Toward Human-Centred Spatial Representation via POI-Guided Contrastive Learning

Junyuan Liu¹, Quan Qin^1,2, Guangsheng Dong^1,2, Xinglei Wang¹, Jiazhuang Feng¹, Zichao Zeng¹, Tao Cheng¹

¹University College London, United Kingdom; ²Wuhan University, China

General-purpose spatial representations are essential for building transferable geospatial foundation models (GFMs). Among them, the AlphaEarth Foundation (AE) represents a major step toward a global, unified representation of the Earth's surface, learning 10-meter embeddings from multi-source Earth Observation (EO) data that capture rich physical and environmental patterns across diverse landscapes. However, such EO-driven representations remain limited in capturing the functional and socioeconomic dimensions of cities, as they primarily encode physical and spectral patterns rather than human activities or spatial functions. We propose AETHER(AlphaEarth–POI Enriched Representation Learning), a lightweight framework that adapts AlphaEarth to human-centered urban analysis through multimodal alignment guided by Points of Interest (POIs). AETHER aligns AE embeddings with textual representations of POIs, enriching physically grounded EO features with semantic cues about urban functions and socioeconomic contexts. In Greater London, AETHER achieves consistent gains over the AE baseline, with a 7.2% relative improvement in land-use classification F1 and a 23.6% relative reduction in Kullback–Leibler divergence for socioeconomic mapping. Built upon pretrained AE, AETHER leverages a lightweight multimodal alignment to enrich it with human-centered semantics while remaining computationally efficient and scalable for urban applications. By coupling EO with human-centered semantics, it advances geospatial foundation models toward general-purpose urban representations that integrate both physical form and functional meaning.

12:15pm - 12:30pm

Bridging Earth's surface and atmosphere with Copernicus embeddings

Yi Wang¹, Zhitong Xiong¹, Chenying Liu^1,2, Adam Stewart^1,2, Thomas Dujardin¹, Nikolaos Ioannis Bountos^3,4, Angelos Zavras^3,4, Franziska Gerken⁵, Ioannis Papoutsis³, Laura Leal-Taixé⁵, Xiao Xiang Zhu^1,2

¹Technical University of Munich, Germany; ²Munich Center for Machine Learning, Germany; ³National Technical University of Athens & National Observatory of Athens, Greece; ⁴Harokopio University of Athens, Greece; ⁵NVIDIA

This work demonstrates the potential of foundation-model-encoded satellite embeddings to bridge Earth's surface and atmosphere. Based on our multimodal foundation model Copernicus-FM, we curate a global embedding dataset at 0.25°x0.25° resolutions (in consistency with ERA5). For each grid, multi-sensor images from Sentinel-1, 2, 3, 5P, and DEM are encoded into an embedding vector. These grid embeddings serve as condensed surface representations for downstream users. We verify their benefits as input for a climate task that predicts the 10-year mean and standard deviation of several climate parameters (e.g., precipitation) from ERA5. Compared to raw coordinates or location encodings, our results suggest that introducing surface embeddings helps produce more accurate prediction maps, reducing RMSEs by an average of up to 45%.

12:30pm - 12:45pm

DESPINA: Synthesis of High-Fidelity Planetary Horizon Reconstructions Using DEM-Guided Diffusion

Adam Nelson-Archer, Raunak Sarbajna, Christoph F. Eick

University of Houston, United States of America

Ground-level horizon imagery is scarce across planetary bodies, making representation-centred approaches attractive for downstream geospatial tasks. We present DESPINA, a geospatial representation system that converts digital elevation models (DEMs) into structured neural embeddings of terrain geometry that condition a diffusion model to produce geometry-preserving, terrain-consistent visual reconstructions for a specified location and view direction.

Our pipeline integrates numeric elevation data (DEMs), structural embeddings (inverse-depth and soft edges), and textual priors, unifying heterogeneous geospatial signals into a shared, metric conditioning space. Using a Stable Diffusion model constrained with ControlNet, we can generate geologically consistent yet texturally diverse horizon datasets. Appearance priors are learned from historical surface photography to capture realistic textures and lighting cues, and geometric validation is performed against DEM-derived skylines and depth structure, independent of photographic training data. Through quantitative evaluation and a pilot qualitative study, DESPINA maintains skyline fidelity and geological boundaries while improving structural similarity relative to an image-conditioned baseline. Although our experiments use lunar DEMs and historical surface photography, the method is domain-agnostic and applicable to Earth, Mars, and other planetary DEMs.

12:45pm - 1:00pm

Towards improved crop type classification: a compact embedding approach suitable for small fields

Madeline Lisaius¹, Andrew Blake³, Clement Atzberger², Srinivasan Keshav¹

¹Department of Computer Science and Technology, The University of Cambridge, United Kingdom; ²dClimate Labs, New York; ³Clare College, The University of Cambridge, United Kingdom

Satellite -based crop classification and maps are important tools for food security and climate change mitigation, but existing approaches are not effective for small field systems. To address this, crop type classification using embeddings generated by a global foundation model, TESSERA, are compared to standard classification approaches in the literature. We find that our embedding -based approach offers a triple win: 1) consistent and statistically significant performance improvement over current methods, 2) greater simplicity due to the elimination of feature engineering, and 3) the reduction of computational cost. Our embedding -based approach achieves significantly higher F1 scores in the classification of 5 of 7 crop types for small fields in Austria (over 10% improvement in one case). Additionally, the TESSERA embedding -based method uses 8% of compute compared to the raw data method. These results indicate that embeddings are an effective approach for crop type classification tasks in small field systems.

1:00pm - 1:15pm

Utilising embeddings for maps of winter wheat and crop rotation in Henan China during 2018-2024

Ziwei Du¹, Keyi Rao¹, Chang Wu¹, Kexin Wang¹, Shixin Dong¹, Zhaocong Wu^1,2

¹School of Remote Sensing and Information Engineering, Wuhan University, China; ²Aerospace Information Research Institute, Henan Academy of Sciences, Henan 450046, China.

This study explores the potential of the AlphaEarth Foundation (AEF) embeddings, a global, annual, analysis-ready satellite embedding dataset, for winter wheat and crop rotation mapping. Firstly, we analyze AEF embeddings for intra-class consistency and inter-class separability, assessing their effectiveness in representing wheat within the semantic embedding space. Subsequently, we compare multiple lightweight classifiers to identify an optimal model and conduct spatiotemporal generalization experiments across Henan Province from 2018 to 2024 using only a limited set of labelled samples from 2020. Based on the resulting wheat distribution maps, crop rotation patterns are further identified.Experimental results demonstrate that AEF embeddings exhibit strong semantic coherence and discriminative capability. Acceptable classification accuracy (OA = 0.85) can already be achieved using simple models such as cosine similarity and linear regression. More advanced lightweight classifiers further improve the performance (OA = 0.86–0.93) while maintaining stable results across different years and regions (spatial consistency = 0.82). In addition, the crop rotation maps show high spatial agreement with existing products, while producing more spatially contiguous field patterns.Overall, this study confirms that AEF embeddings can serve as effective, ready-to-use features for large-scale agricultural remote sensing applications. By substantially reducing the reliance on complex feature engineering and extensive training samples, they provide a practical and scalable solution for mapping winter wheat and its crop rotation patterns.

1:30pm - 2:45pm

WG II/4A: AI/ML for Geospatial Data
Location: 713A

1:30pm - 1:45pm

Target Vessel Identification in Aerial Search Imagery via MLLM-Based Attribute Extraction and Geolocation Fusion

Jeonghyo Oh, Youngon Oh, Impyeong Lee

Dept. of Geoinformatics, University of Seoul, Seoul, Republic of Korea

Identifying a distressed vessel among many ships detected in wide-area aerial imagery is a critical challenge in maritime Search and Rescue (SAR) operations. Conventional methods cannot determine which vessel matches the incident description, especially when Automatic Identification System (AIS) reports are uncertain. This study proposes an integrated framework that combines MLLM-based semantic attribute extraction with geolocation fusion to prioritize candidate vessels according to their consistency with Situation Report (SITREP) based scenarios. The method detects vessels using YOLOv8, tracks them with Deep Simple Online and Real-time Tracking (DeepSORT), and performs image-based georeferencing using onboard metadata. A Multi-modal Large Language Model (MLLM) extracts appearance/status attributes from representative vessel images, while scenario descriptions are also converted to attributes. Both sets are encoded using MiniLM embeddings. Finally, semantic similarity is fused with geolocation proximity within an Support Vector Machine (SVM) classifier to produce a probability-ranked list of candidates. Experiments using real aerial search footage demonstrate robust identification performance across a range of scenario quality levels. The correct vessel appears within the top three candidates in more than 73\% of cases and within the top five in more than 91\%, even when attribute extraction is affected by low resolution, illumination effects, or missing scenario information. These results show that coarse semantic cues, when combined with approximate geolocation, provide a resilient basis for identifying target vessels under high uncertainty. The proposed framework offers a practical foundation for automated SAR decision support, enabling faster and more reliable prioritization during wide-area maritime search operations.

1:45pm - 2:00pm

Zero-shot Vision-Language Reranking for Cross-View Geolocalization

Yunus Talha Erzurumlu¹, John E. Anderson², William J. Shuart², Charles Toth³, Alper Yilmaz³

¹Dept. of Electrical and Computer Engineering, The Ohio State University, United States of America; ²US Army Engineer Research and Development Center, Geospatial Research Laboratory, United States of America; ³Dept. of Civil Engineering, The Ohio State University, United States of America

Cross-view geolocalization (CVGL) systems, while effective at retrieving a list of relevant candidates (high Recall@k), often fail to identify the single best match (low Top-1 accuracy). This work investigates the use of zero-shot Vision-Language Models (VLMs) as rerankers to address this gap. We propose a two-stage framework: state-of-the-art (SOTA) retrieval followed by VLM reranking. We systematically compare two strategies: (1) Pointwise (scoring candidates individually) and (2) Pairwise (comparing candidates relatively). Experiments on the VIGOR dataset show a clear divergence: all pointwise methods cause a catastrophic drop in performance. In contrast, a pairwise comparison strategy using LLaVA improves Top-1 accuracy over the strong retrieval baseline. Our analysis concludes that VLMs are poorly calibrated for absolute relevance scoring but are effective at fine-grained relative visual judgment, making pairwise reranking a promising direction for enhancing CVGL precision.

2:00pm - 2:15pm

From Pixels to Polygons: Evaluating Vision Foundation Models for High-Resolution Orthophoto Segmentation and Vectorization

Islam Mansour, Michael Schmitt

University of the Bundeswehr Munich, Germany

Accurate and topologically consistent vector data extraction from aerial imagery is essential for geospatial applications such as urban mapping and the enrichment of volunteered geographic information (VGI) platforms like OpenStreetMap (OSM). While deep learning has advanced automated feature extraction, traditional supervised networks remain constrained by large annotation requirements and limited generalization. Recent vision foundation models offer a promising alternative through promptable, zero-shot segmentation capabilities.

This study presents a modular “pixels-to-polygons” workflow for transforming high-resolution orthophotos (20 cm DOP20) into GIS-ready vector data, comprising four stages: data preparation, segmentation, vectorization, and validation. The framework is model-agnostic and designed to integrate different vision foundation models without modification to downstream processing. As a representative instantiation, we evaluate the Segment Anything Model 2 (SAM2) in automatic mask generation mode over the University of the Bundeswehr Munich campus.

The resulting segmentation masks are polygonized and compared against OSM reference layers. Preliminary results indicate that vision foundation models can effectively delineate major built-up areas, vegetation, and transport infrastructure without task-specific training, producing geometrically smooth and topologically valid polygons. However, smaller or shadowed objects remain challenging, often leading to partial merging or fragmentation.

Future work will include quantitative evaluation using raster-based Intersection-over-Union and vector-domain metrics such as completeness, correctness, and geometric fidelity. The study provides an initial assessment of vision foundation models for high-resolution remote sensing and highlights their potential to bridge the gap between general-purpose segmentation models and GIS-compatible vector data generation.

2:15pm - 2:30pm

Polarization-Aware Segmentation for Camouflaged Threat Detection from UAVs

Youssef Korny, Sunghwan Yoo, Gunho Sohn

Department of Earth and Space Science and Engineering, York University

Surface-laid unexploded ordnance (UXO) and landmines constitute a critical humanitarian crisis. While unmanned aerial vehicles (UAVs) provide a scalable remote sensing solution, detecting modern, non-metallic explosive devices in cluttered environments remains a profound Camouflaged Object Detection (COD) challenge. Traditional optical sensors frequently suffer from foreground-background confusion when a target's texture mimics its surroundings. To overcome these physical bottlenecks, we introduce XPol-Net, a novel multimodal architecture synergizing the semantic reasoning of Vision Transformers with the deterministic physics of polarimetric imaging. Built on a hierarchical PVTv2 backbone, XPol-Net utilizes a progressive Dual Cross-Attention Strategy for effective modality fusion. In early stages, Channel Cross-Attention (CCA) filters material-specific Degree of Linear Polarization (DoLP) cues to suppress background clutter. In deeper stages, Spatial Cross-Attention (SCA) dynamically aligns high-level RGB semantics with strict structural boundaries. To enhance robustness and prevent modality collapse, we deploy a multi-task auxiliary learning framework that reconstructs the continuous Angle of Linear Polarization (AoLP) map. On the PCOD benchmark, XPol-Net achieves state-of-the-art results in global structural alignment (E_phi of 0.980 and 0.984 at 352 x 352 and 704 x 704, respectively). While minor trade-offs are observed in localized metrics such as S_alpha or F_beta, XPol-Net remains highly competitive, consistently delivering superior results in E_phi and MAE. By prioritizing structural recall over localized strictness, XPol-Net ensures the complete discovery of concealed targets, establishing a reliable, physics-aware foundation for humanitarian demining operations.

1:30pm - 2:45pm

WG II/9A: Vision Metrology
Location: 713B

1:30pm - 1:45pm

A Novel Camera-to-Robot Calibration Method for Vision-Based Floor Measurements

Jan Andre Rudolph, Dennis Haitz, Markus Ulrich

Karlsruhe Institut für Technologie, Germany

A novel hand–eye calibration method for ground-observing mobile robots is proposed. While cameras on mobile robots are common, they are rarely used for ground-observing measurement tasks. Laser trackers are increasingly used in robotics for precise localization. A referencing plate is designed to combine the two measurement modalities of laser-tracker 3D metrology and camera-based 2D imaging. It incorporates reflector nests for pose acquisition using a laser tracker and a camera calibration target that is observed by the robot-mounted camera. The procedure comprises estimating the plate pose, the plate–-camera pose, and the robot pose, followed by computing the robot–-camera transformation. Experiments indicate sub-millimeter repeatability.

1:45pm - 2:00pm

Photogrammetric Monitoring of load-induced vertical Deformations in the Superstructure of a research Bridge

Robert Blaskow, Hannes Sardemann, Hans-Gerd Maas

Dresden University of Technology, Germany

Early detection of structural issues is crucial for timely maintenance and extending bridge lifespans. Conventional visual inspections alone are not sufficient for the vast number of structures, highlighting the need for intelligent, real-time monitoring systems. The studies presented here were conducted as part of the DFG priority programme SPP100+ in collaboration with the IDA-KI project. In addition to conventional civil engineering sensors, the 45-meter-long openLAB research bridge, a three-span prestressed concrete structure, was equipped with target fields for photogrammetric measurements. During controlled load tests, in which a motorized load vehicle passes over the bridge, a high-resolution camera captures image sequences of the measurement fields. The photogrammetric workflow involved camera calibration, image sequence acquisition, and precise 3D coordinate determination using coded targets. Displacement values in the range of several millimeters were calculated frame-by-frame. Results from a typical load cycle showed initial upward deflection followed by downward movement, corresponding to the load vehicle’s passage. This approach demonstrates the potential of photogrammetry for accurate, non-contact deformation monitoring, supporting the development of digital twins and advanced structural health monitoring systems for bridges.

2:00pm - 2:15pm

Assessing the effects of time on cadaveric facial anatomy using conventional photogrammetry, stereophotogrammetry and computed tomography

Morgan Titmus¹, Hannah Radley¹, Paul Ellery¹, Gary Whittaker¹, Zhonghua Sun¹, Petra Helmholz²

¹Curtin Medical School, Curtin University, Australia; ²School of Earth and Planetary Sciences, Curtin University, Australia

Body donation remains critically important for anatomical science, allowing examination of biological structures with three-dimensional (3D) context. However, body donors (cadavers) are a time-limited resource and the scarcity of body donors has prompted an interest in digital body preservation. Multiple imaging techniques (e.g., conventional photogrammetry[CPG], stereophotogrammetry[SPG] and computed tomography[3DCT]) can capture the 3D characteristics of a specimen indefinitely. Digital anatomical records provide an opportunity to measure anatomical structures in the absence of the physical specimen. In 2022, the face of a preserved body donor was digitally reconstructed using CPG and 3DCT. 28 months later, a repeat survey was performed using SPG and a series of facial landmarks were directly measured. The accuracy and stability of facial soft-tissues over time were measured using point-to-point and cloud-to-mesh techniques. The results show that anatomical models produced by 3DCT and CPG produce similar facial measurements to those acquired by SPG and direct measurement at later timepoints. These data indicate that chemical fixation adequately stabilises facial anatomy over time, each sensor can be used interchangeably for facial measurement and models can be co-registered with minimal discrepancy.

2:15pm - 2:30pm

Investigating calibration constraints for the processing of a narrow-view multi-camera system

Petra Helmholz¹, Derek Lichti², Liam Boyle¹, Richard Palmer^1,3, Roslyn Ward³

¹Spatial Sciences, School of Earth and Planetary Sciences, Curtin University, Kent St, Bentley, Australia; ²Department of Geomatics Engineering, University of Calgary, Calgary, AB T2N 1N4, Canada; ³School of School of Allied Health, Curtin University, Kent St, Bentley, Australia

Speech is a highly complex and multidimensional process, requiring precise coordination of muscular actions within the vocal tract. Disruptions or delays in speech motor control often lead to speech impairments. Recent advancements in markerless facial tracking technology enable the collection of objective measurements to assess these impairments. To obtain such photogrammetric measurements, a multi-camera network is employed, making accurate camera calibration essential. This paper examines the constraints applied during the calibration process. Two adjustment strategies were evaluated. The first, Independent Adjustment (IDP), performs self-calibration for each camera without introducing constraints. The second, Combined Adjustment (CMB), incorporates object space constraints by ensuring that object point locations observed from all cameras remain consistent. Given the cameras’ narrow fields of view, both IDP and CMB were tested with additional constraints related to the principal point offset. Each adjustment was executed under two conditions: fixing the principal point offset to zero or estimating it as part of the calibration. Results indicate that the choice of adjustment significantly affects the interior orientation parameters (IOPs). IDP with the principal point offset fixed to zero produced the most accurate outcomes. However, variations in IOPs had no meaningful impact on object space coordinates. These findings suggest that the simplest approach—IDP with the principal point offset fixed to zero—offers reliable calibration for multi-camera systems used in speech assessment. This streamlined method can be adopted in future applications to enhance efficiency without compromising accuracy.

1:30pm - 2:45pm

WG II/8B: Environmental & Infrastructure Monitoring
Location: 714A

1:30pm - 1:45pm

Reliability-qualified Nighttime Lights for Disaster Impact and Recovery in cloud-impacted tropical Regions

Rene Jr Principe, Karin Reinke, Simon Jones

RMIT University, Australia

Daily satellite-derived nighttime lights (NTL) are increasingly used to monitor electricity disruption and recovery, but their reliability in tropical regions is constrained by persistent cloud cover and intermittent observation. This study adopts a diagnostic-first framework that treats observability as a prerequisite for interpretation, determining when daily NTL signals are sufficiently supported to reflect underlying grid dynamics. Using NASA’s VIIRS Black Marble product, we quantify spatial and temporal completeness and radiance stability across the Samar–Leyte sub-grid in the Philippines. Results show that observations are highly intermittent and spatially heterogeneous, with urban areas providing more stable and interpretable signals, while rural regions remain noise-dominated.

To assess whether reliability-qualified NTL reflects electricity demand, DNB-BRDF radiance is aligned with hourly load data from the National Grid Corporation of the Philippines (NGCP) using settlement-based masks derived from GHSL SMOD. Alignment is evaluated using correlation, error, and retained coverage, combined into a composite score. Strongest agreement occurs under low to mid-range valid-pixel thresholds and within urban-focused masks, which balance signal fidelity and temporal continuity at the cost of reduced coverage. Replication across four additional Visayas sub-grids shows that optimal threshold–mask configurations vary by region, reflecting differences in cloud regime and settlement structure.

These results establish explicit conditions under which daily NTL can be interpreted as a proxy for grid dynamics. The framework provides a reproducible basis for reliability-qualified analysis using globally available datasets and can be tested in other cloud-prone regions where ground-based data are limited.

1:45pm - 2:00pm

Drone-based photogrammetry for pavement deterioration detection and quantification in airport infrastructure

Guido Staub, Elisabet Jara

University of Concepción, Chile

The maintenance of airport pavements is critical to ensuring the safety and efficiency of air operations. Conventional inspection methods are often time-consuming, subjective, and prone to inconsistencies in data collection. Recent advances in unmanned aerial vehicle (UAV) photogrammetry offer a potential alternative for improving inspection efficiency and measurement accuracy.

This study evaluates the applicability of UAV-based photogrammetry for the detection and quantification of pavement distresses under conditions representative of airport infrastructure. Image data were acquired at different flight altitudes and overlap configurations and processed using Structure-from-Motion techniques to generate high-resolution orthomosaics and Digital Elevation Models (DEMs). The resulting datasets were analyzed to identify, delineate, and classify deterioration types and severity levels.

The results indicate that a flight altitude of 10 m combined with 80% longitudinal and 70% transversal overlap provides an optimal balance between spatial resolution and operational efficiency. Under unobstructed conditions, photogrammetric analysis detected more than 98% of existing distresses and enabled more precise geometric delineation compared to traditional field-based methods. Undetected distresses were primarily associated with shadowed or obstructed areas, highlighting the influence of environmental conditions on detection performance.

Overall, the findings demonstrate that UAV-based photogrammetry is a reliable and efficient approach for pavement condition assessment, with significant potential to enhance data quality and reduce inspection time in airport infrastructure management.

2:00pm - 2:15pm

MultiChange3D: A Multi-Scene, Multi-Sensor Dataset for Benchmarking 3D Geometric Change Detection

Zhaoyi Wang¹, Paweł Trybała², Andreas Wieser¹, Fabio Remondino²

¹Geosensors and Engineering Geodesy (GSEG), ETH Zurich, Zurich, Switzerland; ²3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK), Trento, Italy

3D change detection is essential for monitoring infrastructure, environmental dynamics, and natural hazards. However, existing algorithms are often evaluated on single-scene datasets, and their generalization across varied real-world scenes remains largely unexplored due to the absence of a universal benchmark. To address this issue, we propose MultiChange3D, a multi-scene, multi-sensor 3D change detection dataset for identifying geometric changes in 3D space. The dataset provides registered pairs of point clouds with ground-truth geometric change labels, enabling standardized evaluation across different methods. To demonstrate the use of the MultiChange3D dataset, we benchmark an initial set of approaches on a subset of the dataset. The evaluated methods include classical Euclidean distance-based methods (C2C, M3C2), 3D displacement estimation-based approaches (F2S3, Landslide-3D), and deep learning-based classification methods (KPConv, EF-KPConv, PGN3DCD). Quantitative and qualitative analyses indicate the strengths and limitations of the evaluated methods, highlighting the challenges in cross-scene generalization under variations in point density, scale, and types of changes. The full dataset and evaluation code is openly available at: https://github.com/3DOM-FBK/multichange3d.

2:15pm - 2:30pm

Time-Adaptive Change Analysis through Extension of the M3C2 Algorithm using Multi-Modal Laser Scanning Data in a Salt Marsh Environment

Xiaoyu Huang¹, Mathilde Letard¹, Dimitri Lague^2,3, Bernhard Höfle^4,5, Ronald Tabernig^4,5, Paul Leroy², Katharina Anders¹

¹Remote Sensing Applications, TUM School of Engineering and Design, Technical University of Munich, Ottobrunn, Germany; ²Univ Rennes, Plateforme LiDAR, OSERen, UAR 3343 CNRS, France; ³Univ Rennes, Géosciences Rennes, UMR 6118 CNRS, France; ⁴3DGeo Research Group, Institute of Geography, Heidelberg University, Heidelberg, Germany; ⁵Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Heidelberg, Germany

Quantifying topographic dynamics from 3D point cloud time series is essential for geoscientific applications. However, laser scanning data typically varies between epochs in point density due to differing survey properties. These irregularities present a challenge for change detection, particularly across multi-temporal and multi-modal data. We propose a new approach, adaptive temporal aggregation, as an extension of the Multiscale Model to Model Cloud Comparison (M3C2) algorithm. Driven by a local point density requirement, our method employs both a spatial and a temporal neighborhood. If a core point's neighborhood is too sparse for M3C2 estimation, an iterative temporal search progressively incorporates data from temporally adjacent epochs until the density requirement is met or a maximum temporal window is reached. This adaptive process ensures sufficient local density while preventing unnecessary temporal aggregation, a key advantage over global aggregation. We evaluated our method on a multi-modal dataset from the Mont-Saint-Michel Bay, France (38 irregular epochs, ~1 decade). Results demonstrate significantly improved change detection, increasing completeness by >13% (vs. standard M3C2) and accuracy by 31% (vs. fixed-window averaging). Our work provides a robust approach for enhancing 3D change detection algorithms for complex, real-world 4D datasets, enabling higher accuracy and completeness in analysing surface dynamics.

1:30pm - 2:45pm

ThS22: Earth Observation for Crop Health and Resilient Food Systems
Location: 714B

1:30pm - 1:45pm

Evaluation of Sentinel-2 and EnMAP for crop classification across Canadian agricultural landscapes

Emily Lindsay¹, Catherine Champagne², Oliver McMillan³

¹Agriculture and Agri-Food Canada, Canada; ²Agriculture and Agri-Food Canada, Canada; ³Agriculture and Agri-Food Canada, Canada; Carleton University, Canada

This study evaluates the classification performance of Sentinel-2 multispectral and EnMAP hyperspectral imagery across three Canadian agricultural sites. Using Random Forest with recursive feature elimination, we assess whether EnMAP’s spectral richness provides measurable improvements for operational crop mapping. Results show comparable overall accuracies, with EnMAP offering advantages for spectrally complex crop types. Findings highlight the potential and practical limitations of incorporating satellite hyperspectral data into national agricultural monitoring workflows.

1:45pm - 2:00pm

Loss of Agricultural Land in Slovakia: Evidence from LPIS and Sentinel-2 Data

Šimon Opravil¹, Dobromil Galvánek², Miloš Rusnák¹, Monika Kopecká¹, Tomáš Goga¹

¹Institute of Geography, Slovak Academy of Sciences, Slovak Republic; ²Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Slovak Republic

Agricultural land in Slovakia has undergone a significant transformation over the past two decades, but the extent and direction of this change remain insufficiently quantified at the national level. This study provides the first spatially explicit assessment of agricultural land loss using Land Parcel Identification System (LPIS) records (2004–2022) in combination with Sentinel-2-based land cover classification. By differentiating the LPIS, we identified polygons that were lost from the agricultural land register; these polygons could represent abandoned agricultural land as well as areas converted to other uses. These polygons were classified into four land cover classes using a Random Forest model trained on Sentinel-2 spectro-temporal metrics and cleaned LUCAS 2022 samples (F1 = 0.867), with additional filtering applied to separate grasslands from shrubs based on vegetation height. The results show that more than 1,000 km² of originally suitable agricultural land has been converted to other land cover types. Forest expansion accounts for 834 km², while 298 km² has been converted to shrubland and 553 km² remains as grassland. Non-forested areas, including buildings and infrastructure, cover an area of 258 km². Only 17 km² of formerly agricultural land remained as actively utilised arable land. These findings indicate that agricultural land abandonment, ecological succession, and urbanisation are the primary causes of agricultural land loss in Slovakia. The research presented provides important data confirming that the loss of agricultural land is extensive and largely threatens habitats with high biodiversity, highlighting the urgent need to harmonise strategies across agriculture, the environment, and land-use planning.

2:00pm - 2:15pm

A Hierarchical Robust Combined Index for Agricultural Drought Detection and Monitoring Using Earth Observation Big Data: Application to a Case Study in Southern Italy

Filippo Bocchino¹, Giulia Graldi¹, Camillo Zaccarini², Deodato Tapete³, Alessandro Ursi³, Maria Virelli³, Patrizia Sacco³, Valeria Belloni¹, Roberta Ravanelli⁴, Mattia Crespi¹

¹Geodesy and Geomatics Division, Department of Civil, Building and Environmental Engineering (DICEA), Sapienza University of Rome, Rome, Italy; ²Risk Management Department, Institute of Services for Agricultural and Food Market (ISMEA); ³Italian Space Agency (ASI), Rome, Italy; ⁴Geomatics Unit, Department of Geography, University of Liège, Liège, Belgium

Agricultural drought affects crop productivity and threatens food security. This study presents the Hierarchical Robust Combined Drought Index (HRCDI) for operational agricultural drought monitoring, based on Earth Observation (EO) data freely available in Google Earth Engine. The HRCDI integrates four complementary indicators—Standardized Precipitation Evapotranspiration Index (SPEI3), Soil Moisture (SM), Land Surface Temperature (LST), and Normalized Difference Vegetation Index (NDVI)—using a hierarchical fuzzy logic framework that reflects the progression of drought impacts from climatic anomalies to vegetation stress. To ensure robustness, monthly anomalies of SM, LST, and NDVI were computed through a robust z-score formulation based on median and NMAD, which reduces the influence of outliers. The HRCDI was applied to the Province of Foggia (southern Italy), one of the main durum wheat production areas in Italy, over the period 2017–2022. HRCDI outputs were aggregated at the municipality scale and validated against independent datasets, including durum wheat yield statistics (2006–2022), SPEI3 provided by the Institute of Services for Agricultural and Food Market (ISMEA), and reports from the European Drought Impact Database. The HRCDI effectively captured the severity and spatial extent of major drought events, particularly in 2017 and 2022, which corresponded to documented yield losses of −5% and −22%, respectively. Results highlight the scalability, operational relevance, and transferability of the HRCDI for supporting drought early warning and agricultural risk management. The HRCDI framework could be applied to other regions and integrated with higher-resolution satellite data to enhance drought monitoring in line with the objectives of the Common Agricultural Policy.

2:15pm - 2:30pm

Remote Sensing of Maize Physiological and Nutrient Dynamics in Response to Fall Armyworm Infestation

Tatenda Dzurume^1,2, Roshanak Darvishzadeh¹, Timothy Dube², Wafaa Rushdi Saleh Isleem³, Ismail Ibrahim⁴, Andrew Nelson¹

¹Department of Natural Resources, Faculty of Geo-Information Science and Earth Observation, University of Twente, Enschede, the Netherlands; ²Institute for Water Studies, University of the Western Cape, Bellville, South Africa; ³Department of Plant Protection, Ministry of Agriculture, Amman, Jordan; ⁴Department of Plant Protection, School of Agriculture, The University of Jordan, Amman, Jordan

Fall Armyworm (FAW) is a major pest, threatening maize production and food security. FAW preferentially attacks nitrogen-rich maize due to its nutritional composition, which accelerates larval development. Understanding the effects of FAW infestation on maize growth and nutrient status is critical for effective crop management. This study (i) examines the impact of FAW infestation on maize using key physiological metrics, including fresh/dry weight, chlorophyll content (as measured by SPAD), leaf area index (LAI), stem length, and leaf dimensions (length, width, and area); (ii) investigates differences in carbon, hydrogen, nitrogen, and sulphur concentrations between healthy and infested maize, and (iii) analyses spectral variability between healthy and infested maize using leaf-level hyperspectral data. Hyperspectral data at the leaf scale were used to identify and distinguish the spectral reflectance between healthy and infested FAW maize crops. Results indicate that infested crops exhibit lower fresh weight, reduced LAI, shorter stems, and smaller leaves compared to healthy crops, highlighting a substantial negative effect on above-ground biomass and overall crop vigour. Infested crops showed higher nitrogen levels than healthy crops. This trend could be attributed to nitrogen redistribution following FAW damage, where nitrogen decreases in attacked leaves and increases in roots, with partial recovery after the pest moves on to another crop. The study establishes a methodological framework for linking laboratory, field, and remote sensing approaches, providing a foundation for future predictive modelling of pest impacts on maize nutrient content and productivity.

1:30pm - 2:45pm

WG IV/1A: Spatial Data Representation and Interoperability
Location: 715A

1:30pm - 1:45pm

Bridging Semantic Mesh, CityGML, and Gaussian Splatting for Urban Modelling and Visualization

Deni Suwardhi^1,3, Muhammad Arif Sudibyo², Wahyunan Andika³, Muhammad Ihsan², Agus Ambarwari², Ratri Widyastuti¹, Widiatmoko Azis Fadilah⁴, Arnadi Murtiyoso⁴, Pierre Grusennmeyer⁴, Fabio Remondino⁵

¹Spatial System and Cadastral Research Group, Institut Teknologi Bandung (ITB), Indonesia; ²Postgraduate Programmes, Institut Teknologi Bandung (ITB), Indonesia; ³PT Inovasi Mandiri Pratama, Spatial Information Company, Indonesia; ⁴Université de Strasbourg, CNRS, INSA Strasbourg, ICube Laboratory UMR 7357, Photogrammetry and Geomatics Group, 67000, Strasbourg, France; ⁵3D Optical Metrology (3DOM) Unit, Bruno Kessler Foundation (FBK), Trento, Italy

Urban digital twin systems require 3D city representations that reconcile semantic structure, geometric reliability, simulation capability, and photorealistic real-time rendering. Existing approaches typically prioritize a single modelling paradigm, limiting their capacity to simultaneously support analytical and visualization demands. CityGML ensures standardized semantics and topological consistency but often lacks detailed surface realism. Surface-based semantic mesh models preserve geometric detail suitable for environmental simulations but provide limited hierarchical semantic organization. In contrast, neural radiance-field approaches, such as 3D Gaussian Splatting (3DGS), enable photorealistic rendering at interactive frame rates but do not explicitly encode topology or structured semantics. This study establishes a structured comparative framework linking LiDAR-derived CityGML, triangle-based semantic mesh, 3D Gaussian Splatting, and Triangle Splatting within a unified urban modelling workflow. UAV-based data acquired using a multirotor platform with a DJI ZENMUSE L2 sensor serve as the geometric backbone for reconstructing CityGML LoD1-LoD2 models. The semantic model is transformed into a textured triangular mesh to provide a geometry-consistent baseline, while radiance-based models are generated from the same imagery using multiple 3DGS implementations and a triangle splatting framework. Comparative evaluation investigates geometric coherence, semantic preservation, and radiance consistency to identify structural correspondences across the representations. Rather than treating them as competing alternatives, the results reveal complementary modelling layers that can be systematically mapped. Based on these findings, the paper formulates a conceptual foundation for a unified 3D urban model capable of transforming consistently into semantic-structured, surface-based, and radiance-based representations, enabling adaptive and extensible urban digital twin systems.

1:45pm - 2:00pm

Linking Persistent Scatterers with Urban Features Using LoD2 Building Models

Natalia Wielgocka, Grzegorz Jóźków, Paweł Bogusławski

Wrocław University of Environmental and Life Sciences, Poland

Persistent Scatterer Interferometry (PSI) provides valuable information on ground and structural changes, particularly in dynamic urban environments. At the same time, urban digital twins (UDTs), as detailed three-dimensional representations of cities, are increasingly used for monitoring and analysis. However, the effective integration of results of PSI processing named Point Scatterers (PSs) into such frameworks remains challenging due to the limited positional accuracy of PSs, despite the high precision of displacement estimates. This study investigates a methodology for integrating PSI data from the European Ground Motion Service (EGMS) with airborne laser scanning (ALS) data and Level of Detail 2 (LoD2) building models to improve the connection of PSs with real-world objects. Three integration variants were analysed, differing in the reference datasets used for linking: (i) ALS point cloud, (ii) point cloud derived from LoD2 models and digital terrain model (DTM), and (iii) a combined approach integrating ALS and LoD2 representations. The results demonstrate that the combined approach yields the highest performance, achieving up to 88% of successfully linked PSs, compared to 70.4% and 80.3% for the ALS-only and LoD2-based approaches, respectively. The findings indicate that LoD2 models provide sufficient geometric detail for PS linking, despite lacking fine-scale building elements. Their use improves data completeness, particularly on building facades, where ALS data are often sparse or missing. The proposed methodology confirms the applicability of EGMS products as a valuable data source for 3D geoportals and urban digital twins, supporting advanced spatial analyses in complex environments.

2:00pm - 2:15pm

IFC and QGIS integration for the Integrated Water Service management

Michele Berlato¹, Giorgio Agugiaro², Ken Arroyo Ohori², Carlo Zanchetta³

¹DTG – Department of Management Engineering, University of Padua, Italy; ²3D Geoinformation group, Department of Urbanism, Faculty of Architecture and Built Environment, Delft University of Technology, Delft, The Netherlands; ³DICEA – Department of Civil, Building and Environmental Engineering, University of Padua, Italy

Integrated Water Service (IWS), which combines water supply and wastewater treatment, requires complex geometric and semantic management. Building Information Modelling (BIM) and Geographic Information Systems (GIS) are the two main geospatial technologies involved in this field. In very simple terms, BIM allows to have 3D models with detailed geometric and semantic information, and GIS permits to geolocate and manage the models in the territory. To facilitate the integration of these two systems, we propose to manage the BIM models through a standardised relational database. In the BIM world, relational databases are not yet widely used, but the technology is already available. For example, ifcSQL is an encoding of the Industry Foundation Classes (IFC) data model for a relational database. This article proposes an extension of the ifcSQL database with the added possibility to store the georeferenced explicit geometries of the IFC models. Additionally, we present a prototype to make such IFC-based data available via QGIS. In this way, a user can interact with BIM data using open GIS technologies. As a result, it is possible to visualise the models in 2D and 3D, and to perform queries on their attributes. A set of real-world case studies has served as testing ground to develop the functionalities that allow for the interaction with the BIM models via QGIS. Such test cases originate from interviews with a company that manages IWS in Northeast Italy.

1:30pm - 2:45pm

WG IV/8B: Digital Twins for Mobility and Navigation
Location: 715B

1:30pm - 1:45pm

Topological Analysis of OpenDRIVE Models for Advanced Autonomous Vehicle Simulations

Janos Mate Logo, Viktor Gyozo Horvath, Vivien Poto, Arpad Barsi

Budapest University of Technology and Economics, Department of Photogrammetry and Geoinformatics, Hungary

The increasing demand for safe and efficient autonomous vehicle (AV) operations has intensified the need for realistic, high-fidelity digital road representations that enable robust virtual testing environments. Simulation-based validation has become a cornerstone of the AV development process, allowing for the reproducible assessment of perception, localization, and decision-making modules under controlled conditions. Within this context, the ASAM OpenDRIVE specification provides a standardized, XML-based description of static road networks, encapsulating geometric, semantic, and structural elements such as roads, lanes, junctions, and roadside objects. While previous research has primarily focused on the geometric accuracy and semantic richness of High Definition (HD) maps, comprehensive topological analyses—especially those addressing consistency, connectivity, and completeness of OpenDRIVE models—remain largely unexplored. This study aims to fill that gap by introducing a formal topological framework for evaluating OpenDRIVE-based road models through both synthetic and real-world test cases.

1:45pm - 2:00pm

A Comprehensive Toolkit for Semi-Automated HD Maps Production: Integrating AI-Driven Feature Extraction with 3D Interactive Validation and Editing

YI-FENG CHANG, KAI-WEI CHIANG, MENG-LUN TSAI, PEI-LING LEE, SYUN TSAI, CHI-HSIN HUANG, TING-CHUN WU, YI-HAN JEN, JIUN-YO LIN, CHANG-LE LEE, CHING-HSIANG LIN, HAN-CHE HUANG

National Cheng Kung University, Chinese Taipei

This paper presents a comprehensive toolkit for semi-automated High-Definition Maps (HD Maps) production that integrates Artificial Intelligence (AI)-driven feature extraction with 3D human-in-the-loop validation. High-definition maps provide centimeter-level road geometry and traffic asset information, but large-scale production remains costly due to dense mobile mapping data and manual digitization. The proposed workflow consists of two self-developed components: a Semi-automated HD Maps Production Tool for batch extraction and a 3D HD Maps Validation and Editing Tool for structured review. The project-based pipeline ingests georeferenced mobile laser scanning point clouds, Inertial Navigation System / Global Navigation Satellite System (INS/GNSS) trajectories, and camera imagery, and applies configurable chains of ground filtering, road-marking extraction, voxel down-sampling, clustering, oriented bounding box analysis, and AI-based traffic asset detection. Candidate features with confidence indicators and basic attributes are stored in a project database and edited in a tightly coupled 3D environment that supports snapping, constrained adjustments, and semantic reclassification while logging all user edits. The toolkit is evaluated on a closed proving ground (CARLab, Shalun) and a freeway section of Taiwan National Highway No. 3. At CARLab, semi-automated extraction achieves F1-scores of 0.85–0.95 for key layers. For a one-kilometer highway section, operator time is reduced from 90–120 minutes in a purely manual Geographic Information System (GIS) workflow to about 45 minutes with the proposed approach, while maintaining comparable geometric accuracy. These results demonstrate a practical path towards scalable, traceable HD Maps production for autonomous driving applications.

2:00pm - 2:15pm

A Low-Altitude Data Space Framework Based on China̓s National 3D Mapping Program

Yin Gao^1,2, Jun Chen^1,2, Dehu Yang^1,3, Chaoquan Zhang¹, Fengyu Han¹

¹Moganshan Geospatial Information Laboratory, Huzhou, 313200, China; ²National Geomatics Center of China, Beijing 100830, China; ³Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu 610031, China

National Key Research and Development Program of China (2025YFB3910300);

2:15pm - 2:30pm

Geometrically accurate 3D Gaussian Reconstruction using high-density UAV LiDAR point clouds and open-vocabulary semantic optimization

Banghui Yang^1,2, Kai Qin^1,2, Yucheng Li^1,2, Jing Li^1,3

¹Aerospace Information Research Institute，Chinese Academy of Science, China, People's Republic of; ²University of Chinese Academy of Sciences,Beijing; ³International Research Center of Big Data for Sustainable Development Goals, China

3D scene reconstruction lies at the core of computer vision, photogrammetry, and geospatial science, spatial intelligence, aiming for accurate, photorealistic, and efficient digital twin representations of the real world. The emergence of revolutionary 3D Gaussian Splatting (3DGS) enables real-time rendering and geometrically precise reconstruction, yet existing methods struggle in large-scale outdoor scenes with weak textures, low geometric accuracy, dynamic objects, and lack of semantic information. Therefore, geometrically accurate 3D GS with enhanced semantic understanding greatly facilities the realization of digital twins for mobility and navigation. This work proposes a novel 3DGS framework which seamlessly incorporates dense UAV LiDAR point clouds, multi-view images and open-set semantics in an all-in-one optimization process. The key objective here is to investigate how geometric constraints derived from dense UAV LiDAR point clouds and cognitive supervision from SAM (Segment Anything Model) semantics can jointly participate in the optimization of Gaussian primitives, thereby improving geometry accuracy, visual realism, and semantic consistency in large-scale UAV 3D reconstructions for creating digital twins of the environments.

1:30pm - 2:45pm

ThS23A: Towards Large Cultural Heritage Foundation Models: Datasets, Semantic Alignment, and Component-Level Annotation
Location: 716A

1:30pm - 1:45pm

Investigating The Form And Restoration Of The Diji Altar

Wang Jinghan, Qi Ying, Hou Miaole

Beijing University of Civil Engineering and Architecture, China, People's Republic of

The restoration of historic buildings is an important topic in today's society and constitutes the primary subject of this study. The Diji Altar, located along the central axis of Beijing, is not only a significant historical landmark but also an important remnant of China's ancient imperial sacrificial architecture. Although some studies have focused on the Diqi Altar, such as its ritual hierarchy and craftsmanship as recorded in historical texts, certain research gaps remain. Due to the damage to the altar structure and insufficient documentation in relevant literature regarding its structural form, platform base specifications, and stylistic evidence, systematic research on restoration techniques remains relatively scarce. There is a need to reconstruct evidence based on architectural principles. Addressing this critical gap is of great significance for understanding the technical achievements and ceremonial principles of official architecture during the Ming and Qing dynasties, and for guiding the restoration and preservation of ancient buildings.

1:45pm - 2:00pm

A Digital Restoration Method for Earth God Altars from Discrete Components to Scene Reconstruction

Sining Li¹, Nan Meng^1,2, Tao Zhang^1,3, Lili Jiang^1,4, Miaole Hou^1,5

¹Beijing University of Civil Engineering and Architecture, China, People's Republic of; ²Ancient Chinese Architecture Museum,China,People's Republic of; ³Beijing Institute of Archaeology,China,People's Republic of; ⁴Beijing Digsur Science & Technology Com. Ltd,China,People's Republic of; ⁵Beijing University of Civil Engineering and Architecture, China, People's Republic of

The digital preservation of open-air sites often faces multiple challenges, such as dispersed components, varied forms, and missing historical records. In response, this study focuses on the Beijing Dizhitan and proposes and implements an innovative workflow that deeply integrates architectural morphology theories, archaeological typological methods, and modern digital technologies. This workflow systematically constructs a complete methodological chain, from the semantic annotation, classification, and virtual assembly of stone components, to the virtual restoration and model reconstruction of the site, ultimately achieving scenario-level restoration and display evaluation. The successful restoration of the Dizhitan demonstrates that this approach not only effectively "revives" dispersed components, placing them in their proper positions in a virtual space, but also pioneers a replicable new paradigm that embeds rigorous academic research throughout the digital process. This provides an entirely new technical approach and perspective for the preservation, study, and interpretation of immovable open-air cultural relics.

2:00pm - 2:15pm

Building a Multimodal Dataset of Rock Art: Integrating Text, Images, and 3D Point Clouds

Dongxu Huo, Chenxu Nie, Miaole Hou

Chang'an University, China, People's Republic of

This paper addresses the limitations of single-modal data in rock art cultural heritage preservation, such as incomplete information and fragmented semantics. It proposes a method for constructing a multimodal dataset that integrates text, images, and 3D point clouds. Text data is structured and semantically annotated using the ArchaeoBERT model; image data is obtained through web scraping, annotation, and augmentation; and point cloud data is captured using laser scanning, noise reduction, and registration techniques. Feature mapping alignment is employed, combining CNN, BERT, and PointNet++ to extract features and generate unified vector representations. Through a three-level quality control process, the data is accurate and reliable, with information coverage increased by 47.3%. This dataset achieves comprehensive integration of semantic, visual, and spatial information, providing a multidimensional data foundation and practical reference for the digital preservation, 3D reconstruction, and cross-modal retrieval of rock art.

2:15pm - 2:30pm

Monocular Depth Estimation from UAV images for 3D documentation of architectural heritage: a Depth Anything v2-based approach

Andrea Maria Lingua¹, Filiberto Chiabrando², Francesca Gallitto¹, Stefania Manca¹, Alessio Martino², Francesca Matrone¹, Alessandra Spadaro¹

¹Politecnico di Torino (DIATI), Italy; ²Politecnico di Torino (DAD), Italy

The rapid evolution of Monocular Depth Estimation (MDE) models — and in particular the emergence of recent foundation models such as Depth Anything v2 (Yang et al., 2024; Ranftl et al., 2022) — is opening concrete perspectives for the application of artificial intelligence in architectural and cultural heritage surveying.

This research aims to assess the feasibility of employing such models to obtain metric depth estimations from UAV imagery, acquired in both oblique and nadir views, with the broader goal of integrating neural networks into 3D documentation, HBIM, and GIS workflows for built heritage.

The Depth Anything v2 models were trained initially for ground-level scenarios, where the camera typically operates 1–2 m above the ground, with horizon distances extending up to 60–80 m. When applied to aerial imagery, particularly drone-based acquisitions, this results in a substantial domain gap: the network tends to interpret top-down landscapes as distant horizons, thereby compressing the depth scale.

To address this issue, this study develops an experimental calibration and adaptation procedure aimed at transforming the depth maps produced by the model into metrically consistent estimates that are coherent with architectural reality.

1:30pm - 2:45pm

SpS2: ISO Data Quality Measures Register and the ISPRS Community
Location: 716B

1:30pm - 1:45pm

From Data Standards to GeoAI Governance: Strengthening Data Quality and Trust in the Next Era of Geospatial Intelligence

Nadine Alameh

LunateAI, United States of America

We present an approach to ensure trustworthy, high-quality GeoAI which requires a coordinated effort across academia, industry, government, and standards bodies. The ISPRS community, in partnership with organizations such as ISO, OGC, the World Geospatial Industry Council (WGIC), the International Society for Digital Earth (ISDE), and other international initiatives is uniquely positioned to host this dialog. By aligning emerging GeoAI practices with established data-quality standards and ethical-AI frameworks, the community can help shape a future-proof foundation for responsible innovation in geospatial intelligence.

1:45pm - 2:00pm

Adding Data Quality and Licensing Aspects to Open Science Workflows

Robert Atkinson¹, Ivana Ivanova²

¹Open Geospatial Consortium; ²Curtin University, Australia

This paper presents research on integrating Data Quality and Licensing metadata into Open Science workflows using ISO and OGC standards and machine-readable profiles to enhance interoperability, transparency, and reusability of scientific data. All if this is possible via ongoing development of modular building blocks, validation frameworks, and engagement with standards bodies to support FAIR principles and scalable data reuse across domains. Our approach demonstrates the possible integration of concept schemes and measures defined in the ISO 19157 multipart standard.

2:00pm - 2:15pm

ISO 19157-3 Data quality measures register for geographic information: What is it, what can we do with it and why is it benefitial for the ISPRS and wider geocommunity?

Ivana Ivanova¹, Torsten Svard², Robert Atkinson³

¹Curtin University, Australia; ²Lamtmateriet, The Swedish mapping, cadastral and land registration authority; ³Open Geospatial Consortium

This paper presents the ISO 19157-3 Data quality measures register, discusses its design and implementation and illustrates its the utility to the ISPRS and wider geocommunity. In this paper we highlight the importance of providing geographic metadata about quality, the evolution of international standards to support this, and a novel implementation of a human readable and machine-actionable web register for geographic data quality measures.

2:15pm - 2:30pm

Investigating the Role of Post-Quantum Cryptography in Enhancing Blockchain-Based Geospatial Data Exchange

Darshana Rawal, Jan Seedorf, Atharva Chaudhari, Matthias Hamann

Hochschule für Technik Stuttgart, Germany

The rapid growth of geospatial data, fueled by advancements in satellite imagery, IoT sensors, and mobile services, presents significant opportunities in sectors like urban planning and environmental monitoring. However, these data are also vulnerable to cyber threats, emphasizing the need for strong protection mechanisms. This paper introduces a modular, hybrid architecture that addresses security challenges by integrating post-quantum cryptography, decentralized storage, and access control via Blockchain. It employs AES-GCM for the secure encryption of large datasets and Kyber for enhanced key protection against quantum threats. Encrypted data is stored securely in the Interplanetary File System (IPFS), with access managed by smart contracts on a private Ethereum blockchain. The architecture utilizes FastAPI for back-end processes, microservices for cryptographic services, and React for the user interface. Performance assessments show good scalability and resilience, paving the way for secure geospatial data sharing while harmonizing data sovereignty, quantum security, and decentralized management.

2:30pm - 2:45pm

Benchmarking the Quality of High-Resolution Global Land Cover Products: Toward a Shared Framework for Assessment

Qiongjie Xu¹, Hao Wu², Jun Chen², Maria Antonia Brovelli¹

¹Politecnico di Milano, Italy, Department of Civil and Environmental Engineering; ²Moganshan Geospatial Information Laboratory, Zhejiang Province, China

High-Resolution Global Land Cover (HRLC) products are essential for monitoring Earth’s surface dynamics and supporting policy frameworks like the Sustainable Development Goals. Recent global products such as ESA WorldCover, ESRI LULC, FROM-GLC, and Dynamic World offer 10–30 m resolution maps, but their interoperability remains limited due to differences in input data, class legends, and validation protocols. This lack of harmonization hampers cross-comparison and integrated use for environmental monitoring. Although advances in remote sensing, AI, and cloud computing have enabled more frequent and detailed mapping, they have also introduced new challenges for ensuring data consistency and comparability.

Validation of HRLC products is hindered by the absence of a common benchmark dataset, as current accuracy metrics are derived from heterogeneous reference samples and class definitions. Traditional validation methods are costly and time consuming, while temporal inconsistencies and cloud contamination further increase uncertainty. ISO 19157-3 offers a standardized framework to describe and automate quality measures such as positional accuracy and thematic correctness, supporting transparent and reproducible evaluation across datasets.

A sustainable solution involves establishing an international benchmarking framework with standardized reference data, legends, and sampling strategies. As a practical interim approach, the Map of Land Cover Agreement (MOLCA) combines multiple HRLC products to identify spatial consensus and disagreement, offering a proxy for thematic reliability. Although MOLCA measures consistency rather than absolute accuracy, its integration into ISO 19157-3 would advance data quality assessment, fostering transparency, interoperability, and confidence in HRLC-derived environmental analyses.

1:30pm - 2:45pm

ThS13: CO3D Mission
Location: 717A

1:30pm - 1:45pm

The CO3D mission, a worldwide one-meter accuracy Digital Surface Model

Laurent Lebegue, Fabrice Buffe, Olivier Melet, David Youssefi, Jean-Baptiste Henry

CNES, France

The goal of the CO3D (Constellation Optique 3D) mission is the full-automatic production of a worldwide accurate DSM (Digital Surface Model). This DSM is generated from stereo acquisitions obtained from a new generation of high-resolution optical satellites, called CO3D. The DSM accuracy is one meter in relative height for moderate slopes and four meters in absolute height with a one-meter grid space. Each of the four satellites of the constellation provides images with 0.50 m resolution in red, green, blue and near-infrared bands. The satellites resource is shared by, on one hand, the French institutions (government, scientists concerned by global Earth monitoring) who have dedicated access and preferred price conditions, and on the other hand, ADS (Airbus Defence and Space) customers interested in 2D and 3D products. The constellation was launched on July 26, 2025

1:45pm - 2:00pm

3D Product Quality Control in the CO3D Mission: A Critical Role

David Youssefi¹, Timothée Accadia¹, Emilie Colleu², Aymeric Godet², Valentine Bellet¹, Dimitri Lallement¹, Yannick Tanguy¹, Laurent Lebegue¹

¹CNES, France; ²IGN, Service de l’imagerie spatiale, France

We present the qualification of 3D products as part of the CO3D mission. The CO3D mission is dedicated to creating a digital surface model of the Earth's landmass cover. Massive automatic production is a challenge in itself, as this ground segment produces more advanced data for an optical mission. Firstly, the 3D product, which is generally retouched and checked, which can represent a significant cost. In the case of the CO3D mission, these products will be generated completely automatically. Masks will also be included to describe the processing history and provide precise information on the altitudes restored. All of this data requires detailed qualification with precise reference data and methods to best reflect its quality. The paper will describe all of these methods and data and provide an overview of the performance of these new CO3D products.

2:00pm - 2:15pm

CO3D image quality calibration

BUFFE Fabrice¹, LUCAS Sylvain¹, LATOURTE Alice¹, ACCADIA Timothée¹, YOUSSEFI David¹, LEBEGUE Laurent¹, COGGIOLA Clément¹, BRUNO Mickaël¹, LUITOT Clément¹, DICK Arthur¹, DELAYGUE Nicolas¹, THOMAS Colin², SCHILLINGER Marc², TANGUY Yannick¹, CHAMBRELAN Alice²

¹CNES, France; ²Airbus Defense and Space, France

The launch of the four CO3D spacecrafts took place on 26th July 2025 aboard Vega-C from Kourou space center with Microcarb microsatellite. After a first week spent calibrating the most critical subsystems of the spacecraft, the instrument was switched on, enabling the 9-months Image Quality commissioning phase to begin. Images are acquired in RGB and NIR spectral bands with a 50 cm Ground Sampling Distance (GSD), thanks to matrix sensors based on a Bayer pattern. This brings new calibration challenges such as demosaicing and 2D line of sight determination. 3D calibration activities take place in a second stage of the commissioning phase once radiometric and geometric calibration are finalized.

2:15pm - 2:30pm

CO(ast)3D: a predictive pipeline for CO3D satellite imagery acquisition decisions

Elora M. H. Oades¹, Erwin W. J. Bergsma¹, Stephanie Artigues¹, Déborah Idier², Marcello de Michele², Daniel Raucoules², Laurent Lebegue¹, Jean-Marc Delvit¹

¹CNES (Centre Nationale d'Études Spatiales); ²BRGM (Bureau de Recherches Géologiques et Minières)

This work introduces CO(ast)3D, a predictive pipeline that helps identify when upcoming CO3D satellite overpasses are likely to capture surface wave signals suitable for bathymetry acquisitions. Because CO3D cannot directly image the seafloor, depth must be inferred from the optical expression of surface waves, whose visibility depends strongly on illumination, viewing geometry, and sea state.

The pipeline combines CO3D orbital tracks with forecasted wave parameters from the Copernicus Marine model to construct a directional wave spectrum, generate a time-varying free surface through linear wave theory, and simulate CO3D-like radiance images at native spatial resolution. These synthetic scenes allow the clarity of the wave field to be evaluated a priori for any future time and location.

By predicting whether conditions will yield a sufficiently coherent wave signal, the system supports more efficient tasking, reduces acquisition risk, and improves the likelihood of capturing images suitable for accurate bathymetric inversion.

2:30pm - 2:45pm

CNES CO3D Image Ground Segment

Olivier Melet, Celine L'Helguen, Laure Pradines, David Youssefi, Valentine Bellet

CNES, France

The challenges, main design elements, and results of the CODIP and ICC components will be presented in this paper, as well as the modalities for accessing and using the CO3D products.

3:00pm - 3:30pm

Afternoon Coffee Break
Location: Exhibition Hall "E"

3:30pm - 5:15pm

Opening Ceremony
Location: Exhibition Hall "G"

Awards Ceremony

ISPRS Honorary Member
Brock Gold Medal Award
Val Shaw Memorial Award

5:30pm - 7:30pm

Congress Welcome Reception

Awards Ceremony:

CP Lo Award
ISPRS WEC Kennert Torlegård Travel Grant

Date: Monday, 06-July-2026

8:30am - 10:00am

WG III/1A: Remote Sensing Data Processing and Understanding
Location: 713A

8:30am - 8:45am

Cube Kernel: A Novel Approach to Enable Local Gradient Flow Across Channels in CNNs

Zhimeng HE, Yuwei Cai, Meiliu Wu, Xinyan Xian, Brian Barrett

University of Glasgow, United Kingdom

Understanding inter-band and cross-channel relationships is essential for human color perception and object recognition. Yet, local gradients in standard convolutions are tied to fixed input–output channel pairs, and thus channels are fused by a dense, fully-coupled weight tensor: each output channel aggregates all input channels in a uniform way at every spatial location. This leads to heavy computation and does not exploit structured sparsity or selective local channel mixing. To overcome this limitation, we introduce Cube Kernel, a novel convolutional operator that introduces structured cross-channel groups into the local gradient. This design strengthens cross-channel feature fusion, improves optimization efficiency, and reduces computational overhead. Extensive building extraction experiments validate its effectiveness: Cube Kernel consistently outperforms standard convolutions and Involution when integrated into UNet, and replacing a single layer in DeepLabV3+, Swin-UNet, or UNet leads to consistent performance gains. Beyond serving as a lightweight plug-in module, Cube Kernel also scales effectively as a fundamental building block. A Cube-enhanced ConvNeXt variant, ConvNeXt-Cube, achieves state-of-the-art performance across all models (0.9095 IoU / 0.9535 F1 on WBD and 0.9133 IoU / 0.9547 F1 on WHU), demonstrating strong stackability and architectural potential. These results highlight a largely overlooked space in CNN design: enhancing cross-channel interaction at the gradient level. Cube Kernel offers a scalable and efficient alternative to deepen networks for channel mixing, laying a foundation for future advancements in convolutional architecture design.

8:45am - 9:00am

Land Surface Dynamics Modeling and Prediction with dual Latent-Space Representations

Keli Shi^1,2, Zheng Zhang¹, Liang Tang³, Wenhe Xu⁴, Xiaojun Shan¹, Ping Tang¹

¹Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China; ²School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing, China; ³Yazhou Bay Innovation Institute, Hainan Tropical Ocean University, Sanya, China; ⁴The University of Hong Kong, Hong Kong, China

Modeling land surface dynamics from satellite observations is crucial for revealing change patterns and predicting future states, although effective modeling methods remain limited.

For complex systems such as reaction-diffusion, two approaches have proven particularly effective:

(i) Direct modeling in the high-dimensional observation space with deep networks(e.g., (Wang et al., 2022)). These methods are often autoregressive. Errors accumulate during rolling extrapolation.

(ii) Modeling in a reduced-dimensional latent space(e.g., (Chen et al., 2022)). One reduces dimension and then learns the evolution. Some works estimate the intrinsic dimension (ID) and model in the ID latent space.

This improves long-term stability, but reliance on latent representations may reduce accuracy.

This route is promising if two issues are addressed:

(1) effectively modeling multi-scale spatiotemporal data with long sampling intervals;

(2) combining ID-space modeling with other latent dimensions to balance accuracy and stability.

This paper proposes a Dual Latent-Space Representation-based Land Surface Dynamic Model （DLS-LSDM）. The core contributions are:

(1) a stacked-convolution and multi-scale linear-attention autoencoder to obtain a base latent, together with ID estimation to derive an ID latent;

(2) a long-horizon scheme that combines ID and base latents to achieve both stability and high accuracy ;

(3) comprehensive evaluation on ten-year MODIS NDVI across multiple climate zones, demonstrating superiority.

9:00am - 9:15am

Revealing Feature Contribution Mechanisms for Interpretable CNN-Transformer Remote Sensing Classification

He Chen¹, Xianwei Zheng¹, Wei He¹, Jiansi Yang¹, Linwei Yue², Ting Hu³, Jianya gong¹

¹Wuhan university; ²China University of Geosciences; ³Nanjing University of Information Science and Technology

Deep learning models have become the backbone of remote sensing image intelligent classification, enabling high-precision recognition of land cover, geospatial objects, and scene categories. However, their inherent "black-box" nature—where decision logic is embedded in complex parameter spaces—poses critical barriers to deployment in high-stakes domains such as military reconnaissance, disaster monitoring, and environmental governance. These fields demand transparent reasoning to validate model reliability, yet traditional interpretability methods suffer from two key limitations when applied to remote sensing data: They are primarily designed for natural images, failing to account for remote sensing-specific characteristics. They focus on local feature attribution or saliency mapping but lack quantitative analysis of how core image features (shape, texture, spectrum) contribute to global classification decisions, especially across different network architectures.To address these problem, this study proposes a comprehensive feature contribution analysis framework tailored to remote sensing images, with the core objectives of: (1) Decoupling and extracting shape, texture, and spectrum features from remote sensing images in a physically meaningful manner; (2) Quantifying the contribution of each feature type to classification decisions; (3) Revealing differences in feature processing mechanisms between CNN and Transformer architectures.

9:15am - 9:30am

EfficientViM-CD: An Efficient Remote Sensing Change Detection Network Based on Hidden State-Mixer

haiming zhang¹, hongyang fan²

¹State Key Laboratory of Information Engineering in Surveying , Mapping and Remote Sensing, wuhan university, China, People's Republic of; ²School of Information Science and Engineering, Wuchang Shouyi University

High-resolution optical remote sensing change detection (CD) is of great significance in urban evolution monitoring, disaster assessment, and land management. Traditional deep models often face computational, memory, and inference latency bottlenecks when processing large high-resolution imagery. To address this, we propose EfficientViM-CD: a Hidden-State Mixer based efficient remote sensing change detection network. The approach builds upon the EfficientViM backbone, migrating global interaction operations into a compact hidden state space and leveraging Hidden State Mixer based on state space duality (HSM-SSD) to fuse global context while reducing computational complexity. We employ a Siamese encoding architecture to extract multi-scale features and hidden states from paired temporal images, and utilize a Cross-Hidden Fusion module to integrate hidden semantic interactions between time points. At each scale, local difference features are computed and enhanced in hidden state space, and a multi-scale decoder reconstructs a pixel-level change probability map. We conducted experiments on four public datasets (LEVIR-CD+, WHU-CD, S2Looking, SVCD) and compared against nine state-of-the-art methods. Results demonstrate that EfficientViM-CD achieves competitive accuracy while delivering significant advantages in inference speed and memory efficiency. This method offers a lightweight, efficient, and scalable solution for high-resolution remote sensing change detection, with potential for real-time monitoring and emergency response systems.

9:30am - 9:45am

Local NMS: Enhancing Object Detection in Large-Scale Remote Sensing Images via iterative pipelined Postprocessing

Bettina Felten, Wolfgang Gross, Andreas Michel

Fraunhofer IOSB, Germany

Object detection in large, dense remote sensing imagery is difficult because targets are often small and arbitrarily oriented, and state-of-the-art detectors cannot process very large images directly without a reduction in accuracy. Tiling-based inference workflows mitigate the latter issue by running inference iteratively on overlapping tiles, but introduce pre- and postprocessing overhead for image tiling and Non-Maximum Suppression (NMS). We introduce local NMS, an asynchronous tile-wise postprocessing scheme. Local NMS runs in a separate subprocess in parallel to tile-wise inference and collects intermediate results enqueued by the inference process, immediately applying postprocessing. Intelligent reordering of tiles in a preprocessing step ensures optimal usage of computing resources. We assess our method using three state-of-the art object detection models for horizontal and oriented bounding box detection on two benchmark datasets containing large dense aerial and satellite images, DOTA-v2.0 and Izembek Lagoon Birds, stratifying by image size and average object density. Local NMS consistently reduces end-to-end runtime across models and datasets without significant impact on mAP. A maximum runtime reduction of 60.77% on large dense DOTA-v2.0 scenes could be achieved without modifying model architectures or retraining.

9:45am - 10:00am

ERD: Extended RAW-Diffusion Framework for De-rendering sRGB Images

Jiaqi Shang¹, Yifan Qu¹, Jianbo Qi²

¹Department of Computer Science, University of Toronto, Canada; ²Faculty of Geographical Science, Beijing Normal University, China

Recovering RAW sensor measurements from rendered sRGB images is important for radiometric calibration, low-level vision, and computational photography. However, reversing a camera’s proprietary Image Signal Pipeline (ISP) is highly challenging, especially when the ISP is unknown. Existing inverse-ISP and diffusion-based approaches have several issues: they depend on known ISPs from the sensor, require one model per sensor, or generalize poorly across camera brands.

This work presents ERD (Extended RAW-Diffusion), a unified diffusion-model framework for de-rendering sRGB images into RAW format for any given image, and does not require ISP to be known or camera information from the image. ERD extends the RAW-Diffusion architecture by incorporating camera metadata only during training, allowing the model to learn a shared representation across heterogeneous sensors. To capture global sensor characteristics, ERD introduces a conditioning mechanism, Feature-wise Linear Modulation (FiLM) for global features such as CFA patterns and color gains. To enhance structural consistency, ERD integrates a ControlNet branch that injects edge and gradient priors derived from the sRGB input, stabilizing RAW reconstruction under diverse tone-mapping operations. For practical adaptation to new sensors, ERD supports efficient few-shot tuning via LoRA.

Evaluations on Adobe FiveK (Nikon and Canon) and RAW-NOD (Nikon and Sony) show that ERD outperforms state-of-the-art baselines in PSNR and SSIM, offering improved robustness to unseen camera models. ERD enables a practical, general-purpose inverse ISP process across heterogeneous imaging devices.

8:30am - 10:00am

WG IV/2A: Artificial Intelligence and Uncertainty Modeling in Spatial Analysis
Location: 713B

8:30am - 8:45am

KG-MS-ResNet: A Knowledge-Guided Multi-Scale Attention Residual Network for Cultivated Land Change Monitoring

Jun Zhang¹, Hao Wu¹, Haomiao Shang^1,2,5, Dongyang Hou³, Chunju Zhang⁴

¹National Geomatics Center of China, Beijing,China, 100830; ²China University of Mining & Technology(Beijing), Beijing, China, 100083; ³School of Geoscience and Information Physics, Central South University, Changsha, China, 410083; ⁴School of Civil Engineering, Hefei University of Technology, Hefei, China, 230009; ⁵Corresponding author

Cultivated land conversion to built-up area is a core form of farmland non-agriculturalization and a major threat to farmland protection in China. Current remote sensing methods for detecting such changes face two limitations: insufficient integration of domain prior knowledge and the inability of purely data-driven models to achieve both high Precision and Recall. To address these issues, this study proposes a knowledge graph-enhanced change detection method. A multi-scale knowledge analysis framework incorporating feature, scene, and business knowledge layers is constructed to systematically integrate multi-source geographic information into structured semantic representations. A knowledge fusion residual network, KG-MS-ResNet, is designed based on ResNet-18 with modifications to the first convolutional layer for bi-temporal image inputs. TransE embeds geographic indicator knowledge into multi-scale semantic vectors, while a semantic–feature dual-path fusion strategy and a knowledge-guided attention mechanism enable deep coupling between image features and domain knowledge. Experiments in Pei County, Jiangsu Province, show that the proposed method outperforms baseline ResNet across all metrics, with Recall increasing by 4.84 percentage points and F1-score by 0.0752. The results demonstrate that integrating domain knowledge graphs with deep learning significantly improves detection performance, offering a semantically interpretable solution for monitoring cultivated land non-agriculturalization and advancing the integration of knowledge-driven and data-driven approaches in intelligent remote sensing interpretation.

8:45am - 9:00am

Road Change Detection for Map Updating Using Geometric Boundary Deviation Between Digital Maps and Aerial Segmentation Results

Juhee Lee¹, Chong Lee¹, Inhyeok Lee², Jangwoo Cheon², An Bui¹, Impyeong Lee¹

¹Dept. of Geoinformatics, University of Seoul, Seoul, Republic of Korea; ²Geospatial Team, InnoPAM, Seoul, Republic of Korea

Road change detection is essential for maintaining up-to-date digital maps; however, conventional update processes rely heavily on the manual interpretation of aerial imagery, leading to high labor costs and inconsistent outcomes. To address these limitations, this study proposes an automated road change detection method that integrates aerial orthophoto-based segmentation with geometric boundary deviation analysis. Road areas are first extracted from high-resolution aerial orthophotos using SegFormer, a Transformer based semantic segmentation model. The segmentation results are then converted into vector polygons for geometric analysis. Structural changes, such as newly constructed or removed roads, are detected through a difference-based comparison with historical digital maps. Simultaneously, shape changes are quantitatively analyzed by measuring geometric deviations between road boundaries.

Specifically, vertex-wise distances between corresponding boundaries are computed, and the overall deformation is evaluated using Root Mean Square Error (RMSE), incorporating Z-score-based outlier removal to ensure robustness against noise. Experimental results demonstrate that the proposed method effectively detects both structural changes and subtle geometric variations, including road expansions and boundary shifts. Furthermore, the method enables clear object-level classification of change types, providing a practical and efficient framework for digital map updating workflows.

9:00am - 9:15am

Local Rank-Based Prior Calibration and Graph-Cut Refinement for Building Change Detection

Chong Lee¹, Inhyeok Lee², Jangwoo Cheon², An Bui¹, Juhee Lee¹, Impyeong Lee¹

¹Dept. of Geoinformatics, University of Seoul, Seoul, Republic of Korea; ²Geospatial Team, InnoPAM, Seoul, Republic of Korea

Accurate building change detection depends on how well building boundaries are delineated, as distortions and merging errors hinder reliable correspondence. In dense urban areas, deep learning models frequently merge adjacent buildings—especially within narrow gaps—producing structural inconsistencies that lead to change detection errors. We propose a post-processing method integrating Local Rank-Based Prior Calibration, which reinterprets Softmax probabilities as percentile-based local ranks, with Graph-Cut refinement for structural correction. The refined mask is matched with historical building data to classify four change types. Experiments using aerial imagery from Seoul show that the method reduces structural errors, lowering under-segmentation from 51.64% to 22.02% and improving IoU from 0.748 to 0.759. In change detection, it increases the mean F1-score from 0.522 to 0.608 and improves all classes, including new construction, whose F1-score rises from 0.269 to 0.707. Ablation studies confirm that calibration and graph-based refinement both contribute to the improvements. These results show that stabilizing segmentation outputs enhances the reliability of building-level change detection in dense urban environments.

9:15am - 9:30am

Automated Geometric Correction of OpenStreetMap Buildings via Context- and Boundary-Aware Segmentation

Inhyeok Lee¹, Chong Lee², Jangwoo Cheon¹, An Bui², Juhee Lee², Impyeong Lee²

¹Geospatial Team, InnoPAM, Seoul, Republic of Korea; ²Dept. of Geoinformatics, University of Seoul, Seoul, Republic of Korea

OpenStreetMap (OSM) is a representative open geospatial platform that provides free access to major spatial objects, including buildings worldwide, constructed through crowdsourcing-based manual digitization. However, subjective differences among contributors and the absence of unified quality control standards have led to the accumulation of positional offsets and boundary shape errors in building polygons. To address this issue, studies using deep learning-based semantic segmentation for OSM quality improvement have been conducted. Nevertheless, Transformer-based segmentation models exhibit an under-segmentation tendency that merges adjacent buildings into a single object, along with limitations in precise boundary delineation. To overcome these challenges, this study proposes a two-stage framework that integrates SegFormer, which excels in global context recognition, with SAM 2, which is capable of precise boundary segmentation. In the first stage, SegFormer semantically segments building regions from a true orthoimage, and in the second stage, SAM 2 infers object-level precise boundaries using the bounding boxes of OSM polygons as box prompts. The two results are combined into a prior probability map, enabling uncertain boundary regions to be re-evaluated in an unsupervised manner. In experiments conducted over the Suseo-dong area in Gangnam-gu, Seoul, the proposed method achieved a BIoU of 70.40%, an improvement of 23.85 percentage points over OSM building data, with consistent performance gains across all evaluation metrics. This framework offers scalability applicable to any region worldwide without additional label construction, provided that high-resolution true orthoimagery and OSM data are available.

9:30am - 9:45am

Improving building footprint extraction using NAIP and 3DEP lidar derived features with deep learning

Jung kuan Liu¹, Rongjun Qin², Samantha Arundel¹, Lexie Yang³

¹USGS, United States of America; ²The Ohio State University, United States of America; ³Oak Ridge National Laboratory, United States of America

Accurate building footprint extraction is critical for applications ranging from population estimation to disaster management. Although optical imagery provides detailed spectral information, it often struggles with shadows, occlusions, and background clutter in dense urban environments. Lidar data, by contrast, offer precise elevation and structural attributes but face challenges such as variable point density and noise. This study integrates multispectral imagery from the U.S. Department of Agriculture (USDA) National Agriculture Imagery Program (NAIP) with lidar-derived feature height and intensity from the U.S. Geological Survey (USGS) 3D Elevation Program (3DEP) to improve footprint extraction using a U-Net–based deep learning model. A six-band input stack (RGB, near-infrared, height, intensity) was developed, normalized, and tiled for training and evaluation against Microsoft Global Building Footprints (GBF). Results from the Houston, TX test site show that the six-band model achieved a precision of 0.86, recall of 0.88, F1 score of 0.87, and Intersection-over-Union (IoU) of 0.76, consistently outperforming four-band baselines by reducing false positives while maintaining sensitivity. Predictions on withheld Houston tiles confirmed strong within-region generalization, yielded a precision of 0.78, recall of 0.81, F1 score of 0.79, and IoU of 0.66. Qualitative analysis further revealed limitations stemming from both training label quality and vegetation–building confusion. These findings demonstrate the complementary value of integrating spectral and structural information for robust building footprint extraction and how domain adaptation strategies can be used to enhance cross-regional transferability.

9:45am - 10:00am

Benchmarking a Lightweight Model for Pothole Detection in Asphalt Pavements

Laura Torres Peixoto Pimentel, Vivian de Oliveira Fernandes, Caroline Dias Amancio de Lima

UFBA, Brazil

This contribution presents a benchmarking study of a lightweight deep learning model for automatic pothole detection in asphalt pavements. Accurate and cost-effective identification of surface distresses is essential for road safety and for prioritising maintenance, especially in cities where traditional visual surveys are still predominant. We adapt and train a compact YOLO-based object detection architecture on a dataset of annotated street-level images, covering different lighting conditions, pavement textures and distress severities. The study evaluates how input resolution, confidence thresholds and data augmentation strategies affect detection performance and inference speed, and compares the lightweight model with heavier state-of-the-art detectors. Results indicate that it is possible to obtain competitive accuracy while maintaining real-time processing capabilities on modest hardware, which is crucial for deployment in mobile inspection platforms such as smartphones, dashcams or low-cost onboard units. The paper discusses opportunities and limitations of integrating deep learning into pavement management systems and outlines perspectives for extending the approach to other types of defects and to larger road networks.

8:30am - 10:00am

WG II/7B: Underwater Data Acquisition and Processing
Location: 714A

8:30am - 8:45am

Refraction-aware integrated Georeferencing of bathymetric Laser Scanning Data

Gottfried Mandlburger¹, Lucas Dammert¹, Jan Rhomberg-Kauert¹, Florian Pöppl², Martin Pfennigbauer²

¹TU Wien, Department of Geodesy and Geoinformation, Austria; ²RIEGL Laser Measurement Systems GmbH, Austria

Bathymetric Laser Scanning (BLS) enables high-resolution mapping of underwater topography using green-wavelength laser pulses that penetrate the water column. However, precise georeferencing of the BLS data is affected by refraction at the air–water interface, which displaces submerged features and affects conventional strip adjustment methods. This paper introduces an integrated refraction-aware georeferencing workflow that combines refraction correction with trajectory and boresight optimization within a unified adjustment framework. Implemented using the scientific OPALS laser scanning software, the workflow starts with direct georeferencing of uncorrected laser returns, derives a water surface model, applies Snell’s law-based refraction correction, and performs iterative strip adjustment until convergence. The approach was validated using UAV-borne topo-bathymetric LiDAR data from Lake Alm (Almsee) in Upper Austria, captured with a \emph{RIEGL} VQ-840-GE sensor system. Comparative analysis across multiple processing scenarios demonstrates that the proposed integrated method significantly improves internal consistency between overlapping flight strips. The residual height discrepancies, quantified by the median absolute deviation were reduced from 4.5 cm using standard processing workflows to 2.1 cm with the integrated approach — an improvement exceeding 50%. A single processing pass was sufficient for the relatively calm conditions of the test site, though iterative refinement may benefit more dynamic water surfaces. The presented methodology is generic and can be embedded in any laser scanning framework supporting modular georeferencing and refraction correction.

8:45am - 9:00am

Automated classification of coastal defense structures using airborne bathymetric LiDAR

Jan Rhomberg-Kauert¹, Lucas Dammert¹, Gottfried Mandlburger¹, Tomasz Kogut², Arkadiusz Tomczak², Małgorzata Jarząbek-Rychard³

¹Department of Geodesy and Geoinformation, TU Wien; 1040 Vienna, Austria; ²Faculty of Geoengineering and Environmental Protection, Maritime University of Szczecin; ³Institute of Geodesy and Geoinformatics, Wrocław University of Environmental and Life Sciences

Coastal defense structures, such as breakwaters and groynes are an integral part of coastal engineering. These structures reduce the impact of waves and decrease beach erosion, but due to the constant forces to which they are exposed, repeated monitoring and evaluation is vital to the analysis of their structural integrity. However, coastal defense structures are most often located in the turbulent waters of the surf zone, which characteristics pose severe challenges for current methods. For example, waves pose challenges for image-based analysis, shallow-water limits sonar-based measurements, and currents, represent hazardous environments for surveying personnel. Here, recent advances in topo-bathymetric LiDAR have improved the ability to map data above and below the water surface within the same survey. In the field of structural engineering, point cloud data is already a commonly used information, and thus its applications in the monitoring of coastal defense structures present a natural extension of existing structural monitoring methods. Therefore, this study presents an automatic method for the detection of coastal defense structures with bathymetric LiDAR. The surveyed area consists of multiple groynes located along the Polish coast, which were surveyed using an airplane-based topo-bathymetric LiDAR scanner. The presented method then leverages the echo ratio and repeated clustering to extract the groynes from the data. We evaluate the extracted structures in comparison to manually annotated data. The results of this evaluation display a balanced accuracy of 92%, indicating an overall match with the reference data, but showing challenges and improvements for future work.

9:00am - 9:15am

Accuracy assessment of bathymetric LiDAR using planar reference geometries and total station measurements

Lucas Dammert¹, Jan Rhomberg-Kauert¹, Philipp Amon², Hans-Berndt Neuner¹, Gottfried Mandlburger¹

¹Technische Universität Wien, Austria; ²Riegl Laser Measurement Systems GmbH

A state-of-the-art LiDAR sensor is assessed in terms of the accuracy, described as the sum of trueness and precision, of terrestrial and submerged points. The reference, against which the LiDAR data are evaluated, are conducted with a total station and can be assumed to show an uncertainty of less than 1 cm even for the submerged points. We find that the GNSS-based data set shows a systematic bias of about (-4, 7, 7) cm which can be defined as trueness and does not represent the quality of the LiDAR sensor but mostly of geo-referencing. The precision, which is a measure mostly influenced by the LiDAR sensor itself, is at 0.8 to 2.0 cm for terrestrial points and slightly worse with 1.1 to 2.6 cm for bathymetric points. Our study considers depths of up to 3 m and uses more than 300 points for the assessment.

9:15am - 9:30am

Mapping topobathymetry at ultra-high spatial resolution using RGB UAV and PlanetScope SuperDove neural network fusion

Antoine Collin^1,2, Laurine Nogue^1,2, Dorothee James^1,2, Coralie Monpert³, Laetitia Dupuy⁴, Franck Dolique⁴

¹Coastal GeoEcology Lab, EPHE-PSL University, France; ²Laboratory of Biology of Aquatic Organisms and Ecosystems, France; ³Service Hydrographique et Océanographique de la Marine, France; ⁴Laboratory of Biology of Aquatic Organisms and Ecosystems, Martinique, France

Worldwide coastal areas comprise environmental triple points (air, land and seawater) that cope with coastal risks at unprecedented rates of change. Wind- and wave-related acute hazards add up to the chronic sea-level rise on interface zones that increasingly host human population and assets. Those societal challenges need to be overcome using the most discriminant and finest remote sensors.

We present an innovative two-step methodology to produce an ultra-high spatial resolution (UHSR) topobathymetry using a fusion of a RGB camera mounted on an aerial drone with a multispectral satellite imagery provided with very high temporal resolution. The fusion relied on a DJI Zenmuse P1 (0,08 m pixel size) borne by a DJI Marice 300 RTK, the PlanetScope SuperDove imagery, provided with eight bands at 3 m, and linear or nonlinear (neural network with two hidden layers endowed with three neurons, each) regression. Once the fusion achieved, both topography and bathymetry were mapped using, either the digital surface model (DSM) derived from the drone-derived photogrammetry, or the DSM combined with the UHSR SuperDove imagery. Both datasets served as predictors to model a digital topobathymetric terrain LiDAR response using linear or neural network regression.

The best drone-satellite fusion was completed by the bandwise neural network regression, ranging from R2test of 0,79 for the purple to 0,94 for the red edge band. The UHSR topobathymetry has been mapped by merging the topography and the bathymetry, distinctly predicted by the combination of the DSM with the UHSR Superdove imagery (R2test of 0,68 and 0,92, respectively).

9:30am - 9:45am

Mapping at the Boundary: simultaneous above- and underwater Surveying of rocky coastal Environments with an uncrewed surface vehicle

Sergey Khokhlov¹, Fabio Menna^2,3, Erica Nocerino⁴

¹PhD programme in Culture, Literature, Rights, Tourism and Territory, Department of Humanities and Social Sciences, University of Sassari, Sassari, Italy; ²Department of Chemical, Physical, Mathematical and Natural Sciences, University of Sassari, Sassari, Italy; ³Department of Science and Technology, University of Napoli Parthenope, Napoli, Italy; ⁴Department of Humanities and Social Sciences, University of Sassari, Sassari, Italy

Rocky coastal environments are ecologically important areas where land and sea processes interact in complex ways. Monitoring these zones is challenging, as they include steep cliffs, partially submerged features, and narrow transition areas where traditional surveying methods often struggle. Several European environmental directives now emphasize the need for regular observation of these coastal systems, increasing demand for practical and accessible surveying tools.

This work presents the development and initial testing of a small uncrewed surface vehicle (USV) designed to collect images above and below the water surface at the same time. The platform is based on a commercially available catamaran-style drone and carries two GoPro cameras mounted on a rigid vertical rod, with one camera positioned above the water and the other just below it. Both cameras are synchronized using GPS time, and the system incorporates a PPK-capable GNSS receiver for improved positioning. The payload is wireless and modular, allowing the platform to be deployed quickly.

The main contribution of the system is its ability to document the air–water boundary in a single pass, reducing issues related to changing meteorological and sea conditions. The paper also discusses how the platform was tested at a rocky site in Sardinia and outlines the types of data that can be obtained for environmental mapping.

The approach aims to offer a low-cost, flexible option for coastal monitoring.

9:45am - 10:00am

Evaluation of an Underwater Laser Scanner and an Air-borne Laser Scanner in coastal shallow Waters

Annette Scheider¹, Sethmiya Herath Mudiyanselage¹, Christoph Werner², Ellen Heffner¹, Annika L. Walter¹, Jannis Gangelhoff², Harald Sternberg¹

¹HafenCity University Hamburg, Germany; ²Fraunhofer Institute for Physical Measurement Techniques IPM

Underwater laser scanners and air-borne laser scanners offer considerable potential for high-resolution monitoring of fine-scale underwater structures in shallow, clear waters. An underwater laser scanner mounted on a vessel is used for kinematic data acquisition in coastal waters. Additionally they are surveyed by an air-borne laser scanner. In this investigation, the resulting point clouds from both systems are analyzed in terms of their performance and achievable relative geometric quality.

10:00am - 10:15am

Reconstructing Multibeam Echosounder Bathymetry with Generative Adversarial Networks: Toward Efficient Use of Survey Resources

Anna Brook, Haitham Ezzy, Dror Angel

University of Haifa, Israel

The spatial accuracy and resolution of Multibeam Echosounder data are inherently lower than those of high-resolution underwater LiDAR measurements. However, while Multibeam Echosounder provides wide coverage and extensive historical availability, LiDAR is costly and covers relatively small areas. In this study, we propose an innovative approach to enhance Multibeam Echosounder resolution using a Super-Resolution Generative Adversarial Network with direct comparison to LiDAR data for accuracy assessment. The methodology involves converting Multibeam Echosounder data into grayscale format using various depth gradient techniques, analyzing differences in submarine geomorphology through calculations of slope and aspect, and evaluating statistical accuracy. The results show that the Super-Resolution Generative Adversarial Network model successfully improves Multibeam Echosounder resolution, producing data that closely correspond to LiDAR measurements, particularly in flat, sandy seabed areas. In contrast, regions with complex or rocky terrain exhibited more pronounced deviations, especially in aspect metrics, emphasizing the challenges associated with maintaining topographic orientation throughout the resolution enhancement process. The main conclusion is that enhancing Multibeam Echosounder data using Super-Resolution Generative Adversarial Network enables broader utilization of existing datasets to generate high-resolution models, offering a more cost-effective and accurate solution for seafloor mapping in areas where LiDAR data are unavailable.

8:30am - 10:00am

WG III/8A: Remote Sensing for Agricultural and Natural Ecosystems
Location: 714B

8:30am - 8:45am

Solar-Induced Fluorescence as a Robust Proxy for Vegetation Productivity Across Climate Zones and Vegetation Types in the United States

Luka Mamic^1,3,4, Mj Riches^2,5, Delphine K. Farmer², Francesco Pirotti^3,4

¹Sapienza University of Rome, Department of Civil, Building and Environmental Engineering, Italy; ²Colorado State University, Department of Chemistry, USA; ³University of Padua, Department of Land and Agroforestry Systems (TESAF), Italy; ⁴University of Padua, Interdepartmental Research Centre in Geomatics (CIRGEO), Italy; ⁵Colorado State University, Department of Agricultural Biology, USA

Solar-induced fluorescence (SIF) has become a promising remote sensing proxy for photosynthetic activity and thus plant health, but its broad application across vegetation types and climate regimes remains underexplored. Here, we present the first continental-scale assessment of seasonal SIF signatures for 33 vegetation types across 24 climate zones in the contiguous United States, enabled by a new open-access visualization tool. The analysis uses TROPOMI satellite SIF data (2019-2021), along with MODIS-derived gross primary productivity (GPP), normalized difference vegetation index (NDVI), and vapor pressure deficit (VPD). Our results show that SIF has consistently stronger and more reliable correlations with GPP than NDVI across vegetation types and environmental conditions. This relationship remains robust even under high VPD conditions (except for several perennial crops), confirming the ability of SIF to track productivity even in dry environments. While NDVI retains structural sensitivity, it often decouples from GPP under stress, particularly in arid climates and perennial crops. We also identify clear differences in SIF-NDVI and GPP-NDVI relationships by vegetation type and climate, with NDVI showing limited responsiveness to dynamic changes in canopy physiology. Despite the coarse spatial resolution of TROPOMI, these results demonstrate the feasibility of constructing climate-specific SIF signatures for agricultural and ecological monitoring. By identifying these climate-specific signatures at the continental scale, this work highlights the value of SIF for climate-smart crop management, productivity assessment, and satellite-based ecosystem modeling.

8:45am - 9:00am

High-resolution GPP estimation from Sentinel-1/2 around flux tower sites using convolutional neural networksHigh-resolution GPP estimation from Sentinel 1/2 around flux tower sites using convolutional neural networks

Taewoong Ham, Baoxin Hu

York University, Canada

Accurate estimation of gross primary production (GPP) is fundamental for quantifying the terrestrial carbon cycle. However, coarse-resolution products often fail to capture fine-scale spatial variations in carbon uptake across heterogeneous landscapes. While recent studies have begun to employ 10 m Sentinel-1 and Sentinel-2 imagery, they typically reduce these data to pixel-wise spectral indices, discarding the two-dimensional spatial structure (canopy architecture, land-cover transitions, within-stand heterogeneity) that the imagery encodes. This study investigates whether explicitly exploiting this spatial context via convolutional neural networks yields robust, transferable gains over tabular machine-learning baselines. We curate a quality-controlled dataset of 23,528 eight-day multi-sensor composites from 222 AmeriFlux sites (2015–2025), evaluated under site-wise cross-validation, temporal generalisation, and geographic transfer to an 18-site upper Midwest forest holdout. Under temporal transfer to unseen years (2023–2025), the best convolutional model achieves R² = 0.77 and RMSE = 1.95 gC m⁻² d⁻¹, an 18.6% RMSE reduction over ridge regression (R² = 0.65, RMSE = 2.40 gC m⁻² d⁻¹). Although this advantage narrows under geographic transfer to structurally novel regions (R² = 0.59 vs. 0.54), the convolutional models still outperform all tabular baselines. Spatial structure at 10 m therefore supports more robust temporal generalisation than spectral aggregates alone.

9:00am - 9:15am

Benchmarking GPP Proxies: A Cross-Biome Evaluation of SIF and NIRvP

Fangyi Lv¹, Kaimin Sun¹, Shunxia Miao¹, Yi Xu¹, Tian Guo²

¹Wuhan University, China; ²North Automatic Control Technology Institute, China

Accurate gross primary productivity (GPP) estimation is crucial for understanding ecosystem function and the global carbon cycle. Remote sensing offers promising GPP proxies, including solar-induced chlorophyll fluorescence (SIF) and the structural proxy NIRvP. However, their performance and underlying drivers of effectiveness vary significantly across biomes. This study comprehensively evaluated the accuracy and limitations of SIF and NIRvP against flux GPP across diverse biomes (CRO, GRA, DBF, ENF), also investigating physiological and structural controls on LUE.

We found that proxy performance was highly biome-specific. Notably, the removal of canopy escape probability (fesc) from observed SIF (SIFobs) to derive total emitted SIF (SIFall) did not consistently improve, and sometimes even diminished, its correlation with GPP, particularly in CRO and GRA. Furthermore, we elucidated distinct dominant controls on seasonal LUE variations: apparent SIF emission yield (ΦF×fesc) was paramount in ENF, while canopy structure (fesc) predominated in CRO, GRA, and DBF. Seasonal analysis in ENF further revealed a temporal decoupling, with fesc decline lagging LUE in winter, and ΦF failing to track autumnal LUE reductions. These findings underscore the biome-specific necessity for optimal GPP proxy selection, establishing a robust scientific foundation for improved remote sensing monitoring.

9:15am - 9:30am

Deep Learning Framework for High Spatiotemporal Resolution Monitoring of Carbon Uptake Using Multi-source Satellite Imagery

Jungho Im, Bokyung Son, Taejun Sung, Sejeong Bae

Ulsan National Institute of Science and Technology, Korea, Republic of (South Korea)

Accurate quantification of gross primary productivity (GPP) is essential for understanding carbon dynamics under climate change. However, satellite-based GPP estimates face spatial–temporal trade-offs, limiting accuracy in heterogeneous landscapes. To overcome this challenge, we proposed a novel framework named UNified, high-resolution Intelligent carbon QUantification and Explanation (UNIQUE), which produces daily 30 m GPP maps by integrating spatial relationships between 500 m MODIS and 30 m Landsat imagery. UNIQUE consists of two components. First, two AI models were trained using MODIS- and Landsat-based vegetation indices combined with meteorological reanalysis data and validated with 309 eddy-covariance flux tower observations across the Northern Hemisphere. The Light Gradient Boosting Machine (LGBM) showed the best performance, achieving r = 0.80 and RMSE = 2.47 gC/m²/day for MODIS-based GPP, and r = 0.83 with RMSE = 2.43 gC/m²/day for Landsat-based GPP. Second, a diffusion-based deep learning model was used to downscale MODIS-based GPP to 30 m resolution. The diffusion model from MODIS to Landsat GPP exhibited good performance, demonstrating an RMSE of 2.12 gC/m²/day for the testing sites. The proposed approach enabled the analysis of spatiotemporal characteristics of GPP across different plant functional types, facilitating enhanced high-resolution carbon flux monitoring in diverse ecosystems.

9:30am - 9:45am

Impact of spectral Resolution on SIF Quantification for explaining Almond Yield Variability

Thulasi Vishwanath Harish¹, Tomás Poblete¹, Victoria Gonzalez-Dugo², John Fielke³, Pablo Zarco-Tejada²

¹University of Melbourne, Australia; ²Instituto de Agricultura Sostenible,Consejo Superior de Investigaciones Científicas, Spain; ³Adelaide University, Australia

Insights into crop productivity have long been of great interest to almond growers, as they enable effective planning to optimise economic returns. Advances in sensor technology have made it possible to collect hyperspectral imagery, which captures detailed information across a continuous range of wavelengths and has become a powerful tool for assessing crop physiological status. Solar-induced chlorophyll fluorescence (SIF), along with other plant pigments and structural traits retrieved through radiative transfer modelling, can effectively track crop photosynthetic activity. However, the ability to quantify SIF is strongly influenced by the spectral resolution of the sensor.

This study examines how the spectral resolution of airborne hyperspectral sensors affects the ability to explain yield variability in a commercial almond orchard, by comparing SIF derived from the 760 nm and 687 nm oxygen absorption bands at different spectral resolutions.

9:45am - 10:00am

Multi-temporal Green Roof Vegetation Assessment Using Sentinel-2: A Pilot Study

Afsana Alam Akhie, Darko Joksimovic

Toronto Metropolitan University, ON, Canada

Green roofs (GRs) are constructed systems that replicate natural ecosystems and provide runoff reduction, cooling effect, habitat support and improved air quality services. Over 1,000 GRs have been constructed in Toronto since the GR Bylaw was enacted. As they are dispersed and primarily small-scale stormwater assets on private properties, it is crucial yet difficult to assess these roofs' condition to ensure they continue to deliver the desired advantages and adhere to maintenance regulations. To maintain green stormwater infrastructures in ideal conditions, it is advised that the vegetation be maintained with 80% coverage. GR vegetation experiences plant loss, water stress, and other maintenance concerns requiring regular inspections. This study presents a framework to enable remote assessment of green roof conditions utilizing Sentinel-2A satellite imagery, which captures images every five days. This method overcomes logistical challenges associated with drone imagery inspections, which are limited in frequency and require permits. The study was conducted using Google Earth Engine, focusing on the intra- and inter-annual variation of four GR modules. The study assessed the vegetation health from 2018 to 2025 using NDVI, EVI and NDMI, highlighting the long-term dynamics and distribution of GR vegetation. The results present the effectiveness of NDVI, EVI, and NDMI in assessing plant coverage and moisture content, with low NDMI being an important factor resulting in low NDVI and EVI. The study contributes to the potential of satellite images for scalable and continuous monitoring of GRs and supports efficient and complementary inspection.

8:30am - 10:00am

ICWG III/IVa-A: Disaster Management
Location: 715A

8:30am - 8:45am

A Camera System for Wildfire Detection and strategies against false positive Results

Gerhard Kemper¹, Denny Shaji², Christian Heipke³

¹GGS GmbH, Germany; ²GGS GmbH, Germany; ³Leibniz Universitaet Hannover (LUH) Institut fuer Photogrammetrie und GeoInformation (IPI)

Early Wildfire detection using AI remains challenging in environmental monitoring, particularly when the approach should be flexible enough to handle different sensors and different landscapes. This study presents a multi-stage deep learning framework for real-time smoke and fire detection using imagery from fixed tower cameras and UAVs. The proposed system employs YOLOv11 as the primary detection model for high-speed inference, com-plemented by Faster R-CNN for precision benchmarking and cross-architecture evaluation. Together, these models support an in-depth analysis of detection accuracy, robust-ness, and computational efficiency across diverse envi-ronmental conditions.

An end-to-end pipeline has been developed, integrating real-time image acquisition, asynchronous message han-dling through RabbitMQ, and performance logging via InfluxDB, enabling continuous model evaluation under near-operational conditions. Experimental results indicate that while YOLOv11 achieves high frame-rate perfor-mance and strong detection capability, it remains suscepti-ble to false positives in visually ambiguous scenarios such as haze, fog, or low-contrast backgrounds, where contex-tual patterns closely resemble smoke. Faster R-CNN serves as a complimentary reference to quantify localiza-tion accuracy and analyse error propagation, facilitating threshold tuning and model interpretability.

The presented framework bridges the gap between aca-demic model development and field-deployable fire sur-veillance systems. It establishes a reproducible, scalable foundation for real-time decision support in forest watch-tower networks and autonomous UAV missions aimed at early wildfire detection and response

8:45am - 9:00am

Effectiveness of Airborne LiDAR Intensity for Identifying Surface Fire Burned Areas in Wildfires

Takafumi Kakuno, Junichi Takanuki, Takaaki Ankai, Takumi Satou

Aero Toyota Corporation, Japan

Wildfires induce significant changes in forest structure and the surface reflectance characteristics. This study evaluated the effectiveness of using airborne LiDAR Intensity data to delineate surface fire burn areas in wildfires. We extracted ground returns from both coniferous and deciduous forests and conducted qualitative assessment of Intensity through Intensity images, as well as statistical evaluation using the non-parametric Mann–Whitney U test to compare burned and unburned areas. We compared the median and standard deviation of Intensity at a 10-m mesh scale, calculating standard deviation at a finer 0.5-m mesh resolution. The results revealed significant differences between the two groups. As a result, a significant difference was observed between the two groups. The effect size r for the median in deciduous forests ranged from 0.55 to 0.84, while the effect size r for the standard deviation in coniferous forests ranged from 0.32 to 0.47. Both indicated a medium to large effect. These findings suggest that LiDAR Intensity can effectively identify surface fire burn areas even under heterogeneous forest floor conditions. The proposed method has the potential to contribute to enhancing post-fire monitoring using airborne LiDAR.

9:00am - 9:15am

Assessing Fire Impacts on Aboveground Biomass using Multi Sensor Remote Sensing in the Western Ghats

Balaramdas K R¹, Mohanraj R², Abdul Rahaman S¹

¹Bharathidasan University, Tiruchirappalli, India; ²Sathyabama Institute of Science and Technology, Chennai, India

This study investigates two decade (2000-2020) of Aboveground Biomass dynamics in the biodiversity hotspot of Western Ghats, India, focusing on the impacts on forest fire and climate variability. Using machine learning approaches with GEDI LiDAR data and MODIS satellite imagery, we developed a robust annual AGB model. These analysis reveals a consistent decline in AGB across Kodaikanal and Nilgiris. Results shows that rising temperature and vapor pressure deficit are the key driver for increase in burn are and fire intensity. These are pushing carbon rich evergreen forests toward a critical transition from carbon sink to source. An integrated Structural Equation Model confirms that the dominant role of climate in driving fire regimes and subsequent biomass loss. This research provides a critical scientific foundation for fire adaptive forest management and carbon accounting in vulnerable tropical ecosystem.

9:15am - 9:30am

BC Wildfire Risk Prediciton Time-Series Dataset: 2002--2023

Zhengsen Xu¹, Lanying Wang², Yimin Zhu¹, Wentao Sun², Lincoln Linlin Xu¹

¹University of Calgary, Canada; ²University of Waterloo, Canada

Wildfires are longstanding natural phenomena with significant impacts on ecosystems and communities. In recent years, Canada has experienced particularly severe wildfire effects, especially in British Columbia (BC), which has endured prolonged and impactful wildfire events. However, there is currently no specialized wildfire time-series dataset for BC that considers long-term temporal sequences and multiple driving factors, which are essential for data-driven approaches. To facilitate future research on data-driven wildfire risk and spread prediction, we have developed a dataset covering the entire BC province, encompassing 683 wildfire events from 2002-2023 at 500m resolution with daily observations. For each wildfire event, the dataset includes 20 driving factors, including vegetation status, meteorological factors, human activities, topographical features, and active fire detection. Based on this benchmark and similar datasets from other regions, we compared multiple deep learning models, including CNN-based, Transformer-based, and Mamba-based architectures, to explore the effectiveness of existing deep learning models in wildfire risk prediction. We found that model F1 scores were below 0.6, indicating that this new dataset presents a challenging non-linear modeling scenario that requires more advanced and tailor-designed deep learning models to improve wildfire risk prediction accuracy.

9:30am - 9:45am

Long-term forest fire assessment over Zagros Forests

Fatemeh Motalebizade, Parviz Fatehi

University of Tehran, Iran, Islamic Republic of

Wildfires are known as one of the most important natural hazards, adversely impacting the ecosystems and human lives. Monitoring and management of wildfires is necessary to minimize their negative effects. Global BA products are widely used to study wildfires, but their accuracy is not constant over different environments. In this study, the MCD64A1 BA product was spatially validated using ground truth maps in a fire-prone Zagros Forest over 2021-2023. Our results indicated that its performance varies temporally, as the Kappa coefficient ranged from 0.04 to 0.69. Overall accuracy was higher than 0.96 percent in all years, indicating that MCD64A1 can be considered as a source for studying wildfires; however, its underestimation should be considered. In the next step, the trend of fire and its relationship with precipitation (i.e., obtained from the CHRIPS dataset) were analyzed in three forest ecosystems from 2001 to 2024. In two regions, Marivan and Kermanshah, wildfires experienced an increasing trend, in contrast to the other region, Shiraz, where they decreased over time. Analyzing the correlation between fire and precipitation revealed that spring precipitation is more connected to BA than annual precipitation. Comparing the results of the three regions showed that this matter is also region-related, and the results of one region cannot be referred to another. This study provided information on the performance of MCD64A1 in semiarid forests and the wildfire conditions in the Zagros Mountains to aid wildfire management.

8:30am - 10:00am

WG II/3A: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

8:30am - 8:45am

GT-LOD3: LOD3 Semantic 3D Building Reconstruction Benchmark Dataset

Han Sae Kim¹, Olaf Wysocki², Ludwig Hoegner³, Ksenia Bittner⁴, Joshua Carpenter⁵, Friedrich Fraundorfer^4,6, Arpan Kusari⁷, Max Mehltretter⁸, Franz Rottensteiner⁸, Anna Schadl⁹, Martin Weinmann¹⁰, Jinha Jung¹

¹Lyles School of Civil and Construction Engineering, Purdue University, West Lafayette, IN, USA; ²CV4DT, Department of Engineering, University of Cambridge, Cambridge, United Kingdom; ³Faculty of Civil Engineering, Hochschule München University of Applied Sciences, Munich, Germany; ⁴Remote Sensing Technology Institute, German Aerospace Center (DLR), Oberpfaffenhofen, Germany; ⁵Department of Civil Engineering, The University of Akron, Akron, OH, USA; ⁶Institute of Visual Computing, Graz University of Technology, Graz, Austria; ⁷University of Michigan Transportation Research Institute, University of Michigan, Ann Arbor, MI, USA; ⁸Institute of Photogrammetry and GeoInformation, Leibniz University Hannover, Hannover, Germany; ⁹Faculty of Geoinformatics, Hochschule München University of Applied Sciences, Munich, Germany; ¹⁰Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany

This contribution introduces GT-LOD3, a new benchmark dataset designed to advance semantic Level of Detail 3 (LOD3) building reconstruction from UAS-based photogrammetric point clouds. Existing benchmarks primarily focus on mesh- or point-level semantic labelling, façade segmentation, or LOD2-level modelling, but high-quality, geometry-accurate LOD3 ground truth paired with real-world photogrammetric observations are still limited. GT-LOD3 fills this gap by offering paired UAS point clouds and manually modeled LOD3 reference data in CityGML format, enabling research on window-level facade reasoning, geometric regularization, and instance-level shape recovery.

The benchmark currently consists of two subsets featuring different architectural styles and environmental conditions: (1) a urban block in Gold Coast (Lakewood, Ohio, USA), and (2) the Technical University of Munich (TUM) campus. The accompanying LOD3 reference models contain explicit window geometry, enabling detailed evaluation of both detection performance and polygon-level geometric accuracy.

We further provide a baseline reconstruction pipeline that combines point-cloud semantic segmentation, facade-aligned 2D projection, window region extraction, and geometric back-projection into CityGML. An evaluation protocol is presented including pixel-level metrics (IoU, precision, recall, F1) and instance-level detection metrics based on optimal assignment via the Hungarian algorithm.

8:45am - 9:00am

LoD2-Former: Multi-Modal Transformer-Based 3D Building Wireframe Reconstruction

Youssef Abdelhedi¹, Daniel Panangian¹, Chaikal Amrullah¹, Houda Chaabouni-Chouayakh², Ksenia Bittner¹

¹Remote Sensing Technology Institute, German Aerospace Center (DLR), Wessling, Germany; ²Sm@rts Laboratory, Digital Research Center of Sfax, Tunisia

Building wireframe reconstruction from LiDAR faces challenges due to sparse and incomplete point cloud data. We present LoD2-Former, a multi-modal Transformer architecture that fuses aerial LiDAR and optical imagery for end-to-end 3D roof wireframe reconstruction. Unlike existing point-cloud-only methods, our dual-backbone approach with bidirectional cross-modal attention leverages complementary geometric and visual information. Experiments on two datasets show consistent improvements in edge detection metrics, with edge F1-scores increasing from 0.874 to 0.899 on Tallinn and 0.968 to 0.974 on Roof-Intuitive, while substantially boosting corner recall (0.630 to 0.729) in complete-data settings. We also contribute a curated multi-modal subset of Building3D with aligned LiDAR and aerial imagery to facilitate future research.

9:00am - 9:15am

Point2WSS: Reconstructing LoD2 Buildings from Aerial LiDAR Data using Multimodal Learning and Weighted Straight Skeleton

Pierre-Loïc Queffélec^1,2, Nicolas Trouvé¹, Stéphane Roussel¹, Teng Wu², Bruno Vallet²

¹DEMR, ONERA, Université Paris Saclay, F-91123 Palaiseau, France; ²Univ Gustave Eiffel, ENSG, IGN, LASTIG, F-77420 Champs-sur-Marne, France

In this paper, a method exploiting aerial LiDAR point clouds to build realistic building meshes suitable for electromagnetic simulation is proposed. One of the main challenges lies in reconstructing regularized building meshes with low polygonal density. Optimization-based methods, commonly used for building reconstruction from point clouds, are highly data-driven, making the quality of results dependent on the quality of input data. Aerial LiDAR scans can be incomplete or sparse, for instance due to occlusion. A novel LoD2 buildings reconstruction method based on deep learning is proposed, assuming that deep learning methods are more robust to incomplete or sparse data than optimization-based methods. A parametric building model is introduced, based on the Weighted Straight Skeleton algorithm, which generates realistic roofs from a building footprint and an associated set of slopes, and subsequently extrudes the roof to the specified building height. This parametric approach guarantees that a given set of parameters (height, footprint and slopes) produces a regularized building mesh with low polygonal density. A multimodal model, named Point2WSS, was trained to recover the variable number of building's continuous parameters from its corresponding point cloud. This approach enables the generation of realistic building meshes suitable for electromagnetic simulation, if the predicted parameters accurately approximate real-world values.

9:15am - 9:30am

Wide-area Scene Reconstruction with polyhedral Buildings featuring recognized Regularities

Jochen Meidow

Fraunhofer IOSB, Germany

The modeling of buildings suffers from a dichotomy between generic and specific representations: the lack of domain knowledge in flexible models that can represent many shapes, and the restricted geometry of pre-specified parametric building primitives. To fill this gap, we propose using general boundary representations enriched with automatically recognized and enforced geometric constraints derived from human-made regularities. The proposed reasoning process relies on the statistics of the planar point groups extracted from airborne-captured point clouds. Hence, a chosen significance level is the only process parameter. To enforce the creation of sound solids, we apply manifold constraints for the generation of the boundary representations. The feasibility and usability of the approach are demonstrated by evaluating an airborne-captured laser scan containing approximately 7,600 buildings over an area of 50 km^2 featuring both inner-city and rural landscapes.

9:30am - 9:45am

The P3 Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization

Raphael Sulzer^1,2, Liuyun Duan², Nicolas Girard², Florent Lafarge¹

¹Université Côte d’Azur, INRIA – Sophia-Antipolis, France; ²LuxCarta Technology, Mouans-Sartoux, France

We present P3, a large-scale multimodal dataset for building vectorization, including aerial LiDAR point clouds, aerial images, and vectorized 2D building outlines, collected across three continents. P3 contains over 10 billion LiDAR points with decimeter-level accuracy and RGB images at a ground sampling distance of 25 centimeters. While many existing datasets focus on the image modality, P3 offers a complementary perspective by incorporating dense 3D information. We demonstrate that LiDAR point clouds serve as a robust modality for predicting building polygons, both in hybrid and end-to-end learning frameworks. Moreover, fusing LiDAR and imagery further improves accuracy and geometric quality of predicted polygons. The P3 dataset is publicly available, along with code and pretrained weights of three state-of-the-art models for building polygon prediction at https://github.com/raphaelsulzer/PixelsPointsPolygons .

9:45am - 10:00am

Building height estimation from stereo satellite images using contour vector registration

Yaxuan Duan, Wei Qin, Xin Huang, Pei Mi, Yang Yu, Pengjie Tao

School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China

Accurate building height estimation plays a crucial role in large-scale 3D urban reconstruction. However, conventional stereo matching approaches often suffer from mismatches around building edges, leading to unreliable height retrieval in dense urban areas. To address this issue, this paper presents a novel method for building height estimation based on contour vector registration integrated with the vertical line locus technique. The proposed framework first automatically matches building contour vectors extracted from stereo high-resolution satellite images. Then, for each paired contour, a range of candidate heights is searched using a rational function model to project the reference contour from the image space to object space and then reproject it onto the conjugate image. The elevation that maximizes the overlap ratio between projected and paired contours is identified as the optimal roof elevation. Building height is subsequently derived by subtracting the ground elevation from the estimated roof elevation. Experiments conducted on SuperView-1 (SV-1) satellite stereo images over Jiuyuan District, Baotou, Inner Mongolia, China, demonstrate the effectiveness of the proposed method. The resulting building height estimates achieve a root mean square error of 0.84 m compared to manual measurements, showing strong agreement (r = 0.9993). The proposed contour-based stereo registration approach provides a robust and efficient solution for building height extraction from high-resolution satellite data, supporting precise urban 3D modeling and large-scale spatial analysis.

8:30am - 10:00am

IvS3A: Advancing Digital Twins for Urban Environments: Approaches to Mapping, Monitoring, and Management
Location: 716A

8:30am - 8:45am

A Decade of Aerial Mapping in Singapore

Daniel Kruimel, Esther Seah, Zaffar Sadiq Mohamed-Ghouse

Woolpert, United States of America

In 2024, the Singapore Land Authority (SLA) commissioned Woolpert to conduct a large-scale aerial mapping initiative under the National 3D Mapping Programme to support Smart Nation applications, urban planning, and geospatial analytics. This project, executed between 2024 and 2025, delivered high-resolution imagery and LiDAR datasets across approximately 750 km², covering mainland Singapore and offshore islands. This was the third epoch of 3D mapping in Singapore with previous surveys conducted by Woolpert (then AAM) in 2014 and 2019

8:45am - 9:00am

Large-Scale Urban and Peri-Urban Mapping Using Deep Learning and PlanetScope Imagery

Yuhong He¹, Yindan Zhang¹, Jonathan Ruppert², Lyndsay Cartwright²

¹University of Toronto Mississauga, Canada; ²Toronto and Region Conservation Authority

Accurate, high-resolution land use and land cover data are critical for effective environmental monitoring, watershed management, and sustainable urban and peri-urban planning within rapidly urbanizing regions such as the Toronto and Region Conservation Authority (TRCA) jurisdiction in Ontario, Canada. TRCA has conventionally relied on manual mapping approaches to delineate its LULC inventory; however, this method is labour-intensive and prone to temporal inconsistencies across updates. To address these challenges, we developed TRCA-AutoMap, a deep learning-based automated mapping framework to generate fine-scale LULC products using 3-m PlanetScope imagery. TRCA-AutoMap integrates two principal modules. The first module is designed to enhance the model’s ability to detect and differentiate objects across spatial scales. By leveraging multi-extent feature encoding and pyramid pooling, the convolutional neural networks capture both fine-texture and contextual information, thereby improving segmentation accuracy and spatial coherence . The second module focuses on optimizing the model’s understanding of varying imaging conditions. It utilizes a group of autoencoders to mitigate radiometric and environmental differences among input images, thereby maintaining the model's reliability across varied lighting conditions, sensor types, and atmospheric conditions. This process enhances the stability of PlanetScope imagery over time and consistency between different scenes. The framework significantly reduces manual processing effort, ensures classification consistency, and supports annual LULC updates. Quantitative and visual evaluations confirm that the model accurately captures fine-scale vegetation heterogeneity and urban expansion dynamics.

9:00am - 9:15am

Research on Urban 3D Data Management and Representation Method Based on BeiDou Grid Code

Jiahui Wang, Tao Shen, Liang Huo, Xuejia Wei, Jinlong Wang

Beijing University of Civil Engineering and Architecture, China, People's Republic of

With the advancement of urbanization and digital twin city development, urban 3D data are characterized by large volume, heterogeneity, and structural complexity. Traditional spatial data management methods face limitations in hierarchical organization, retrieval efficiency, and redundancy control, and the lack of a unified spatial coding system hinders multi-source data integration. This paper proposes a method for urban 3D data management and representation based on BeiDou grid coding and adaptive voxel modeling. The method converts point cloud data from local coordinates to the 2000 National Geodetic Coordinate System, applies 36-bit 3D BeiDou grid coding, performs adaptive octree voxel partitioning based on point cloud density, elevation variation, and class entropy, and binds spatial, geometric, and semantic attributes at the voxel level. Using the SensatUrban dataset, the method is compared with fixed-resolution voxel modeling, latitude-longitude indexing, and R-tree indexing in terms of voxel quantity, data storage, and retrieval time. Results show that it reduces voxel count by 28.1% and storage volume by 13.6% while maintaining high-precision representation, and the BeiDou grid-based indexing significantly improves query efficiency and stability. The proposed approach balances visualization quality and computational efficiency, providing an effective solution for large-scale urban 3D data management.

9:15am - 9:30am

Evaluating iPhone-based 3D-Scanning Applications for Heritage Documentation: Controlled Experiments and Future Directions

Abdelrahman Abdelghany¹, Derek Lichti¹, Peter Dawson¹, Shabnam Jabari²

¹University of calgary, Canada; ²University of New Brunswick

Smartphone 3D-scanning apps are becoming popular tools for heritage documentation, but their accuracy and reliability remain unclear. This contribution presents controlled laboratory experiments using several iPhone-based scanning applications, comparing their point clouds to high-precision reference data. The study evaluates geometric accuracy, completeness, and reconstruction geometric stability, highlighting the strengths and limitations of current mobile scanning solutions. Practical recommendations are provided for heritage professionals and field teams, along with future directions for improving smartphone-based documentation using AI-enhanced depth estimation.

9:30am - 9:45am

Automatic DEM-infused 2D to 3D LoD1 Urban Morphology Python Framework

Omar Yasser¹, Mohsen Kalantari², Izni Zahidi¹

¹Monash University, Malaysia; ²The University of New South Wales (UNSW) Sydney

The generation of 3D urban morphology models from 2D urban morphology maps has been widely explored. Traditional methods use modelling software, such as Rhino, which lack georeferencing, elevation, and automation. In this study, we developed an open-source Python framework for automatic generation of 3D city blocks, including elevation, from 2D colour-graded building heightmaps and urban morphology input. We utilised the UT-GLOBUS and GlobalBuildingAtlas building datasets to generate heightmaps and retrieved other urban morphology features, such as waterbodies, parks, roads, and trees, from OpenStreetMap to form the input raster patches. The framework generates height and colour maps based on the input features, which are extruded in 3D and exported into multiple standard 3D GIS formats such as CityGML and CityJSON. Six global cities: Sydney, New York, London, Rio de Janeiro, Hong Kong, and Singapore, were modelled to demonstrate the framework’s applicability. Validation includes qualitative comparison with Google Earth 3D data and quantitative comparison against official LiDAR-derived DSMs for four cities. Quantitative results show moderate height errors and good spatial agreement of building footprints, reflecting the expected differences between simplified LoD1 block models and detailed DSM representations. Our framework results show promising potential in the field of 2D to 3D mapping for the creation of 3D city models for urban climate modelling and environmental analysis. The generated 3D models can be downloaded at https://doi.org/10.5281/zenodo.17620303.

8:30am - 10:00am

ThS14: AI-Augmented Photogrammetry - Bridging Learning-based Approaches and Classical Geometric-based 3D Methods
Location: 716B

8:30am - 8:45am

Combining Photogrammetry and Gaussian Splatting

Fabio Remondino¹, Elisa Mariarosaria Farella¹, Gianluca Bertolasi¹, Simone Rigon¹, Rongjun Qin²

¹3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK), Trento, Italy; ²Department of Civil, Environmental and Geodetic Engineering, The Ohio State University, USA

Among the image-based methods, traditional photogrammetry is a consolidated 3D reconstruction technique able to provide highly accurate metric products, widely exploited in many domains for documentation and mapping purposes. The reconstruction capability of this technique is, however, conditioned by the characteristics of the captured scene, with high performance in well-textured areas and limits when non-collaborative surfaces, such as reflective or transparent objects, are present. In such cases, the photogrammetric reconstruction is often affected by noise, incomplete geometry and artifacts, reducing its final reconstruction quality. In recent years, different AI-based reconstruction methods have emerged as alternative (or complementary) 3D reconstruction and rendering solutions. In particular, 3D Gaussian Splatting (GS) has demonstrated impressive capabilities in rendering photorealistic scenes in challenging situations with high visual fidelity. However, its application in large-scale scenarios or when highly accurate 3D metric products are required is still limited, due to the hight computational resources needed and the intrinsic optimization of GS methods for photometric rendering quality. To address these bottlenecks, this work proposes a hybrid reconstruction pipeline, leveraging the strengths and benefits of each technique. The method exploits the accurate geometry of photogrammetry in well-textured regions and the higher GS capabilities to improve completeness and visual aspect in areas featuring non-collaborative surfaces. A fusion strategy is proposed to combine the two products into a single 3D model, presenting results on two aerial and one terrestrial dataset.

8:45am - 9:00am

Refraction-Aware Two-Media NeRF for Underwater 3D Reconstruction

Markus Brezovsky¹, Anatol Guenthner², Frederik Schulte³, Lukas Winiwarter³, Boris Jutzi², Gottfried Mandlburger¹

¹Department of Geodesy and Geoinformation, TU Wien, Vienna, Austria; ²Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany; ³Unit of Geometry and Surveying, University of Innsbruck, Innsbruck, Austria

Neural Radiance Fields (NeRFs) (Mildenhall et al., 2020) have revolutionized novel view synthesis, but standard formulations assume straight rays in a single, homogeneous medium. In underwater scenarios, refraction at the air–water interface leads to bent light paths and, if ignored, to distorted 3D structure and missing underwater points. Refraction-aware NeRF variants such as NeRFrac (Xue et al.,2022) demonstrate the benefit of modeling refraction, but are limited to a single underwater medium and standalone implementations. Recent work has applied NeRFrac to through-water reconstruction (Brezovsky et al., 2025), introduced a simulation framework for two-media scenes (Schulte et al., 2025). Building on these ideas, we introduce the general concept of a twomedia NeRF and demonstrate its integration into the Nerfstudio framework (Tancik et al., 2023) with the goal of extracting metrically meaningful underwater point clouds rather than only improving image-based metrics.

9:00am - 9:15am

CENS: A Coverage-efficient Pixel Sampling Strategy for enhancing NeRF-generated Point Cloud Fidelity

Perpetual Hope Akwensi, Frederik Schulte, Lukas Winiwarter

Unit of Geometry and Surveying, Universität Innsbruck, Austria

Many geospatial workflows critically depend on high-fidelity 3D point clouds for applications such as change detection, orthophoto generation, and modeling. However, NeRF-generated point clouds often suffer from sampling inefficiencies inherent in the predominant random pixel sampling approach. We identify spatial redundancy as one such inefficiency: random sampling has the inevitable consequence of sampling large, low-texture patches more frequently than detailed, high-frequency textured regions. As a result, low-texture areas turn to be oversampled and other pixels remain unsampled -- regardless of their importance to the reconstruction task.

To overcome this, we propose CENS (Coverage-Efficient Non-Redundant Sampling), a deterministic pixel sampling strategy that maximizes spatial coverage, eliminates intra-image sample repetition, and ensures reproducibility via structured initialization.

Evaluated on the Jamtal valley dataset, CENS achieves comparable geometric accuracy (C2M: mean = -0.0027 vs. -0.0011 m; standard deviation = 0.027 vs. 0.028 m) using 50% fewer training steps (11,232 vs. 22,464), while yielding 28.2% more points, higher orthophoto fidelity, and improved point cloud completeness. Beyond CENS, we also explored NeRFs for ALS point cloud simulation, achieving realistic occlusion patterns and accuracy within UAV photogrammetry standards (Vertical RMSE} = 24 mm; Horizontal RMSE = 17 mm).

Crucially, CENS positions NeRFs as a scalable, practical solution for geospatial point cloud and orthophoto generation, advancing them toward real-world mapping workflows, and integrates seamlessly into NeRFStudio.

9:15am - 9:30am

Explicit Reconstruction of thermal Environments based on dual-modal neural Radiation Fields for diagnosing Building Facade Defects

Sirui Chen¹, Yipeng Lu², Zhe Chen², Fuxun Liang^1,2, Zhen Dong²

¹School of Urban Design, Wuhan University, Wuhan, China, China, People's Republic of; ²State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing (LIESMARS), Wuhan University, Wuhan, China

This research presents an innovative multi-modal framework for the explicit 3D reconstruction of building thermal environments to diagnose facade defects. We propose a framework which is centered on a dual-branch Neural Radiance Field (NeRF) architecture, which effectively fuses fine-grained geometric information from RGB data with precise quantitative thermal data from TIR data. For practical diagnostics, the framework integrates the Signed Distance Function (SDF) to implicitly learn a high-fidelity surface representation. Subsequently, a final, explicit triangular mesh is extracted from this implicit field using the Marching Cubes algorithm. The resulting model achieves geometric accuracy and thermal fidelity, enabling the clear visualization, localization, and analysis of thermal anomalies such as thermal bridges, cavities, and moisture ingress in their correct spatial context.

9:30am - 9:45am

Assessing the Reconstruction Potential of 3D Vision Foundation Models for Oblique Photogrammetry

Junfan Wang¹, Feng Liu², Zhihao Jia¹, Han Hu^1,3, Min Chen^1,3, Xuming Ge^1,3, Ping Wen^3,4, Chong Wang^3,4, Qing Zhu^1,3

¹Faculty of Geosciences and Engineering, Southwest Jiaotong University, 611756 Chengdu, China; ²CRSC Communication & Information Group Co., Ltd.; ³Yunnan Engineering Research Center of 3D Real Scene, Kunming 650500, China; ⁴Kunming Engineering Corporation Limited, Kunming 650500, China

3D vision foundation models, which directly regress 3D geometry from 2D images in an end-to-end manner, have recently attracted growing attention in the computer vision community. However, their potential for oblique 3D reconstruction has not been systematically evaluated. To this end, we establish an automated evaluation pipeline to benchmark these models on oblique imagery. Our experiments reveal that: benefiting from the powerful zero-shot generalization, 3D vision foundation models can robustly estimate camera parameters and generate dense point clouds under sparse-view and low-overlap conditions, with some rivaling traditional photogrammetry configured with redundant observations. Counterintuitively, two-view reasoning foundation models employing explicit PnP-RANSAC for global alignment consistently outperform multi-view reasoning foundation models inferring multi-view relationships via implicit attention mechanism when processing more than 2 views. Notably, incorporating known camera parameters as conditioning inputs, which act as weak supervision rather than rigid geometric constraints, yields only marginal accuracy improvements. Based on ViT architecture, these foundation models face scalability bottlenecks to large-scale and high-resolution oblique imagery, and their prevalent ideal pinhole camera assumption still makes explicit distortion correction an unavoidable preprocessing step.

9:45am - 10:00am

Evaluating the Performance of 3D Vision Foundation Models for DSM Reconstruction from Satellite Images

Liupeng Su¹, Yuhao Ye¹, Han Hu¹, Zeyuan Dai^2,3, Qianrui Guo⁴, Heyi Li⁴, Yulin Ding¹, Qing Zhu¹

¹Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu 611756, Sichuan, China; ²Department of Military Oceanography and Hydrography and Cartography, Dalian Naval Academy, Dalian 116018, China; ³Key Laboratory of Hydrographic Surveying and Mapping of PLA, Dalian Naval Academy, Dalian 116018, China; ⁴Institute of Remote Sensing Satelite, China Academy of Space Technology, Beijing 100094, China

Three-dimensional (3D) reconstruction from satellite imagery is a critical research topic in the fields of remote sensing and geoinformation science. Although 3D Vision Foundation Models (3D VFMs) have demonstrated remarkable performance in reconstructing natural scenes, their capability to handle high-resolution satellite imagery has not been systematically evaluated. This study presents a comprehensive assessment of seven representative 3D VFMs for satellite-based 3D reconstruction and integrates four point-cloud alignment strategies. Rigorous comparisons were conducted against high-precision LiDAR-derived Digital Surface Models (DSMs) using two publicly available multi-view satellite datasets--WHU-TLC and MVS3D. Experimental results show that, on the high-resolution MVS3D dataset, the Depth Anything v2 (DAV2) model combined with the Affine alignment strategy achieved the best overall performance, producing DSMs with a Mean Absolute Error (MAE) of 1.75 m and a Root Mean Square Error (RMSE) of 3.24 m, corresponding to accuracy improvements of 8.4 % and 13.6 %, respectively--significantly outperforming all other model-strategy combinations. In contrast, on the lower-resolution WHU-TLC dataset, all 3D VFMs exhibited notable performance degradation, and the reconstructed results showed limited practical value, revealing persistent generalization challenges for current models in low-resolution scenarios. Overall, this study systematically quantifies the performance of 3D VFMs in satellite image-based 3D reconstruction, confirming their strong potential for high-resolution satellite applications and providing valuable insights for enhancing model robustness and generalization across complex urban and low-resolution environments.

8:30am - 10:00am

WG III/1M: Remote Sensing Data Processing and Understanding
Location: 717A

8:30am - 8:45am

Evaluating super-resolution models for real-world Sentinel-2 applications: A case study

Ron Mühlhaus², Sandeep Kumar Jangir¹, Cecilia Curreli², Paul Karlshöfer¹, Daniel Cremers²

¹German Aerospace Center (DLR), The Remote Sensing Technology Institute, Germany; ²Technical University of Munich. School of Computation, Information and Technology

High-resolution Earth observation data are crucial for applications such as agriculture, urban planning, and environmental monitoring. Although commercial satellites provide sub-meter imagery, open-access alternatives like Sentinel-2 are limited to resolutions around 10~m ground sampling distance, which is insufficient for many tasks. In this work, we investigate image super-resolution as a method to bridge this gap, enhancing downstream performance on freely available satellite data. We leverage two 16-bit single-band datasets, consisting of Sentinel-2 (20m --> 10m) and Venus (10m --> 5m) images, to train and benchmark state-of-the-art SR methods, including transformer- and diffusion-based approaches, across multiple dataset mixes. These models are evaluated quantitatively using reference-based metrics (PSNR, SSIM) using ground-truth and no-reference scores (FID, NIQE) for native upscaling from 20m --> 10m and 10m --> 5m. We observe that different SR architectures present trade-offs between standard quantitative metrics and perceptual image quality. We further assess their impact on a practical downstream task: field boundary detection from Sentinel-2 imagery. Our experiments demonstrate that SR pre-processing improves quantitative fidelity and downstream task performance, enabling low-resolution satellites to compete more effectively with commercial imagery

8:45am - 9:00am

Fine-Grained Remote Sensing Imagery Generation Driven by Expert Knowledge and Hierarchical Captions

Jiaxin Ren^1,2,3, Wanzeng Liu⁴, Feng Zhang³, Jun Chen^1,4, Jiadong Zhang⁵, Shunxi Yin⁶, Shaoxuan Zhao⁵, Guanfan Xi¹, Di Chen¹, Kuanlin Dong¹

¹Moganshan Geospatial Information Laboratory, Huzhou, China; ²Key Laboratory of Monitoring, Evaluation and Early Warning of Territorial Spatial Planning Implementation, Ministry of Natural Resources, Chongqing, China; ³School of Earth Sciences, Zhejiang University, Hangzhou, China; ⁴National Geomatics Center of China, Beijing, China; ⁵School of Geosciences and Info-Physics, Central South University, Changsha, China; ⁶School of Environment and Spatial Informatics, China University of Mining and Technology, Xuzhou, China

Current diffusion models struggle to achieve fine-grained remote sensing imagery (RSI) generation. This limitation fundamentally stems from their reliance on "flattened" text prompts, which overlook the inherent hierarchical structure of RSI. This paper proposes a fine-grained RSI generation method driven by expert knowledge and hierarchical captions. We first deconstruct RSI into a hierarchical "element-relation-scene" caption and employ an automatic caption optimization mechanism, grounded in spatial relation knowledge, to ensure high fidelity. Critically, we introduce a novel hierarchical caption encoding mechanism that efficiently injects decoupled hierarchical caption segments into the U-Net's cross-attention layers. This design enables the model to exert hierarchical and decoupled attentional control over the global scene, spatial layout, and geographical element details during the denoising process. Experiments demonstrate that, when combined with efficient fine-tuning algorithms such as LoRA, our method significantly outperforms traditional single-level captions across all six evaluation metrics, exemplified by the FID metric decreasing from 228.43 to 205.59 and the GSHPS metric increasing from 0.86 to 0.92. This research provides a new paradigm for controllable remote sensing scene generation, establishing an effective link between hierarchical semantic understanding and the progressive generation process of diffusion models.

9:00am - 9:15am

Image-level and Feature-level Semantic-aware Architecture for Cross Domain Semantic Segmentation of High-resolution Remote Sensing Imagery

Jianhao Miao¹, Xinghua Li², Xuechen Bai¹

¹State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, People's Republic of China; ²School of Remote Sensing and Information Engineering, Wuhan University, People's Republic of China

Semantic segmentation of remote sensing images has attracted considerable attentions. For cross domain semantic segmentation, the images captured at different times inevitably exhibit significant domain and feature gaps. Besides, the labels are precious, given that acquiring adequate annotations is time-consuming and laborious. There are numerous methods to cope with these problems, for example, semi-supervised, weakly supervised learning help for the lack of label, while style transfer and domain adaptation are effective for domain gaps. However, the outcomes are still not ideal. Nearly all methods ignore the combination of image-level alignment and feature-level alignment, while few methods consider class-wise constraint to boost the performance. Towards this end, IFSDA, an image-level and feature-level semantic-aware architecture for cross domain semantic segmentation is put forward. In order to acquire sound outcomes, two branches of alignment strategies are realized by self-supervised learning and generative adversarial learning. Besides, a novel semantic discriminator is utilized in image translation process to optimize class-related information, thereby helping to eliminate the intra-class domain gaps between bi-temporal images and optimize the segmentation results effectively. Experiments on ISPRS 2D Semantic Labeling Contest Dataset have shown the superiority of proposed method over other models.

9:15am - 9:30am

Automating Expansive Cliff-nesting Seabird Colony Counts with Deep Learning: A Case Study of Aerial Photo Surveys of Northern Fulmars in Arctic Canada

Dominique Chabot¹, Sarah Gutowsky², Emily Knighton³, Tomas Naprstek¹, Charles Francis², Clément Chevallier², Mark Mallory³, Lisa Pirie²

¹National Research Council Canada; ²Environment and Climate Change Canada; ³Acadia University, Canada

Reliable estimates of seabird colony size are essential for monitoring population dynamics, yet accurate counts are difficult for expansive colonies on remote Arctic cliffs. Northern fulmars (Fulmarus glacialis) breed in large, unevenly distributed aggregations across extensive, towering cliffs in the Canadian Arctic, posing numerous survey challenges. Side-looking helicopter photo surveys generate thousands of photos where birds are small, variably angled, and of inconsistent sharpness against large, complex backgrounds. We used deep learning to automate fulmar counts in this imagery. Our objectives were to (1) develop an object detection model trained on manually annotated imagery sampled from three Arctic colonies, (2) evaluate model performance, and (3) estimate total size of an entire colony. We trained a YOLOX-based model on >16,000 annotated birds, following a two-stage training approach for small objects interspersed across expansive and heterogenous backgrounds. Compared to ~20,000 additional manual annotations in a sample of the Cape Liddon colony in the territory of Nunavut, the model detected 90% of birds with a 9% false-positive rate (i.e. 90% recall, 91% precision). The model's detection sensitivity was calibrated to achieve a ~1:1 ratio between total model detections (true positives + false positives) and the 'true' count, which required manual annotation of ~15% of the colony imagery. Overall, the model detected 38,723 fulmars across the entire colony, providing a robust estimate of its full population. These results highlight deep learning’s potential to greatly streamline and scale up seabird monitoring in remote polar environments where conventional surveys are constrained.

9:30am - 9:45am

Estimation of surface nitrogen dioxide (NO₂) using TEMPO satellite data and machine learning

Neda Kolahi, Costas Armenakis, Mark D Gordon

York University, Canada

Air pollutants such as nitrogen dioxide (NO₂) have detrimental effects on human health and ecosystems. It is therefore very crucial to pinpoint the location of high pollutant concentrations over large areas. Ground-based stations, while offering continuous temporal measurements, cannot provide broader spatial coverage for regions like cities. This study uses Tropospheric Emissions: Monitoring Pollution (TEMPO) satellite observations and a machine learning model to estimate high-resolution surface-level NO₂ concentrations over the Greater Toronto Area (GTA), Ontario, Canada. The random forest regression model was trained with input parameters such as hourly tropospheric NO₂ vertical column density (VCD) values and boundary layer height (BLH), which are the two most effective parameters in feature importance. The model achieved a coefficient of determination (R²) of 0.84, a root mean square error (RMSE) of 1.703 µg/m³, and a mean absolute error (MAE) of 0.939 µg/m³, indicating strong and reliable predictive performance. The findings of this research can support air quality forecasting, public health studies, and urban planning decisions, especially in regions with scarce ground-based pollutant data.

9:45am - 10:00am

Learning from Maps to Update Them: A Deep Learning-Based Approach Using Multimodal Airborne Data

Geethanjali Anjanappa, Sander Oude Elberink

University of Twente, The Netherlands

Automatic updating of topographic maps remains a significant challenge, as current workflows still rely heavily on manual interpretation of airborne data. This study proposes a method for identifying topographic changes by learning object representations from existing maps and using them as reference data for change detection. Map-derived labels are used to train independent 2D and 3D segmentation networks that generate semantic predictions from orthoimages and point clouds. Unlike conventional change-detection approaches that require temporally aligned datasets of the same modality, the proposed method directly compares newly acquired airborne data with existing map vectors. Semantic predictions from both modalities are vectorized and selectively fused into polygon geometries, which are subsequently compared with reference map vectors to identify object-level "from–to" changes. The workflow highlights potential change regions and their predicted semantic classes, allowing operators to focus inspection on relevant areas rather than the entire dataset. Detected changes include both real-world developments, such as new construction and demolitions, and inconsistencies in the reference map caused by outdated or inaccurate delineations. To assess the effect of multimodal integration, the workflow is compared with a 2D-only baseline. The results indicate that integrating 3D geometric information can reduce noisy detections and improve the spatial consistency of candidate change objects, particularly for water and bridge classes.

10:00am - 10:30am

Morning Coffee Break
Location: Exhibition Hall "E"

10:30am - 12:00pm

Plenary Session 1
Location: Exhibition Hall "G"

Keynote 1: To be announced

Awards Ceremony:

The U.V. Helava Award
The Samuel Gamble Award

Keynote 2: Professor Marc Pollefeys

12:00pm - 1:30pm

Lunch
Location: Exhibition Hall "E"

1:30pm - 3:00pm

WG II/2A: Point Cloud Generation and Processing
Location: 713A

1:30pm - 1:45pm

LGSSM: Local-to-global state space model for serialized point cloud semantic segmentation

Hao Wu, Li Yan, Huchen Li, Qimeng Li, Longze Zhu, Junjie Yuan, Hong Xie

School of Geodesy and Geomatics, Hubei Luojia Laboratory, Wuhan University

Point clouds have become essential data for describing real-world objects. Accurate and efficient 3D semantic segmentation plays a crucial role in environment understanding and scene reconstruction. However, current segmentation methods still face challenges from unordered data, high computational complexity, limited scene perception, and insufficient generalization. To address these issues, we propose a local-to-global semantic segmentation method based on a state-space model (LGSSM). Specifically, the proposed method uses three-dimensional serialization encoding to serialize point clouds along the x, y, and z directions, effectively addressing the inherent disorder of point clouds and enhancing spatial representation. Then, the local state space model extracts fine-grained local geometric structural information and the global state space model captures the overall scene representation, improving the modeling ability for both short and long distances. Finally, the serialized context aggregation module is utilized to fuse contextual features to promote spatial semantic consistency. Extensive experiments conducted on ScanNet, ScanNet200, and S3DIS demonstrate that our model achieves state-of-the-art segmentation accuracy compared with other existing methods.

1:45pm - 2:00pm

Hierarchical Gaussian Partitioning for Semantic Segmentation of Airborne LiDAR Scenes

Moussa Bendjilali^1,2, Nicola Luminari¹, Pierre Alliez²

¹Alteia, France; ²Inria Sophia-Antipolis, France

In this paper, we present a novel approach to semantic segmentation of airborne LiDAR point clouds that integrates a hierarchical Gaussian Mixture Model (hGMM) within the Superpoint Transformer (SPT) framework. The hGMM constructs a coarse-to-fine representation of the scene by recursively fitting Gaussian components to spatially coherent subsets of the point cloud, resulting in a hierarchical and structured decomposition that serves as a structured token set for the segmentation objective. While Gaussian Mixture Models (GMMs) can virtually fit any distribution, we constrain their use to structured suburban scenes, where their parametric form is naturally suited to represent planar and ellipsoidal geometries, hence allowing parsimonious mixtures. Experimental results on the DALES benchmark demonstrate that our method achieves competitive performance with respect to state-of-the-art approaches, with notable improvements on classes such as ground and buildings. Results on indoor S3DIS confirm the method's intended specificity to outdoor environments. These findings validate hGMM as a principled and effective alternative to heuristic partitioning techniques, integrating stochastic modelling with transformer-based semantic reasoning in large-scale 3D environments.

2:00pm - 2:15pm

MCPF-Net: Multi-stage LiDAR-Image Collaborative Perception Fusion Network for Point Cloud Semantic Segmentation in Urban Scenes

Huchen Li¹, Wubiao Huang¹, Haibing Liu¹, Shihan Chen¹, Bin Liu², Fei Deng^1,3

¹School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China; ²Hinton STAI Institute, East China Normal University, Minhang, Shanghai 200241, China; ³Hubei Luojia Laboratory, Wuhan 430079, China

Point cloud semantic segmentation through multi-modal fusion provides a fundamental basis for surface observation and visual perception tasks. LiDAR provides precise geometric structural information, while optical images offer rich semantic and textural details. However, existing fusion methods still suffer from limited cross-modal perception and insufficient information complementarity. To address these challenges, we propose a multi-stage LiDAR-image collaborative perception fusion network (MCPFNet) for point cloud semantic segmentation in urban scenes. In the mid-stage, the network introduces a geometric-aware fusion (GAFM) and a semantic-aware fusion module (SAFM) to achieve bi-directional injection of structural and semantic features between LiDAR and image modalities. In the later stage, an adaptive feature fusion module (AFFM) is designed to refine semantic representations through gated weighting and bi-directional attention mechanisms. Extensive experiments demonstrated that MCPFNet achieved the best mIoU scores of 74.51%, 72.10%, and 95.15% on the ISPRS Vaihingen, FRACTAL, and N3C datasets, respectively, validating its superior performance in multi-modal semantic segmentation.

2:15pm - 2:30pm

Cross-Sensor Robustness and Spatial Generalization for 3D Railway Point Cloud Semantic Segmentation

Arshia Ghasemlou, Mario Soilán, Jesús Balado, Belén Riveiro

CINTECX, GeoTECH group, Universidade de Vigo

This contribution investigates the cross-sensor and spatial generalization of deep learning methods for 3D semantic segmentation in railway environments. Although current models achieve high accuracy on large benchmark datasets, their robustness under real-world acquisition variability remains insufficiently understood. To address this gap, three state-of-the-art architectures—Point Transformer v3, Swin3D, and MinkUNet—were trained on the SemanticRail3D dataset and evaluated on a newly acquired 120-m railway section captured with three heterogeneous LiDAR systems: a Faro Focus S150+ terrestrial laser scanner, a CHCNAV RS10 handheld device, and a GeoSLAM ZEB Go SLAM-based scanner.

The case-study point clouds were carefully registered, normalized, voxelized, and manually annotated to provide consistent ground truth across sensors. A standardized preprocessing and test-time augmentation pipeline was applied to ensure compatibility with the training domain. Generalization performance was analysed through per-class IoU, cross-model agreement, and sensor-dependent degradation patterns. Results show significant variability across acquisition platforms, with denser, low-noise scans enabling better transferability, while sparser SLAM-based point clouds remain challenging for thin or small components such as overhead wires.

To mitigate cross-sensor variability, an IoU-weighted ensemble strategy was introduced, leveraging complementary model strengths without requiring retraining. This ensemble consistently improved or matched the performance of individual models on the case-study datasets.

Overall, the study demonstrates the importance of evaluating semantic segmentation models under realistic multi-sensor conditions and provides a practical benchmark and methodology for assessing domain-shift effects in railway point clouds.

2:30pm - 2:45pm

Revisiting NeRF for Street Scene Point Cloud Semantic Segmentation in the Era of 3DGS

Yuzhou Zhou

University of Oxford, United Kingdom

Accurate semantic segmentation of urban point clouds is fundamental for autonomous driving and city mapping. Recent advances in neural scene representations, such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS), have significantly improved photorealistic reconstruction quality. However, 3DGS is primarily designed for small-scale, object-centric scenes with dense viewpoints, and its optimization becomes sub-optimal in large-scale street scenes with trajectory-constrained observations, leading to semantic errors and distorted geometry.

In this work, we revisit NeRF-based scene representation in the era of 3DGS to address these challenges. Our method integrates the explicit and efficient modeling strategy of 3DGS with the surface-constrained sampling nature of NeRF. Specifically, we employ Deformable Neural Mesh Primitives (DNMPs) to jointly encode geometry and semantics, enabling efficient ray–mesh intersection sampling and neural field interpolation. This formulation achieves 3D-annotation-free point cloud semantic segmentation by leveraging rendered image supervision.

Experiments on the KITTI-360 dataset demonstrate that our approach surpasses the Street Gaussians baseline in overall mIoU and across most semantic categories. The improvement mainly stems from reducing semantic errors caused by limited viewpoints during 3D Gaussian optimization, providing a robust and scalable solution for street scene semantic understanding.

2:45pm - 3:00pm

Extraction of Pole-like Road Objects from MMS Point Clouds Using Deep Learning and Geometric-Topological Feature Fusion

Shu Su, Masataka Shirai, Hiroyuki Yokota

AERO TOYOTA CORPORATION, Japan

This paper presents a fusion framework for the automatic extraction of pole-like road objects—such as traffic lights, road signs, streetlights, and utility poles—from Mobile Mapping System (MMS) point clouds. The proposed method integrates KPConv-based semantic segmentation with geometric–topological reasoning to achieve structural completion and false-positive suppression without retraining or additional annotated data. The framework was trained on 8 km of manually labeled MMS data from the Kinki region, Japan, and evaluated on large-scale unseen data from Hokkaido (≈ 26 km, 2.53 billion points) and the Paris–Lille-3D benchmark (France) acquired with a different LiDAR sensor. The proposed approach significantly outperformed the KPConv baseline. On the Hokkaido dataset, the F₁-score improved from 0.8263 to 0.8689 (+0.0426), successfully reconstructing lamp tops, signal arms, and previously unseen snow delineator posts (snow poles). On the Paris–Lille-3D benchmark, recall increased by 15.5 points, yielding an overall F₁-score gain of +0.0802. The 26 km Hokkaido dataset was processed in less than 13 hours on a single NVIDIA Quadro RTX 8000. These results demonstrate that the proposed deep learning–geometry–topology fusion achieves robust, scalable, and efficient performance across diverse geographic and sensor domains, supporting nationwide road-asset mapping and digital-twin generation.

1:30pm - 3:00pm

WG III/1H: Remote Sensing Data Processing and Understanding
Location: 713B

1:30pm - 1:45pm

Satellite-based Monitoring of Tree Restoration in Ethiopia

Tenaw Workie¹, Alemu Gonsamo¹, Martin Brandt², Philippe Ciais³, Max Gaber², Petri Pellikka⁴

¹McMaster University, Canada; ²University of Copenhagen; ³Laboratoire des Sciences du Climat et de l’Environnement, France; ⁴University of Helsinki

This study presents a deep learning framework integrating Sentinel‑2, Sentinel‑1, and GEDI LiDAR to map Ethiopia’s canopy height at 10‑m resolution from 2019–2024. A shift‑aware loss function was employed to correct geolocation errors inherent in GEDI L2A footprints, and height‑weighted penalties addressed systematic underestimation in tall forests. Results show a national net gain of 23,537 km² in tree cover >8 m, reversing long‑standing deforestation trends. Gains concentrated in low‑to‑mid canopy strata (<20 m), strongly associated with major restoration interventions including the Green Legacy Initiative (GLI), REDD+, and the Sustainable Land Management Program (SLMP). Losses persist in western and southeastern highlands, driven by agricultural expansion, wildfires, infrastructure development, and large‑scale agricultural investments. This work demonstrates the operational value of multi‑sensor deep learning for near‑real‑time monitoring of restoration outcomes in data‑scarce regions.

1:45pm - 2:00pm

Synthetic Forest: A UAV Laser Scanning Benchmark Dataset for Individual Tree Segmentation, Classification, and Wood Volume Estimation

Habib Pourdelan, Martin Tomko, Kourosh Khoshelham

University of Melbourne, Australia

Accurate tree-level analysis in forests via LiDAR scanning is essential for biomass estimation, canopy structure assessment, and carbon monitoring, yet remains constrained by the scarcity of large-scale annotated LiDAR datasets and the high cost of manual annotation. To address this, we present a novel approach that integrates 3D tree models with UAV-borne LiDAR simulation to generate synthetic forest point clouds with comprehensive annotations. Our approach generates diverse woodland, open, and closed forest structures, producing Synthetic Forest, a benchmark datasets of three 1 ha scenes containing 38–47 million points each, with densities of 3300–3860 points/m² and average spacing of 2 cm. Each scene contains between 70 and 216 individual trees, along with understory vegetation, deadwood, stumps, rocks, and bushes, all automatically annotated with semantic classification IDs, instance IDs, and tree IDs for volume estimation. The proposed pipeline provides automated, error-free ground truth for leaf-wood classification, instance segmentation, and wood volume estimation. We provide a guideline for generating forest plots and utilizing the datasets for diverse forestry tasks. By eliminating the need for costly field data collection, our pipeline offers scalable, customizable synthetic datasets that accelerate forest inventory. The Synthetic Forest dataset is publicly released via Zenodo (DOI: 10.5281/zenodo.17568131), enabling reproducible research and supporting further developments in forest monitoring and management.

2:00pm - 2:15pm

Synergizing foundation model transfer and phenological information for fine-grained forest segmentation

Youcef Ben Ghorbel, Guneet Mutreja, Mareike Weishaupt, Jiaojiao Tian

German Aerospace Center (DLR), Germany

Accurate mapping of tree species is essential for forest monitoring, biodiversity assessment, and ecological applications. Very high-resolution UAV imagery provides detailed structural and spectral information, but species-level segmentation remains challenging due to limited annotated data, complex crown geometries, and strong visual similarity among taxa. Recent Remote Sensing Foundation Models (RSFMs) offer new possibilities by providing transferable representations learned from large, multimodal geospatial datasets.

This contribution introduces a two-phase framework that combines foundation model initialization with multi-temporal UAV imagery to enhance fine-grained forest segmentation. In Phase 1, a DeepLabv3+ network is initialized using FoMo-Net, a ViT-based RSFM pre-trained on the multi-scale FoMo-Bench benchmark. This initialization enables strong generalization from heterogeneous global forest datasets to very high-resolution UAV scenes. In Phase 2, phenological information is integrated by fusing May and September UAV acquisitions through temporal difference composites and pseudo-label refinement, allowing the model to resolve species-specific seasonal patterns.

Experiments on the Québec Trees Dataset, covering 14 species at 0.02 m GSD, demonstrate substantial performance gains. Foundation model initialization improves overall accuracy from 52.79% to 71.21%, while incorporating multi-temporal cues further increases accuracy to 78.21%. The results highlight the complementary roles of structural priors learned by RSFMs and phenological information captured by UAV time series for detailed forest species mapping.

2:15pm - 2:30pm

Combining specialized Sentinel-2 time series features with AlphaEarth Foundations for forest type mapping

Benedikt Hiebl¹, Nicola Alessi², Giacomo Calvia³, Alessandro Bricca³, Gianmaria Bonari⁴, Giulio Zangari³, Stefan Zerbe^5,6, Martin Rutzinger¹

¹University of Innsbruck, Austria; ²Italian Institute for Environmental Protection and Research, Rome, Italy; ³University of Bolzano/Bozen, Italy; ⁴University of Siena, Italy; ⁵University of Göttingen, Germany; ⁶University of Hildesheim, Germany

Accurate mapping of forest types and vegetation characteristics is essential for monitoring biodiversity and forest dynamics. Traditional Deep Learning (DL) models trained on Sentinel-2 time series achieve high performance, but require extensive preprocessing and sensor-related fine-tuning. In this study, we evaluate the recently introduced AlphaEarth Foundations (AEF) embeddings, which is global, multi-modal feature representation of the earths surface, for forest mapping in Italy. We compare a) a Multi-Layer Perceptron trained on AEF, b) a Time-Series Transformer trained on Sentinel-2 annual time series and CHELSA climate data, and c) a Cross-Attention fusion model combining both feature sets. Using 5-fold cross-validation in a regression and a classifaction task on two datasets (evergreen broad-leaved tree cover ETC, forest vegetation type FVT) we find that the combined model consistently outperforms the single-source approaches (RMSE = 0.161, Accuracy = 0.757). AEF-based models achieve comparable accuracy to the Sentinel-2-based model while reducing extensive time series preprocessing and training time by an order of magnitude. Feature attribution using integrated gradients reveals that AEF provides stable baseline representations, while Sentinel-2 inputs add phenology-related detail. The results show, that integrating generalized embeddings with specialized spectral-temporal features improves predictive performance for forest mapping.

2:30pm - 2:45pm

Tree species classification based on detailed shape evaluation of bark and leaf using deep learning

Tomohiro Mizoguchi

Sanyo-Onoda City University, Japan

In Japan, many urban park trees are becoming large and aged, increasing the risk of structural failures caused by extreme weather events and biological deterioration. Effective management therefore requires reliable risk assessment, for which accurate tree species identification is one of the fundamental prerequisites. However, species identification still depends heavily on visual assessment by skilled professionals, posing challenges in efficiency and objectivity. This problem is particularly significant for broad-leaved trees, which exhibit high species diversity and morphological variability. In addition, labor shortages have intensified the demand for automated and reliable classification techniques. This study proposes a high-accuracy classification method for broad-leaved tree species using ground-level images captured with a commercially available RGB camera and deep learning. The proposed method extracts small local patches that capture species-specific visual features, such as leaf shape and bark texture, commonly used by professional arborists for species identification. These local features are evaluated individually using deep learning models, allowing fine-scale visual characteristics to be effectively utilized for classification. To address variability in outdoor imaging conditions, including illumination changes, shadows cast by branches and leaves, and moss attachment, multiple patches are classified independently and the results are integrated through majority voting, improving classification robustness. Experiments were conducted on seven tree species commonly found in Japanese urban parks: cherry, ginkgo, zelkova, konara oak, sawtooth oak, plane tree, and flowering dogwood. The results demonstrate that the proposed method achieves a maximum classification accuracy of approximately 95% under real-world conditions, demonstrating its effectiveness for practical urban tree management.

1:30pm - 3:00pm

WG I/6A: Orientation, Calibration and Validation of Sensors
Location: 714A

1:30pm - 1:45pm

Proposal and Verification of AI-Based Automatic Geometric Correction Technology for Satellite Images Using Open Access Basemaps

Han-Gyeol Kim, Jaewon Hur

Data Science Department, TelePIX, Korea, Republic of (South Korea)

Geometric correction of satellite images is an essential pre-processing step for accurate geospatial analysis, but non-experts often face practical limitations because detailed sensor models and Ground Control Point data are not readily accessible. Traditional methods rely on physical sensor models or the Rational Function Model (RFM) using vendor-provided Rational Polynomial Coefficients (RPC). However, this information is often unavailable or lacks sufficient accuracy. This paper proposes a two-stage framework that utilizes AI matching technologies and open access data to automatically correct satellite images lacking georeferencing information. In Stage 1, a coarse Affine correction is executed using SuperPoint and LightGlue with an open basemap (Sentinel-2). In Stage 2, precise corresponding points are extracted through patch-based hierarchical LoFTR matching, and 3D GCPs are generated utilizing the SRTM. Subsequently, sensor-independent RPC are robustly estimated through the rpcfit library, and the final geometrically corrected image is generated through resampling.

This framework was verified by applying it to 4.8m resolution BlueBON satellite images that lack georeferencing information. In seven experimental regions with diverse geographical characteristics, an average Root Mean Square Error (RMSE) of 8.050m (1.68 pixels based on BlueBON resolution) referenced to the Sentinel-2 basemap, and an average of 9.02m (1.88 pixels) referenced to Google Maps, was achieved. This result demonstrates that it is possible to precisely correct 4.8m medium-resolution images using a 10m open basemap, providing a practical, accessible, and automated geometric correction solution for general users.

1:45pm - 2:00pm

An Adaptive Multi-Scale Star Centroid Localization Algorithm with Bayesian Iterative Weighting and Performance Analysis

Tiyou Zhou¹, Mi Wang¹, Huang Zhang², Tengteng Dong¹, Xinsheng Wang¹, Yifan Lou¹, Zhongyu Hu³

¹State Key Laboratory of information Engineering in Surveying, Mapping and Remote Sensing(LIESMARS), Wuhan University, Wuhan 430072, China; ²Hubei Subsurface Multi-scale Imaging Key Laboratory, School of Geophysics and Geomatics, China University of Geosciences, Wuhan, 430074, China; ³Chang Guang Satellite Technology Company, Ltd., Changchun 130102, China

Star centroid localization accuracy fundamentally limits spacecraft attitude determination precision. Existing methods face a critical accuracy-efficiency trade-off: traditional intensity-weighted approaches achieve computational efficiency (<1 ms/star) but suffer from poor noise robustness, while Gaussian fitting and deep learning methods provide high accuracy at prohibitive computational costs. We address this fundamental limitation by developing a principled Bayesian Multi-Scale Adaptive Iteratively Weighted (BMAI) centroid localization algorithm that achieves high accuracy approaching theoretical limits while maintaining real-time computational efficiency. The algorithm integrates four key technical contributions: (1) SNR-adaptive window extraction with robust threshold estimation, (2) regularized iteratively weighted framework with proven convergence properties, (3) multi-scale fusion with SNR-dependent weighting, and (4) gradient-based refinement to mitigate systematic bias. Rigorous theoretical analysis establishes convergence guarantees, derives error bounds, and evaluates Cramér-Rao Lower Bound (CRLB) efficiency. Comprehensive evaluation on 16,500 synthetic star images across six diverse imaging scenarios demonstrates that under high-SNR conditions (SNR >25, n=2,000), BMAI achieves mean RMSE of 0.0120 pixels (95% CI: [0.0116, 0.0124] pixels), representing a 98.6% relative improvement over intensity-weighted centroiding (0.857 pixels), 35.8% improvement over Gaussian fitting (0.0187 pixels) and 95.3% improvement over CNN methods(0.2566 pixels). The algorithm maintains computational efficiency of 0.89ms per star—8.7× faster than Gaussian fitting—while achieving CRLB efficiency of 79.2%. Robustness analysis demonstrates stable performance across SNR range 3-100 with graceful degradation under challenging conditions. The BMAI algorithm fundamentally resolves the accuracy-efficiency trade-off in star centroid localization through principled Bayesian inference and multi-scale processing.

2:00pm - 2:15pm

Investigating PhaseOne Cameras and its IIQ Format for Photogrammetric Applications

Fabio Remondino¹, Robert Bosch²

¹3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK), Trento, Italy; ²PhaseOne

This paper presents a systematic investigation of the PhaseOne native IIQ format for drone and aerial cameras (in particular the recent iXM-RS250 and the iMX-GS120), focusing on the influence of different compression levels on geometric, radiometric and computational aspects of the photogrammetry pipeline. The aim of the presented research and experiments is to demonstrate the actual quality of these (compressed) images for photogrammetric purposes.

2:15pm - 2:30pm

Comprehensive Evaluation of Small-Format Multi-Head Camera Systems for 3D Topographic Mapping

Thirawat Bannakulpiphat^1,2, Wilfried Karel¹, Ana-Maria Loghin^3,4, Valeria-Ersilia Oniga³, Norbert Pfeifer¹

¹TU Wien, Department of Geodesy and Geoinformation, Austria; ²Chulalongkorn University, Mapping and Positioning from Space Technology Research Center, Department of Survey Engineering, Thailand; ³Technical University ”Gheorghe Asachi” of Iasi, Department of Terrestrial Measurements and Cadastre; ⁴Federal Office of Metrology and Surveying (BEV), Vienna, Austria

Small format multi-head cameras are becoming available and can be flown on light drones to provide simple access to oblique and nadir views of built-up areas. A number of missions with different parameters (flying height, etc.) are investigated to understand the trade-offs in applying those sensors and question the established accuracy laws. We investigate and quantify the ability to completely cover the facades using those sensors in the different scenarios.

2:30pm - 2:45pm

Geometric performance of the small satellite CE-SAT-IE carrying an optical sensor derived from the COTS camera Canon EOS R5

Junichi Takaku¹, Takeo Tadono², Sota Hirayama², Yuki Asahi³, Fumi Ohgushi¹

¹Remote Sensing Technology Center of Japan; ²Earth Observation Research Center, Japan Aerospace Exploration Agency; ³Canon Electronics Inc.

In recent years, commercial small optical satellites, e.g., Skysat, BlackSky, and PlanetScope, have become widely used for a variety of Earth remote sensing applications, providing high-resolution images with sub-meter resolution. They are operated in a constellation of multiple satellites, which compensates for the spatial and temporal limitations of traditional satellite observations. Moreover, their data acquired during stereo viewing have been experimentally used to generate digital surface models (DSMs). The CE-SAT-IE is a small optical satellite developed by Japan’s commercial company Canon Electronics Inc. (CE) and was launched on 17 February 2024, by Japan Aerospace Exploration Agency’s (JAXA’s) H3 launch vehicle test flight no.2. It is equipped with an optical frame sensor derived from a commercial off-the-shelf (COTS) camera Canon EOS R5. The ground sampling distance (GSD) is 0.8 m with a scene size of 6.5 km × 4.3 km. The calibration and validation of the sensor are being conducted in collaboration between CE and JAXA, drawing on JAXA’s extensive experience with past satellites. The geometric and radiometric performance of the sensor is analysed in detail, and the results will be used for its subsequent mission, which may involve a constellation for stereo observation to generate high-quality DSMs. This paper reports initial results for geometric calibration and validation of the sensor using ground control points (GCPs) and the experimental generation of DSMs from stereo observation images using the calibrated parameters.

2:45pm - 3:00pm

Hybrid Calibration between a Laser Scanner and Smartphone Camera Using hourglass targets and Deep Learning

Lorenz Krefft, Ludwig Hoegner

Munich University of Applied Sciences, Germany

This paper presents a novel hybrid calibration pipeline that jointly estimates the spatial and temporal alignment between a handheld laser scanner and a smartphone camera without any hardware synchronization. The method combines deep-learning-based target detection with classical geometric calibration using 2D-3D correspondences derived from black and white hourglass planar targets. Target centers are precisely localized in both the RGB images and the 3D point cloud using a symmetric templatematching scheme, enabling robust solution of the perspective-n-point (PnP) problem for spatial calibration.

To address the lack of hardware synchronization, we introduce a temporal calibration method that exploits geometric correspondences between rendered intensity images and camera frames.

On a Lixel L2 Pro scanner with a Huawei P20 Pro camera, the pipeline achieves a median Reprojection error of 0.76 px for static calibration and 2.19 px across 91 dynamic evaluations. The approach enables accurate image-pointcloud fusion for scanners without syncronisation interfaces and provides a foundation for colorization, image analysis, and redensification of laser data.

1:30pm - 3:00pm

WG I/2A: Mobile Mapping Technology
Location: 714B

1:30pm - 1:45pm

Evaluation of VGGT with ALS Point Clouds for Large-Scale Dense Mapping

Yandi Yang, Naser El-Sheimy

University of Calgary, Canada

Accurate large-scale dense 3D reconstruction is fundamental for geospatial mapping, robotics, and autonomous navigation. Conventional photogrammetric workflows can reconstruct environments from ground-level imagery but often suffer from cumulative drift over kilometer-scale trajectories and require extensive calibration. Recent advances in feed-forward 3D reconstruction, notably the Visual Geometry Grounded Transformer (VGGT), have demonstrated the ability to generate dense point clouds directly from RGB images without explicit optimization. VGGT jointly estimates camera poses, depth, and dense geometry from multiple uncalibrated frames in a single forward pass. However, its scalability is limited by two factors: (1) the lack of absolute metric scale and (2) high GPU memory demands.

Many national mapping agencies (e.g., USGS, IGN, Ordnance Survey) have released Airborne Laser Scanning (ALS) datasets covering vast urban and rural areas. These high-quality aerial point clouds provide globally consistent, metrically referenced data that can serve as external constraints for ground-level reconstructions. Building upon this opportunity, we propose VGGT-ALS, a framework that leverages open ALS point clouds to enable VGGT-based systems to produce large-scale, metrically accurate dense maps from mobile mapping imagery.

1:45pm - 2:00pm

Semantic-Guided Geometric Feature Extraction from Dense LiDAR for Vehicle Localization with Abstract Maps

Mohamad Wahbah¹, Rozhin Moftizadeh¹, Christopher Klugmann², Daniel Kondermann², Ingo Neumann¹, Hamza Alkhatib¹

¹Geodetic Institute, Leibniz University Hannover, Germany; ²Quality Match GmbH, Germany

High-precision vehicle localization in GNSS-denied urban areas requires alternatives to costly HD maps. In this paper, we present a novel framework for feature extraction and benchmark generation to enable high-precision localization using abstract LoD2/DTM maps as a replacement for HD maps. Our first contribution, a semantic-geometric pipeline, processes dense LiDAR and camera data to extract map primitives. This is accomplished by a RANSAC-fitted ground plane extraction step, followed by a semantic filter that discards dynamic objects. Finally, geometric clustering (HDBSCAN) and RANSAC plane fitting isolate large-scale vertical facades. Our second contribution, a multi-stage GT generation framework, resolves annotation ambiguity using a Human-In-The-Loop (HITL) system. A robust 2D pose is computed by finding the geometric median of bootstrapped transformation samples on the SE(2) manifold, which is then refined to a 6-Degree-of-Freedom pose via point-to-plane ICP, before being validated by a human for a final check. We evaluated our feature extraction pipeline against the generated benchmark, achieving 95.04% precision and 83.74% recall. An analysis of this performance shows the pipeline correctly rejects small, ambiguous features while achieving high recall on all large, stable features, proving its suitability for a robust localization filter.

2:00pm - 2:15pm

Enhanced Path Planning Strategies for Drone-Based Infrastructure Monitoring Under Signal -Denied Conditions

Phillip Kim¹, Junhee Youn², Jiheon Jung¹

¹Department of Future&Smart Construction Research, Korea Institute of Civil Engineering and Building Technology, Republic of Korea; ²Corresponding Author, Department of Future&Smart Construction Research, Korea Institute of Civil Engineering and Building Technology, Republic of Korea

As civil infrastructure in South Korea ages, the demand for systematic monitoring has grown. While Unmanned Aerial Vehicles (UAVs) provide a safer alternative to manual inspections, Global Navigation Satellite System (GNSS) signal degradation beneath bridge structures remains a critical barrier to autonomous flight. Unlike existing hardware-centric or SLAM-based solutions that require high costs and computational overhead, this study proposes a robust, algorithm-based path planning methodology using a 3D spatial grid framework.

Two strategies were evaluated through field tests at the Bukhangang Bridge: the Photography Point Method (PPA) and the GNSS Non-Shadowing Area Method (WPS-GNSA). Results demonstrated that WPS-GNSA significantly enhances signal reliability by strategically positioning waypoints outside electromagnetic shadow zones. At an optimal 19-meter separation distance, WPS-GNSA maintained GNSS Level 5 connectivity—the threshold for recommended autonomous flight—for 9.4% of the duration and Level 4 or higher for 79.7%, whereas the PPA peaked at Level 4.

These findings indicate that WPS-GNSA enables reliable autonomous inspections using standard commercial drones without specialized hardware modifications. While the current model relies on pre-existing digital blueprints, future research will integrate AI-based navigation and real-time environment perception to enhance scalability and adapt to dynamic obstacles in complex infrastructure environments.

2:15pm - 2:30pm

UAV-Assisted Collaborative Positioning in GNSS-Denied Environments

Andrea Masiero¹, Charles Toth², Luca Morelli³, Pawel Trybala³, Fabio Remondino³

¹University of Padua, Italy; ²The Ohio State University, US; ³Fondazione Bruno Kessler, Italy

Accurate and reliable positioning is fundamental for the development of a wide number of applications. Despite in most of the regular working conditions the use of a GNSS receiver is sufficient for properly solving the problem, determining a reliable solution in challenging conditions can be difficult. In such conditions, exploiting the information shared by different sensors and platforms can be useful for reliably determining the platforms' positions. In this work, both ground and aerial platforms are considered: each platform is assumed to be provided with communication capabilities, which can be exploited to share its knowledge. Since GNSS positioning is usually less effective at ground level than on a flying platform, the aerial platforms are assumed to be provided with good GNSS-based positioning information. Instead, GNSS is assumed to be unavailable to the ground vehicles, which, instead, can use LiDAR/visual odometry for dead-reckoning positioning, UWB inter-platform ranging for relative positioning, and camera-based positions, provided by aerial platforms, for assessing their georeferenced positions. This work focuses on assessing the positioning performance when exploiting vision-based information about the georeferenced ground vehicle positions from a camera mounted on a UAV. The camera acquired oblique views of the scene while moving over the case study area during the test. YOLO was used to detect cars from the image frames and the vehicle coordinates have been extracted from 3D reconstructions obtained from the MoGe-2 network. Average errors at meter level on the determined georeferenced coordinates were obtained when combining UWB vehicle-to-vehicle ranges with MoGe-2 reconstructions.

2:30pm - 2:45pm

Melbourne multi-sensor urban positioning and mapping dataset

Junjie Zhang^1,2, Marko Radanovic², Yuan Zhao², Amir Khodabandeh², Kourosh Khoshelham²

¹University of Southern Queensland; ²The University of Melbourne

Reliable positioning and mapping in dense urban environments remain challenging due to signal blockage, multipath, and dynamic scenes. Progress on multi-sensor integrated positioning and visual/lidar SLAM has been driven by open datasets, yet most existing resources are either perception-centric with limited raw navigation data, focused on controlled environments, or built around outdated software platforms and/or data formats. In this paper, we present the Melbourne Multi-Sensor Urban Positioning and Mapping Dataset, a new resource targeting urban vehicle navigation and mapping tasks. The dataset was collected using a custom mobile mapping platform equipped with a tactical-grade INS, a survey-grade Leica GNSS receiver, a low-cost UBLOX GNSS receiver, a high-resolution Ouster OS1 128 lidar, and four industrial FLIR cameras providing 360° coverage. Seven data collection trips were recorded on dynamic streets in several inner suburbs of Melbourne, including multiple closed loops and a repeated route with day–night variation. For better compatibility and future-proofing, all raw data are provided as standard ROS2 message streams in MCAP format, complemented by commonly used individual formats and GNSS products for multi-sensor integrations. We benchmark three GNSS--based positioning packages (RTKLib, Net_Diff and Ginan) and four state-of-the-art lidar(-inertial) odometry/SLAM methods (FAST-LIO2, KISS-ICP, KISS-SLAM and PIN-SLAM), demonstrating the applicability and compatibility of our dataset for modern positioning and mapping software pipelines. The dataset is designed as a robust, ROS2-native testbed for research on GNSS/IMU/lidar/camera fusion for the testing and validation of vehicle positioning and mapping in urban environments, which is available open-source at https://github.com/zjjdes/melbourne_dataset.

2:45pm - 3:00pm

CMLGF-LIO: A Cross-Modal Local-Global Fusion Framework for Robust LiDAR-Inertial Odometry

Xin Yang, Li Yan, Hong Xie

School of Geodesy and Geomatics, Wuhan University, Wuhan, China

Accurate and robust localization is essential for autonomous vehicles and mobile robots operating in complex, dynamic environments. However, existing learning-based LiDAR-inertial odometry (LIO) methods typically rely on simple weighted fusion or purely global attention, which may not fully exploit cross-modal complementarity. In this paper, we propose CMLGF-LIO, a cross-modal local-global fusion framework that improves LIO accuracy and robustness. At the local level, we design a Local Split-Attention (LSA) module that injects IMU-derived motion priors into local LiDAR feature groups and adaptively allocates attention weights, suppressing redundant information while preserving discriminative local geometry for fine-grained fusion. At the global level, we introduce a Global MLP-Mixer (GMM) module that aligns LiDAR and IMU token sequences and models global cross-modal interactions using an MLP-Mixer backbone. Experiments demonstrate that CMLGF-LIO is more robust than learning-based baselines under challenging conditions, and ablation studies validate the effectiveness of the proposed local-global fusion strategy.

3:00pm - 3:15pm

A Low-Cost Vehicle-Based Mobile Mapping System: LiDAR SLAM with Multi-GNSS/IMU Fusion

Rami Tamimi¹, Baris Suleymanoglu², Abdelgwad Elashry³, Toth Charles¹

¹The Ohio State University, United States of America; ²Yildiz Technical University, Istanbul, Türkiye; ³University of Hertfordshire, New Administrative Capital, Egypt

This paper presents a low-cost mobile mapping system built on a consumer vehicle (2026 Tesla Model Y) equipped with a roof-mounted Velodyne VLP-16 LiDAR and three post-processed kinematic (PPK) GNSS receivers. A processing pipeline was developed that fuses scan-to-scan LiDAR registration via KISS-ICP with multi-receiver PPK trajectories to produce georeferenced 3D point clouds. The system was tested on a 2 km loop on the Ohio State University campus. KISS-ICP achieved a registration fitness of 1.0 across all 3,400+ frames, producing locally crisp point clouds. A yaw-only Procrustes alignment followed by interactive 7-parameter refinement maps the SLAM trajectory into an East-North-Up geodetic frame. We document the complete pipeline architecture, including automated GPS time synchronization, multi-receiver vehicle pose estimation, and a streaming LAS export capable of handling 70+ million points. We systematically evaluate ten post-hoc trajectory correction strategies and identify a fundamental trade-off between inter-frame consistency (point cloud crispness) and absolute geodetic accuracy. The primary unresolved challenge is a staircase artifact caused by ~40 m of accumulated SLAM drift over the loop, which cannot be corrected without degrading local registration quality. We conclude that loop closure detection and pose graph optimization within the SLAM pipeline are necessary to resolve this tension and outline a path toward survey-grade mobile mapping from consumer vehicle platforms.

1:30pm - 3:00pm

WG III/8G: Remote Sensing for Agricultural and Natural Ecosystems
Location: 715A

1:30pm - 1:45pm

Biomass Distribution Mapping of Boreal Forests using GEDI, Sentinel-2, and SRTM Data

Chandra Sekhar Utla¹, Brijesh Kumar Yadav¹, Vaasudevan Srinivasan², Rakesh Kumar Mishra², Ajay Dashora¹, Yun Zhang²

¹Indian Institute of Technology Guwahati, India; ²University of New Brunswick, Fredericton, Canada

Estimating carbon stock is important for understanding ecosystem dynamics and mitigating climate change. However, biomass mapping in boreal forests faces challenges due to harsh conditions and limited ground truth data for large scale studies. This study presents a parametric model for accurate biomass estimation in the Acadia and Taiga Forest using GEDI Level 4A, Sentinel-2, and SRTM DEM data. We integrated these datasets, and developed the parametric model consisting of spectral bands, vegetation indices, and topographic information with regression techniques, Random Forest and K-nearest neighbour. Results showcase performance of the parametric model with relative weights of variables for accurate Aboveground Biomass Density (AGBD) predictions for the two forest sites. With an average RMSE ranging between 9 Mg/ha to 31 Mg/ha and R^2 values of 0.54 to 0.60, the study reveals the importance of variables like slope, aspect and specific vegetation indices along with raw bands of Sentinel-2 data. Results also demonstrate potential and accuracy limitations of the proposed model with for biomass estimation with high resolution open-source satellite data without ground control. Further research include assessing the model robustness across diverse ecosystems and geographical settings, contributing to sustainable resource management practices.

1:45pm - 2:00pm

Aboveground biomass estimation using a transformer framework with multi-temporal Sentinel-1/2 data and growth constraints

Wen Zhang, Baoxin Hu

York University, Canada

Accurate estimation of aboveground biomass (AGB) is essential for quantifying forest carbon stocks, monitoring ecosystem change, and supporting greenhouse-gas reporting frameworks. While field measurements remain the benchmark, their limited spatial coverage has driven increasing reliance on remote sensing. Existing global AGB products such as ESA CCI Biomass and GEDI represent major advances but still suffer from signal saturation, sparse sampling, and limited ability to resolve fine-scale structural variation, highlighting an ongoing gap in effectively fusing information from different sensors. Recent studies combining Sentinel-1 (S1) SAR and Sentinel-2 (S2) multispectral imagery have demonstrated improvement in biomass modelling, with machine learning and deep learning achieving R² values from 0.57 to 0.80. However, most work focuses on tropical or broadleaf forests, leaving boreal mixedwood systems underrepresented, and models often rely on short-term composites that overlook multi-year temporal dynamics important for distinguishing long-term growth from seasonal phenology. This study addresses these gaps by developing a Transformer-based deep learning framework that integrates multi-temporal S1 and S2 time series and incorporates Growth & Yield (G&Y) variables as temporal constraints. By leveraging complementary radar–optical interactions and long-range temporal dependencies, the model is designed to reduce signal saturation, enhance structural sensitivity, and improve generalizability across heterogeneous boreal forest conditions.

2:00pm - 2:15pm

Spatio-Temporal Inversion of Forest Fuel Moisture Content Using Multi-Source Remote Sensing: A Deep Learning Framework Incorporating Vegetation Spatial Autocorrelation

Cui Zhou, Junxiang Zhang, Wenjun Wang

Central South University of Forestry and Technology, China

Fuel Moisture Content (FMC) serves as a vital indicator for monitoring vegetation health and predicting wildfire risk. While existing approaches have largely emphasized temporal variations in FMC, they frequently overlook the inherent spatial clustering patterns of vegetation, leading to compromised spatial prediction accuracy. To overcome this limitation, we introduce a Transformer-based Spatio-Temporal Estimation Framework (TSTEF) that preserves sensitivity to temporal dynamics while incorporating spatial aggregation mechanisms to achieve robust and spatiotemporally consistent FMC estimates. The framework combines spatial autocorrelation analysis with gated recurrent unit (GRU)-based temporal modeling to effectively capture spatiotemporal dependencies, and utilizes Triangular Topology Aggregation Optimization (TTAO) for hyperparameter calibration. The proposed framework was validated using Sentinel-1/2 imagery and MODIS products in California, USA, where it demonstrated: (1) outstanding performance with an average R² > 0.8, MAE < 9%, and relative RMSE of 12.35%; (2) strong agreement between estimated FMC distributions and ground observations, with wildfire burned areas significantly expanding when FMC fell below the 120% critical threshold; and (3) excellent generalization ability during cross-regional validation, achieving relative RMSE values of 20.46% in France, 25.62% in Spain, and 20.76% in Colorado. This study provides a reliable analytical framework for wildfire risk early warning and contributes meaningful insights for ecosystem management amid global environmental change.

2:15pm - 2:30pm

Tracking the efficacy of prescribed burns in three phases: fuel removal, wildfire mitigation, and vegetation recovery

Tianjia Liu¹, Kento Otobe¹, Makoto Kelp²

¹Department of Geography, University of British Columbia, Vancouver, BC, Canada; ²Department of Atmospheric Sciences, University of Utah, Salt Lake City, UT, United States

In the United States (U.S.), aggressive suppression policies in the 20th century reduced wildfires in the short term, but accumulated fuels contributed to increased wildfire risk in the long term. Land managers are slowly reintroducing the use of fuel treatments, including prescribed (Rx) fires, to remove fuels and mitigate future wildfires. To date, efforts to systematically quantify the efficacy of fuel treatments for wildfire mitigation have been limited in spatial and temporal scope or rely on proxies, such as low-intensity wildfires. Here we use the 30-m Harmonized Landsat and Sentinel-2 (HLS) dataset to analyze timeseries of vegetation “greenness” indices, such as the Normalized Burn Ratio (NBR), with observations up to every 2-3 days. We apply a causal inference approach, difference-in-differences (DID), on HLS-derived timeseries of NBR to compare outcomes in treated and surrounding control areas in three different time phases: 1) post-treatment fuel reduction, 2) wildfire-induced burn severity, and 3) post-wildfire vegetation recovery. As a case study, we targeted 37 Rx fires that preceded and intersected the 2024 Park Fire, a large wildfire in northern California. We show statistical evidence that Rx fires reduce fuel loads (12 Rx fires), wildfire burn severity (12 Rx fires), and post-wildfire vegetation recovery (14 Rx fires). Our approach requires only the spatial footprint and timing of the fuel treatments, thus enabling regional to nationwide analyses using a large number of fuel treatments to quantify the general efficacy of fuel treatments across a variety of treatment types, fuel types, and topography.

2:30pm - 2:45pm

Winter coherence as an indicator of fire-influenced vegetation for mapping and monitoring Canada’s Sub-arctic wetland ecosystem extents

Laura Dingle Robertson¹, Emily Dyer¹, Sarah Banks¹, Marco van der Kooij²

¹Environment and Climate Change Canada, Canada; ²Van der Kooij Consult

Ecosystem extent has been selected as an indicator of biodiversity under the Kunming-Montreal Global Biodiversity Framework. With wildfires on the rise in Canada and around the World there is a need to understand their impact on ecosystem extent for Framework reporting requirements. In Canada’s Arctic and Sub-arctic the growing season is much shorter than at lower latitudes, resulting in few cloud-free optical images and making it a challenge to monitor the impacts of fire and recovery at fine temporal resolutions. Synthetic Aperture Radar (SAR), particularly winter coherence, can be an indicator of ecosystem extent in the sub-arctic because burned areas will exhibit patterns that reflect more dynamic freeze-thaw cycles in the winter period than non-burned areas. Winter coherence and phase was calculated for 39 Sentinel 1 images over the time periods of 2017-2018, 2018-2019, 2019-2020, 2020-2021 in Northern Manitoba, Canada. When relating these values to known fire areas it was found that there is a distinct difference between areas recently burned as opposed to those burned further in the past. These data will be used as predictor variables in a Random Forest ecosystem classifier with outputs of overall accuracy and Shapley feature importance assessed.

2:45pm - 3:00pm

Boosting Accuracy with the Synergistic Use of Sentinel-1, Sentinel-2, and EnMAP Data for Land Cover & Crop Type Mapping in Greece

Christina Karakizi^1,2, Eleni Sofikiti^1,2, Olympia Gounari², Zacharias Kandylakis², Konstantinos Karantzalos²

¹Hellenic Space Center, Greece; ²Remote Sensing Laboratory, National Technical University of Athens

Accurate and frequently updated land cover maps are vital for various scientific communities, as well as for public and regional authorities, supporting decision-making, planning, sustainable development, and natural resources management. Regular monitoring and mapping also play a crucial role for agricultural areas, particularly considering the projected population growth and shifting dietary patterns in many of the fastest growing regions of the world, that pose significant challenges for humanity. Over the past decade, the availability of Sentinel-1 and Sentinel-2 data has significantly increased the potential for high spatial resolution land cover mapping using dense time series. However, mapping croplands and distinguishing between crop types remains a more complex task, often requiring data of higher spatial, spectral, and temporal resolution. In this context, this study aims to evaluate the synergistic use of multi-temporal data from Sentinel-1, Sentinel-2, and EnMAP data for detailed land cover and crop type mapping in agriculural regions of western Greece.

1:30pm - 3:00pm

WG IV/1B: Spatial Data Representation and Interoperability
Location: 715B

1:30pm - 1:45pm

Zonology: An Ontology-Based Framework for Harmonizing Zoning Semantics Across Multi-Jurisdictional Greater Toronto Area (GTA) Planning Systems

Arian Mesbahian¹, Amin Sarang², Arash Shahi³, Mojgan Jadidi¹

¹Department of Civil Engineering, Lassonde School of Engineering, York University, Canada; ²DevNext Inc., Canada; ³AECO Innovation Lab Inc., Canada

Urban development in the Greater Toronto Area faces significant challenges because zoning abbreviations and terminology vary widely between municipalities. This provides the background and motivation for the study, as labels such as “R2” in Toronto and “R2 S” in Markham appear similar yet represent different permissions and development standards, creating confusion and slowing planning workflows in a region with growing housing pressures. The problem addressed in this research is the absence of a unified, machine-readable framework that standardizes zoning terminology across municipalities, which limits automated compliance checking, GIS integration, and cross municipal comparison. The objective of this work is to create Zonology, an ontology-based framework that harmonizes zoning abbreviations, permitted land uses, and development standards, beginning with the City of Toronto and the City of Markham. The methodology follows the Linked Open Terms approach, using the Web Ontology Language to encode zoning by laws, land use categories, development standards, and spatial relationships. The model is evaluated through reasoning tasks, competency questions, and semantic alignment tests to ensure clarity, consistency, and interoperability. The results show that Zonology successfully aligns more than sixty zoning categories and over one hundred fifty land use permissions, enabling consistent semantic interpretation and cross municipal queries. The overall significance of this work is that the ontology improves regulatory clarity, strengthens data driven planning, and provides a scalable foundation for harmonized zoning governance across the Greater Toronto Area.

1:45pm - 2:00pm

GeoGraphJSON: A lightweight semantic data model integrating spatial geometry and graph connectivity for AI-driven spatial reasoning

Muhamad Alrajhi¹, Christian Heipke², Mohammed Afroz Khan¹

¹RASIKH Institute for Education and Training, Riyadh; ²Leibniz Universität Hannover

Urban systems are increasingly complex, interconnected, and dynamic, yet most geospatial data models continue to represent them as static geometric layers with limited support for explicit relationships and semantics. This restricts advanced spatial reasoning, network analysis, and AI-driven applications.

This paper introduces GeoGraphJSON, a lightweight semantic data model that extends GeoJSON by integrating spatial geometry with graph-based connectivity. The framework represents spatial entities as nodes and explicitly encodes relationships as typed edges, enabling unified representation of geometry, topology, and semantics within a single interoperable structure. A hierarchical Unique Identifier (UID) system ensures consistent lineage and cross-layer integration across administrative, transportation, and urban asset datasets.

The approach is validated using a large-scale urban dataset from Riyadh, comprising over 10,000 nodes and 13,000 edges. Graph-based analysis demonstrates realistic spatial patterns, including right-skewed degree distribution, strong network connectivity, and identifiable community structures. These results highlight the ability of GeoGraphJSON to capture hierarchical organization and functional relationships while supporting efficient analytical workflows.

By bridging geometry-centric GIS models and graph-based approaches, GeoGraphJSON provides a scalable foundation for urban analytics, digital twins, and GeoAI applications, enabling geospatial systems to evolve from static representations toward intelligent, relationship-aware spatial models.

2:00pm - 2:15pm

Urban Morphological Clustering of Cairo, and Makkah A Comparative Analysis Using Spatial Networks

Ahmad M. Senousi^1,2, Wael Ahmed^1,2, Adel Elshazly¹, Moustafa Baraka³, Walid Darwish^1,2

¹Geomatics Engineering Lab, Public Works Department, Cairo University, Giza 12613, Egypt;; ²NAMAA for Engineering Consultations, Dokki , Giza 12612, Egypt; ³Civil Engineering Program, German University in Cairo 11835, Egypt

Urban morphology quantitatively reveals how distinct historical and functional drivers shape city form. This study employs a computational morphometric approach using the Momepy library to analyze and compare the urban structures of Cairo, Egypt, and Makkah, Saudi Arabia. These cities represent paradigmatic cases: Cairo exemplifies long-term, organic layering, while Makkah demonstrates rapid, purpose-driven transformation for religious pilgrimage. We calculated key metrics—including tessellation area, convexity, elongation, equivalent rectangular index, and edge betweenness centrality—for building footprints and street networks sourced from OpenStreetMap. Results show Cairo possesses a heterogeneous, polycentric fabric with complex plot shapes and a distributed street network, reflecting its layered history. Conversely, Makkah exhibits a more monocentric, consolidated form with standardized building geometries and a hierarchical street network channeling movement toward its core. The findings demonstrate that quantitative morphology effectively captures how Cairo's organic evolution and Makkah's centralized planning produce fundamentally different, yet equally revealing, urban spatial structures, offering a replicable framework for cross-city analysis in the region

2:15pm - 2:30pm

An Assessment of Spatiotemporal Dynamics of Urban Illumination and Socioeconomic Patterns in Delhi Using VIIRS Nighttime Light Data

Manisha Kumari¹, Aditya Kumar Thakur²

¹Tilka Manjhi Bhagalpur University, India; ²Indian Institute of Technology Roorkee, India

Urban illumination, as captured through Nighttime Light (NTL) data, serves as a powerful indicator of human activity, infrastructure development, and socioeconomic progress in rapidly growing cities. However, previous studies on Delhi have largely focused on temporal NTL trends without integrating multi-year statistical and spatial analyses to reveal underlying urban and socioeconomic dynamics. This study investigates the spatiotemporal dynamics of urban illumination and development over Delhi using VIIRS Day/Night Band (DNB) NTL data for the years 2015, 2020, and 2025. NTL intensity was used as a proxy for urbanization and socioeconomic activity. Monthly composite datasets for January of each year were processed, clipped to the Delhi administrative boundary, and analyzed using statistical, temporal, and correlation-based methods. The results revealed a slight decline in mean NTL intensity from 26.34 in 2015 to 24.95 in 2025, indicating stabilization in overall light emissions may be due to the adoption of energy-efficient technologies. However, the maximum and range values increased markedly (166.85 to 228.04), signifying intensified illumination in high-activity commercial and infrastructural zones. Temporal change analysis showed balanced positive and negative illumination shifts, with over 50% of pixels exhibiting moderate growth during 2020–2025. Strong Pearson and Spearman correlations (r = 0.83–0.92; ρ = 0.910.95) confirmed the temporal consistency of illumination distribution. The socioeconomic assessment highlighted spatial disparities in light intensity might be corresponding to varying economic activity levels. Overall, the study demonstrates that VIIRS-derived NTL data provide an effective and robust approach for monitoring urban growth, socioeconomic variability, and sustainable lighting transitions in metropolitan environments.

2:30pm - 2:45pm

Artificial Intelligence for territorial interpretation: from image clustering to perceptual mapping

Fabio Bianconi, Marco Filippucci, Chiara Mommi

University of Perugia, Italy

The research investigates artificial intelligence as a device for the automatic interpretation of landscape, reframing representation not as a neutral reproduction but as a cognitive operation in which perception, description, and evaluation converge. Moving from the assumption that landscape is not an objective given but a culturally and perceptually constructed form, the study proposes a fully data-driven methodology based on geolocated images. Through a systematic grid sampling, street-level panoramic views are collected and processed within a multimodal pipeline integrating visual analysis, language models, and multi-agent evaluation. Images are first translated into textual descriptions and semantically clustered, allowing territorial classes to emerge from the data rather than from predefined taxonomies. In parallel, a simulated cognitive framework, structured through four agent profiles, produces evaluative scores and textual judgments, later analysed through sentiment detection. The integration of these layers generates a georeferenced dataset from which a perceptual cartography of the territory is constructed. Applied to the urban context of San Giustino (Italy), the method reveals a continuous gradient from dense urban cores to rural landscapes, while exposing differentiated perceptual readings across observer profiles. Within this framework, artificial intelligence does not replace human interpretation; it operates as an epistemic extension, transforming the landscape into a distributed field of comparable perceptions, where representation becomes a computable form of shared knowledge.

2:45pm - 3:00pm

Towards the Development of a Metadata-driven Usability Awareness Prototype for Interoperable GIS Operation Design

Jumg-Hong Hong, How-Han Chang, Ting-Yi Lee, Ting-Yu Chang

Dept. of Geomatics, National Cheng Kung University, Chinese Taipei

This study focuses on bridging usability information between data providers and data users through standardized metadata. By further integrating standardized metadata with geographic information system operation design, the operations gain automated and awareness capabilities, allowing usability information based on data specifications and quality considerations to be incorporated into relevant processes, thereby avoiding erroneous decisions. The research references international standards such as ISO 19115 and ISO 19157 to meet the requirements of open geographic information technologies.

1:30pm - 3:00pm

IvS3B: Advancing Digital Twins for Urban Environments: Approaches to Mapping, Monitoring, and Management
Location: 716A

1:30pm - 1:45pm

Digital Building Analysis (DBA): Cloud-GIS-Based 3D Building Modelling and Multi-Agent AI Analytics Using Gaussian Splatting and Google Maps Platform

Kyle Gao¹, Jonathan Li^1,2, Linlin Xu³, David Clausi¹

¹Department of Systems Design Engineering, University of Waterloo, Canada; ²Department of Geography and Environmental Management, University of Waterloo, Canada; ³Department of Geomatics Engineering, University of Calgary

This invited talk presents Digital Building Analysis (DBA), a unified framework for intelligent, cloud-based building-scale reconstruction and analysis. Building on our prior advances in Gaussian Splatting for photorealistic 3D scene generation and the Gaussian Building Mesh (GBM) framework for accurate mesh extraction, DBA introduces a new layer of integration between cloud mapping and artificial intelligence. The system connects directly to Google Maps Platform APIs to retrieve geospatial data, imagery, and elevation models from a building’s address or coordinates, while employing Gaussian Splatting to reconstruct high-fidelity 3D models from multi-view imagery. This combination enables seamless digital twin creation without ground-based measurements or proprietary datasets and can be operated through natural language queries, allowing users to simply describe a location or request a building analysis conversationally.

The key component of DBA leverages multi-agent large language models (LLMs) for both natural language interfacing and data interpretation. These models autonomously generate Google Maps API calls, interpret retrieved imagery, extract visual features, and compose semantic building descriptions. Working in tandem, the agents merge 3D geometry, visual realism, and semantic understanding into a single automated process.

Together, these innovations mark a major step forward in Canada’s AI-enhanced remote sensing research, enabling interactive, query-driven urban analytics and advancing the next generation of intelligent digital twins for sustainable urban development.

1:45pm - 2:00pm

A Comprehensive Evaluation of the Spatial Accuracy of Building Gaussian Splatting

Samuel McNally¹, Shabnam Jabari¹, Heather McGrath², Mark Masry³

¹Dept. of Geodesy and Geomatics Engineering, University of New Brunswick, Canada; ²Natural Resources Canada; ³Modelar

3D building models are powerful visual tools, typically generated with well-established image-matching or LiDAR methods. However, they do not capture the view-dependent colour characteristics possible with Gaussian splatting. Despite the visual potential of Gaussian splatting, there is limited knowledge on its spatial accuracy and influencing factors, particularly for buildings. To address this gap, a two-building dataset was collected with terrestrial laser scans, images, phone LiDAR, and target points, and the visual and spatial effects of numerous factors were analyzed. These factors included the source and quality of the input camera poses and point cloud, the number of images and training iterations, and the Gaussian splat method. Gaussian splats were trained from open source and commercial image-based reconstruction methods, COLMAP and Pix4D, and phone LiDAR reconstructions. Applying Gaussian splatting to these inputs had minimal impact on the target points and the overall structure of the buildings, but the positions of Gaussians deviated from the initial point cloud, particularly before 15,000 iterations, resulting in more floaters and lower spatial accuracy. Image-based reconstruction methods outperformed phone LiDAR methods on visual and spatial metrics. Cleaning COLMAP point clouds considerably decreased Gaussian floaters, while downsampling input point clouds increased the percentage of floaters and yielded similar visual results. 2D Gaussian splatting provided geometric constraints, removing some floaters, but sacrificed visual quality. Increasing the number of images to three loops around the building improved visual and spatial results. Overall, the spatial accuracy of building Gaussian splatting was heavily dependent on the factors studied.

2:00pm - 2:15pm

Geopose-enabled Urban Digital Twin for Rapid Road Quality Analysis using Geo-AI

Soheil Sabri

University of Central Florida, United States of America

Urban Digital Twins (UDT) are vital tools for smart city development, enabling data-driven management and analysis of urban infrastructure (Sabri and Witte, 2023). A persistent challenge in realizing the potential of UDTs is the interoperability of disparate geospatial datasets, particularly camera imagery and sensor data, requiring precise synchronization, georeferencing, and integration. Existing implementations often

rely on costly, proprietary hardware, limiting scalability and adoption, especially for organizations constrained by limited budgets (Thakkar et al., 2025). This research addresses the need to develop a cost-effective, standardized framework to capture, integrate, and standardize camera imagery and geospatial metadata for Machine Learning (ML)-driven analysis within spatially enabled UDTs.

2:15pm - 2:30pm

Towards Roof Material Identification by Fusing Aerial and Street View Imagery

Faezeh SoleimaniVostikolaei¹, SeyedPooya Soofbaf¹, Travis Moore², Shabnam Jabari¹

¹University of New Brunswick, Canada; ²Construction Research Centre, National Research Council Canada

Roof material identification is a critical component of energy-aware 3D city modeling, supporting applications such as thermal analysis, climate resilience, and digital twins. Traditional approaches relying solely on aerial imagery struggle with shadows, low contrast, and spectrally similar roof materials. This study introduces a dual-branch deep learning framework that combines high-resolution aerial orthoimages with GoPro-based street-view imagery to overcome these limitation and improve roof material classification.

The aerial branch employs a ResNet-18 model fine-tuned on 120 manually labelled roof samples in New Brunswick, Canada, covering four material classes: asphalt, metal, membrane, and gravel. The street-view branch utilizes GoPro field-survey images, where roof regions are extracted using the Segment Anything Model (SAM) before classification with a second ResNet-18. Although street-view imagery captures only materials visible from ground level, it offers rich textural information that complement nadir imagery.

Because the two modalities are unpaired, fusion is performed at the decision level using learnable weights to combine the softmax probabilities of both branches. Experimental results show that street-view imagery achieves 90.9% accuracy, outperforming aerial imagery alone (77.8%). The combined bimodal framework leverages complementary modality strengths, resulting in improved detection performance for all roof material classes.

2:30pm - 2:45pm

Evaluating Comparative Performance of 2D and 3D Feature Detection Models for Digital Twinning

Colin Richardson¹, Shabnam Jabari¹, Liam Delong¹, Darian Crouse¹, Travis Moore², Derek Lichti³

¹University of New Brunswick, Canada; ²National Research Council Canada; ³University of Calgary, Canada

This study investigates the comparative performance of state-of-the-art 2D and 3D feature-detection models applied to multimodal airborne datasets for digital-twin generation. Using RGB, LiDAR, and nighttime thermal imagery collected over the University of New Brunswick’s Fredericton campus, a fused RGB–LiDAR–thermal point cloud was created to support building-scale analysis of energy-relevant features, specifically windows and doors. Three 2D object-detection models Faster R-CNN, Mask R-CNN, and YOLOv8 were applied to both RGB and thermally registered imagery, incorporating phase-congruency-based alignment to address differences in sensor geometry and spatial resolution.

Complementing the 2D analysis, three 3D semantic-segmentation models KPConv, PointCNN, and RandLA-Net were implemented to evaluate geometry-driven, order-aware, and scalable point-cloud classification strategies using multimodal attributes. The dataset was divided into 70% training and 30% testing, and evaluated using standard metrics such as accuracy, mean Intersection-over-Union, and per-class F1 score. Preliminary results for the 2D methods have been realased in the abstract, with further evaluation of all models currently underway.

The objective of this work is to establish a unified framework for understanding how 2D and 3D feature-detection approaches perform under low-light and thermally dominant conditions, where conventional RGB-based workflows often fail. The outcomes of this study will support improved digital-twin development for building-energy diagnostics and contribute to future thermal-efficiency modeling workflows in partnership with the National Research Council of Canada.

1:30pm - 3:00pm

Forum1A: Observing the Earth as One: Making space for everyone in Remote Sensing, Photogrammetry, and Spatial Information Science
Location: 716B

1:30pm - 3:00pm

Youth Forum: Why Join? Early Career Engagement in Professional Associations
Location: 717A

Awards Ceremoney for the ISPRS Student Consortium Excellence Award

1:30pm - 3:00pm

InS1: Industry Tech Session
Location: 717B

3:00pm - 3:30pm

Afternoon Coffee Break
Location: Exhibition Hall "E"

3:30pm - 5:15pm

ThS4A: Toward Smart Forests: Emerging Tools in Remote Sensing, Artificial Intelligence, and Field Robotics
Location: 713A

3:30pm - 3:45pm

AI-Enabled Forest Inventory in TerraScan: integrating Georeferencing, Species Identification, and Volume Computation

Anna Shcherbacheva, Helen Rost, Ana Puttonen, Arttu Soininen

Terrasolid LTD, Hatsinanpuisto 8, 02600, Espoo, Finland

The Terrasolid software suite provides an automated and scalable framework for large-scale LiDAR data processing, widely adopted in both national and private forest inventories. Its unified processing pipeline covers all essential steps—from point cloud import and georeferencing to ground classification, object detection, tree segmentation, and computation of individual-tree attributes such as diameter at breast height (DBH), height, volume, and tree species. Georeferencing is initially performed in TerraScan using signal markers or automatically detected tree trunks, with optional refinement in TerraMatch, which corrects angular misalignments between flight lines.

Following object classification, individual trees are extracted from points labeled as trees. The semi-manual Group Inspection tools support efficient correction of segmentation errors, such as merged or over-segmented trees, after which stem-wise metrics are automatically updated. These conventional modules rely on optimized algorithms capable of processing hundreds of millions of points within minutes.

A recent innovation, the Tree Species tool, introduces one of the first AI-based extensions within Terrasolid software. It employs a machine learning approach that integrates 2D raster-based features with 3D point cloud descriptors to achieve accurate tree species identification. Validation was conducted using the FOR-species20K dataset, comprising 33 species collected worldwide. Among several tested classifiers, the Histogram Gradient Boosting Classifier (HGBC) achieved the highest accuracy. To mitigate class imbalance, multiple side-view rasterizations and SVM-SMOTE oversampling were applied, significantly improving the separability of underrepresented species and overall classification robustness.

3:45pm - 4:00pm

Spatiotemporal Foundation Model for Aboveground Biomass Estimation: A case study in Mixedwood Plains Ecozone, Ontario, Canada

Zilong Zhong¹, José Bermúdez¹, Dominic Cyr², Alemu Gonsamo¹

¹McMaster University; ²Environment and Climate Change Canada

Traditional aboveground biomass estimation for forested areas relies on allometric equations (Návar, 2009), which use input variables such as diameter at breast height (DBH), tree height, and tree species or broader taxonomic group. Although allometric equations can estimate the biomass of individual trees, and stand-level equations exist for larger scales, they often require extensive field data, making them less suitable for densely clustered or remote forests. However, satellite images provide increasingly detailed global observations of forested areas, and spaceborne lidar data like GEDI (Duncanson et al., 2022) provide accurate measurements for canopy height across different ecozones worldwide. In recent years, foundation models (FMs) inspired by large language models (Vaswani et al., 2017) have become the new paradigm to leverage large amounts of unlabelled data through self-supervised pre-training and have shown capacity to benefit multiple downstream tasks. In this work, we adopt the Granite foundation model (Muszynski et al., 2024) as a baseline to improve aboveground biomass estimation on different satellite data, using the Mixedwood Plains Ecozone (MPE) as a case study. We also explore adding temporal, geospatial, and spatiotemporal features and validate the proposed spatiotemporal foundation model with field sampling plots.

4:00pm - 4:15pm

Improving Tree Species Detection for Operational Forestry: The Role of Dataset Design

Mirela Beloiu, Khatereh Meshkini, Zhongyu Xia, Severin Broch, Verena C. Griess

Department of Environmental Systems Science, Institute of Terrestrial Ecosystems, ETH Zurich, 8092 Zurich, Switzerland

Accurate detection and mapping of individual trees and their species are vital for sustainable forest management. Traditional field-based inventories remain the golden standard in forest monitoring, but are increasingly overwhelmed by temporal, spatial and accessibility constraints. Remote sensing offers faster, repeatable, and high-resolution data that complement and scale beyond field inventories. However, species-level detection remains difficult due to overlapping crowns, and spatial mismatches between crowns and trunks. Deep learning (DL) methods, particularly convolutional neural networks, have advanced crown delineation by automatically learning spatial and spectral patterns from imagery. Yet, their success depends heavily on dataset quality, class balance, and diversity. To address this, we applied a DL object detection framework for tree crown and species detection in Swiss forests and evaluate how dataset composition and training strategies influence accuracy and generalization. We test three dataset configurations: (1) an unbalanced masked dataset, (2) a class-balanced masked dataset, and (3) a mixed dataset combining masked and unmasked imagery. Results show that class balancing improved accuracy for both dominant and minority species, while mixed data enhances generalization.

4:15pm - 4:30pm

Self-Supervised Leaf-Off Segmentation of Tree Functional Types and Buildings from Airborne NIRGB and LiDAR Data in Southern Ontario

Jose David Bermudez Castro¹, Zilong Zhong¹, Dominic Cyr², Alemu Gonsamo¹

¹McMaster University, School of Earth Environment Society, Canada; ²Environment and Climate Change Canada

High-resolution airborne sensing enables joint mapping of urban infrastructure and forest composition at ecological scales. This study presents a self-supervised segmentation framework that fuses 0.5 m Near-Infrared + RGB (NIRGB) orthophotography from the Ontario Imagery Program (2013–2026) with Canopy-height models (CHM) derived from the Ontario Elevation Mapping Program (8–10 pulses m⁻², 5–10 cm vertical accuracy). Imagery was collected during the leaf-off season, providing strong spectral–structural contrast between evergreen and deciduous crowns, to produce high-fidelity land- cover segmentations that differentiate vegetation functional types and built structures as a prerequisite for tree-level biomass and carbon-stock estimation.

4:30pm - 4:45pm

Updating Forestry Road networks in Ontario Using Single Photon LiDAR and Deep Learning-enhanced algorithms

Hamid Ghanbari, Emmanuelle Baby-Bouchard, Catherine Chagnon, Alexandre Morin-Bernard, Alexis Achim

Department of Wood and Forest Sciences, Université Laval, Québec, Canada

Spatially accurate forestry road networks are essential for effective forestry operations, sustainable resource management, and conservation. Current forestry road databases in Ontario have significant location errors due to limitations and human errors associated with conventional road delineation approaches such as GPS-based field surveys and photointerpretation. A previously developed algorithm, which used airborne laser scanning (ALS) data, successfully corrected road locations in Quebec. However, its design limited its application in other landscapes, ALS instruments, and road construction and maintenance practices. This study advances that algorithm by integrating a deep learning component to improve its robustness and scalability for diverse forest conditions. A hybrid workflow combines the original friction-based conductivity surface with a road probability surface generated by an Attention Residual U-Net model trained on 11 LiDAR-derived features using road segments from five forest sites in Quebec. The enhanced workflow was applied to two forest management units in Ontario: Nipissing and Dryden. The results showed significant improvement in road alignment when compared to the existing provincial data and the outputs from the earlier automated approach. The deep learning-enhanced algorithm lowered mean positional error by 78% (from 9.36 m to 2.07 m) and increased the proportion of road centerline points within 3 m of the reference from 66.7% to 87.2%. These improved centerline accuracies will further support a scalable tool for rapid and accurate forestry road network mapping, which in turn will aid sustainable forest management and conservation planning at both provincial and national scales.

4:45pm - 5:00pm

Attention-guided Multi-Scale Deep Learning Approach for Tree Health Detection Using Very High-Resolution Aerial Imagery

Khatereh Meshkini, Mirela Beloiu, Zhongyu Xia, Verena C. Griess

Department of Environmental Systems Science, Institute of Terrestrial Ecosystems, ETH Zurich, Zurich, Switzerland

Monitoring tree health is essential for detecting early signs of stress, defoliation, and potential mortality, supporting effective forest management, ecosystem conservation, and early warning systems. Advances in deep learning have enabled automated analysis of trees in remote sensing imagery through object detection methods that leverage both spectral and spatial information. However, assessing tree defoliation remains challenging, as subtle differences between defoliation levels make accurate classification difficult. To address this, we propose the hybrid ResNet-Swin Transformer, an object detection architecture built on a Faster R-CNN framework, incorporating a fused ResNet and Swin Transformer backbone with attention-based feature fusion. This design captures rich, multiscale representations by combining convolutional and transformer-based features and progressively refines them through channel-wise attention blocks for robust detection and classification. The architecture was evaluated on a very high-resolution aerial dataset from Switzerland, partially annotated with five classes: Conifer (healthy), Conifer (defoliated), Broadleaf (healthy), Broadleaf (defoliated) and Dead. Comparative experiments with state-of-the-art object detection and classification methods demonstrate that the proposed approach achieves higher accuracy and robustness, highlighting its potential for precise and reliable automated tree health monitoring.

5:00pm - 5:15pm

Fine-grained vegetation segmentation in complex urban park environments using a deeply supervised parallel SegFormer

Haixin Zhang, Qinying Zhang

Department of Landscape Architecture, Tianjin University, 300072 Tianjin, China,

Accurate vegetation mapping in complex urban environments is essential for ecological monitoring, biodiversity assessment, and sustainable park management. However, fine-grained vegetation segmentation remains challenging because of the high diversity of plant species, overlapping canopies, and the interference of artificial objects. To address these challenges, a deeply supervised parallel architecture based on the SegFormer backbone was proposed in this paper. The model incorporated a SegFormer-ASPP-low-level (SAL) head, which fused high-level semantic representations, multi-scale contextual information, and low-level spatial details through a parallel decoding mechanism. Two auxiliary heads, a pyramid pooling module (PSP) and a fully convolutional network (FCN), were added to provide deep supervision and improve the recognition of blurred boundaries and rare categories. High-resolution UAV imagery was used to perform fine-grained semantic segmentation of 17 vegetation categories. The dataset included multiple tree species as well as non-tree classes such as Nelumbo sp. (lotus) and dead trees. Experimental results showed that our model achieved a mean intersection over union (mIoU) of 73.57%, outperforming architectures such as SegFormer-b1, DeepLab v3+, ConvNeXt and SCTNet. Visual analysis further demonstrated the model's robustness in complex urban park scenes, showing superior boundary delineation, improved recognition of small and spectrally similar species, and resilience to interference from artificial objects like plastic lawns and landscape lighting. The proposed approach offers valuable insights for precision forestry, ecological monitoring, and intelligent UAV-based remote sensing applications.

3:30pm - 5:15pm

WG IV/5: Extended Reality and Visual Analytics
Location: 713B

3:30pm - 3:45pm

Towards evaluating the effects of visualization and task types on urban planning decisions

Bettina Lobriza-Kalt¹, Elif Gurcinar², Halil Ibrahim Senol³, Arzu Çöltekin²

¹Department of Geography, University of Zurich, Switzerland; ²Institute of Interactive Technologies, University of Applied Sciences and Arts Northwestern Switzerland (FHNW), Switzerland; ³Department of Geomatics, Harran University, Turkey

This study compares visualization types (3D, 2D, Oblique 3D, and Combined 2D+3D, coupled in a pairwise fashion for different tasks and scenarios), investigates their influence on decision-making across selected urban planning tasks (Site selection, Scenario Selection), and Distance Estimation as a baseline task that we assumed is relevant in both. Our goal was to inform the participatory urban planning process. In a controlled user study with 40 participants, we evaluated whether visualization type affects decision outcomes and distance estimation, complemented by participants’ visualization preferences before and after the experiment. The results confirm the previously well documented evidence that participants are considerably more successful with distance estimation with 2D visualizations, and their decisions vary depending on the examined visualization and task types. We observe that different formats support different task requirements, i.e., each visualization type exhibits distinct strengths depending on the task. These findings indicate that visualization choice in urban planning should be adapted to the task and context rather than treated as an interchangeable artifact.

3:45pm - 4:00pm

Collaborative Wildfire Planning with Agentic AI: Automated Simulation and Mixed-Reality Visualization for Community Engagement

Haowen Xu¹, Sisi Zlatanova¹, Ruiyu Liang², Ismet Canbulat²

¹GRID, School of Built Environment, UNSW Sydney, NSW 2052 Australia; ²School of Minerals and Energy Resources Engineering, UNSW Sydney, NSW 2052 Australia

With the rising number and severity of WUI wildfire episodes and the necessity to improve community preparedness, planning strategies have to be devised that integrate foresight into wildfires, with active community participation. This paper presents an intelligent collaborative environment that seeks to engage citizens, planners, and emergency services in co-creation of fire-resilient strategies through Agentic AI-driven wildfire simulations and mixed-reality visualization. A serious game environment is designed for hands-on exploration of alternative wildfire spread scenarios and community-scale prevention practices such as prescribed burning, fuel treatment through vegetation control, and structural hardening measures. The objective is to promote public awareness and adaptive behavior as well as provide science-based operational decision support to emergency responders in evaluating tactical options specific to terrain, infrastructure, and fuel conditions of the locale. The system operates on a Large Language Model (LLM)–powered agentic AI architecture designed to automate and orchestrate 2D and 3D wildfire simulations, providing guidance that supports users from diverse technical backgrounds. To give the results of the simulations, 3D web visualization and immersive holographic display were used to enable cycles of iterated explorations into fire spread in dense urban settings. With AI-assisted wildfire intelligence, this particular flow works through a set of intuitive interaction mechanics so that communities can evaluate risk levels, weigh alternatives for mitigation, and better prepare for an actual fire event.

4:00pm - 4:15pm

Situated augmented reality for urban planning: A privacy-aware on-device localization pipeline

Muhammad Alfakhori, Sven Schneider, Volker Coors

Stuttgart Technical University of Applied Sciences, Germany

Accurate spatial alignment is a key requirement for situated Augmented Reality (AR) in urban planning, where citizens and planners can visualize proposed designs in real outdoor environments. However, existing AR localization approaches often rely on smartphone GNSS, vendor-specific cloud anchors, or cloud-based visual positioning, which introduce accuracy limitations, privacy concerns, or dependencies that restrict their use in participatory planning workflows. This paper presents a privacy-aware on-device localization pipeline for outdoor urban planning scenarios. The approach aligns LiDAR scans captured on smartphones with pre-scanned reference point cloud tiles to enable stable and accurate placement of urban planning models. Approximate GNSS is used only to retrieve a relevant reference tile, while all preprocessing and registration steps are performed locally on the device. The pipeline combines voxel downsampling, local geometric descriptors, and global registration to estimate alignment without relying on GNSS for pose estimation or on cloud-based visual localization services. A mobile demonstrator was developed to support situated AR in urban planning scenarios, allowing users to explore design proposals directly in context. Initial validation under controlled conditions showed that the system can recover translations and rotations with errors on the order of a few centimeters, while processing times remained suitable for mobile use. The approach was also deployed in an urban planning case study and enabled stable outdoor visualization of planning elements on-site.

4:15pm - 4:30pm

What Features of the Street Influence Visual Walkability? An Innovative Approach Using Cinematic Virtual Reality

Chongan Wang, Florian Massuyeau, Vincent Tourre, Thomas Leduc, Myriam Servières

Nantes Université, ENSA Nantes, Ecole Centrale Nantes, CNRS, AAU-CRENAU, UMR 1563, F-44000 Nantes, France

We present a new method for assessing visual walkability using 360° videos and an eye-tracking in Cinematic Virtual Reality (CVR). Visual walkability refers to the walkability perceived by pedestrians through visual stimuli in the urban environment. Our method uses semantic segmentation, viewport exposure, gaze measures, and a custom walkability questionnaire, enabling comparison between scene content, participant's viewport, and their gaze focus. The 10 videos used, including 2 calibration videos, exhibit distinct semantic characteristics, validated by segmentation analysis. Analysis of the 35 participants’ responses shows that walkability ratings at the video level correlate with several environmental parameters (e.g., road, sidewalk, sky) consistent with previous studies. However, these parameters do not have a similar influence in gaze-based visual attention analysis within the CVR setting, suggesting that CVR attention would requiere further work. Furthermore, our results suggest that unexpected semantic classes may also play a role in perceived walkability and should be considered exploratory pending further validation. This paves the way for further research on using CVR as an assessment tool for visual walkability and for developing methodological guidance on which visual cues are robust across measures (content/viewport).

4:30pm - 4:45pm

Cartography-oriented Visual Design of Hydrodynamic Ocean-Physics Datasets

Brad W. Robertson, Steffen D. Frey, Christian Kehl

Bernoulli institute, Rijksuniversiteit Groningen, The Netherlands

Oceanographic data and their related simulation have a key role in addressing EU and UN societal challenges in marine environments. Visualising marine data is challenging for different visual-communication intents and audiences, despite existing guidelines on the subject. A main visual-design limitation for existing techniques is the co-visualization of multiple hydrodynamic field attributes in an accessible, comprehensible and engaging manner. This paper addresses this limitation in two ways: first, existing techniques for cartographic-oriented design of waterlines are adopted and extended towards multivariate hydrodynamic field datasets. Secondly, experimental results on the intermixing different visual-channel mapping of hydrodynamic attribute data are presented in a case study on ocean-flow patterns around the Hebrides island chain (UK). The results demonstrate a simultaneous co-visualization of up to five unique, independent scalar attributes in a comprehensible manner while preserving the geographic context. Moreover, best-practice guidelines are stated in conclusion of the experimental case study to help oceanographic practitioners adopt the presented technology in their professional workflows.

4:45pm - 5:00pm

Night Sky Explorer VR

Maxim Spur¹, Philippe Deverchère², Michèle Atié³, Olivier Augereau¹, Edna Hernández González⁴

¹ENIB, Lab-STICC UMR 6285 CNRS, Brest, France; ²ScotopicLabs, Lyon, France; ³Archimmersion, Nantes, France; ⁴Univ Brest (UBO), Institut de Géoarchitecture, Brest, France

Artificial light at night (ALAN) degrades nocturnal ecosystems and complicates astronomical observation. Although all-sky imaging and GIS-based light-pollution mapping are well established in the analysis of light pollution, identifying local contributors to ALAN still requires time-consuming cross-comparisons, done in separate views, making light halo--source attribution slow and manual. We present an interactive system that addresses this gap by co-registering Sky Quality Camera all-sky imagery and OSM-derived candidate emitters (e.g., settlements, roads, aerodromes, industrial sites) in one observer-centered scene. The viewer is placed at the locations of the captured all-sky images in 3D digital terrain model-based scenes, realistically illuminated by the sky under selected conditions for an immersive view of nighttime scenarios. OpenStreetMap features are projected onto a surrounding sphere via inverse stereographic projection, with point markers and horizontal-extent indicators to support rapid visual matching between observed halos and plausible sources. Users can switch scenes and processed sky images, adjust projection parameters, and inspect scenes in VR or in an additional cylindrical projection for a panoramic desktop view. A companion web tool configures location classes and display ranges. The presented system primarily targets exploratory analysis, with its main contribution being the novel co-visualization of light sources and light halos; expert interviews positively validated this analytical focus. As a secondary outcome, the system's immersive first-person representation may also enrich educational communication and outreach on ALAN impacts.

5:00pm - 5:15pm

STAG: System for ouTdoor Augmented reality using GeoWebXR

Corentin Gautier, Mathieu Brédif

Univ Gustave Eiffel, Géodata Paris, IGN, LASTIG

Accurate and intuitive visualization of urban development projects is a persistent challenge in spatial planning and public participation. Recent advances in Extended Reality (XR) offer new opportunities to integrate geospatial data directly within the user’s real environment. This paper introduces GeoWebXR, an extension of the WebXR API designed to provide absolute georeferencing of the XR reference space via a standardized geopose. We present an outdoor proof-of-concept implementation that integrates a dual-antenna RTK GNSS receiver mounted on an XR headset. High-precision GNSS measurements are fused with the device’s local pose estimates to compute a consistent and accurate geopose, enabling decimeter-level alignment between virtual and physical environments.

Leveraging GeoWebXR, WebGL applications can render georeferenced 3D content in situ through a web browser. We demonstrate this capability using the iTowns geospatial visualization framework to deliver an XR experience for urban planning. The system supports both 1:1-scale in-situ visualization and reduced-scale overview modes, enabling seamless multiscale exploration of planning scenarios. To mitigate cognitive overload in dense urban contexts, we implement and evaluate several visualization and interaction strategies. We assess the usability and spatial appropriation enabled by our system, and discuss how it may support both expert analysis and citizen participation in urban planning processes.

3:30pm - 5:15pm

ICWG III/IIA: Planetary Remote Sensing and Mapping
Location: 714A

3:30pm - 3:45pm

LunarDEM2025: A near-global lunar topography model using fused multi-sensor data

Bin Liu^1,3, Xiang Shen^2,3, Kaichang Di^1,3

¹State Key Laboratory of Remote Science and Digital Earth, Aerospace Information Research Institute, Chinese Academy of Sciences; ²State Key Laboratory of Geodesy and Earth’s Dynamics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences; ³University of Chinese Academy of Sciences

LunarDEM2025 is a lunar topography model (±60°) created by fusing JAXA’s SLDEM2013, CAS’s CE2TMap2015 and NASA’s LOLA laser altimetry tracks. A tile-based, terrain-aware co-registration aligns photogrammetric DEMs to LOLA points, while a slope-constrained residual-compensation filter eliminates striping, voids and artefacts. The resulting dataset shows visibly smoother relief, smaller vertical biases and fewer tile-boundary discontinuities than its predecessor SLDEM2015. The product is ready for landing-site analysis, rover path planning and various other applications.

3:45pm - 4:00pm

1:1,000,000-scale Geologic Map of the Copernicus Quadrangle (LQ-58) on the Moon

Kai Zhu¹, Jianzhong Liu¹, Jingwen Liu¹, Jian Chen²

¹Center for Lunar and Planetary Science, Institute of Geochemistry, Chinese Academy of Sciences, Guiyang 550081, China; ²Shandong Key Laboratory of Optical Astronomy and Solar-Terrestrial Environment, Institute of Space Sciences, School of Space Science and Physics, Shandong University, Weihai 264209, China

After completing the 1:2,500,000-scale (1:2.5 M) lunar geologic atlas, our team began exploring the techniques and work flows for compiling larger scale lunar geological maps. Geologic maps integrate multidimensional information such as lithology, structure, and geologic age. Using the Copernicus crater region (0°–16°N, 30°W–10°W) as a case study, this research develops a 1:1,000,000-scale (1:1 M) regional geologic map and, in turn, explores the lithologic and structural classification systems applicable to lunar geologic maps at different scales. Based on imagery, topography, spectral, gravity, and sample data, we analyze geologic features including impact craters, impact basins, compositions, and structures, and subsequently delineate geological units. In the study area, the Copernicus crater and Imbrium basin represent the most prominent geological events and can serve as benchmarks for relative age determination. The cross-cutting relationships among geological units, together with existing absolute age constraints (from isotopic dating and crater size-frequency distribution chronology), are used to establish the stratigraphic relationships among mapped features and layers, ultimately producing a regional geologic map. Based on this map, the geological evolution history of the region is reconstructed.

4:00pm - 4:15pm

Quality Control for Large-scale Bundle Adjustment of Planetary Remote Sensing Images

Xun Geng, Pengying Liu, Honghai Ma, Jia Wang, Zhen Peng, Jin Wang, Yuying Wang, Yinhui Wang, Jianwei Xu, Guohua Chang, Can Feng

State Key Laboratory of Spatial Datum, Henan University, Zhengzhou, China, 450046

High-accuracy planetary mapping products are increasingly required for landing-site assessment, precision navigation, and future surface operations on the Moon and Mars. Although massive orbital remote sensing images are available, the geometric accuracy and spatial resolution of many existing mapping products is still insufficient for engineering applications. A major bottleneck is large-scale bundle adjustment, whose reliability is strongly affected by data quality, control network strength, as well as engineering experience. Compared with Earth observation photogrammetry, planetary mapping faces great challenges such as heterogeneous sensor models, complex illumination, sparse absolute control. This paper summarizes a practical quality control framework for large-scale bundle adjustment of planetary remote sensing images. The workflow is divided into four coupled stages: data preprocessing, control network construction, parameter setting, and accuracy evaluation. The framework is distilled from previous planetary mapping studies, open-source software platforms and our practical experience in processing tens of thousands of planetary images. Experiments using LRO NAC datasets demonstrate that satisfactory bundle adjustment results can be achieved when the proposed strategy is applied. The framework improves the overall efficiency, controllability, and reliability of large-scale planetary photogrammetric processing.

4:15pm - 4:30pm

Advances and Applications of Spatio-Temporal Intelligence in China’s Lunar and Mars Explorations

Kaichang Di¹, Zhaoqin Liu¹, Bin Liu¹, Man Peng¹, Wenhui Wan¹, Yexin Wang¹, Wai Chung Liu¹, Zongyu Yue², Sheng Gou²

¹Aerospace Information Research Institute, Chinese Academy of Sciences, China; ²Institute of Geology and Geophysics, Chinese Academy of Sciences, China

China has successfully carried out the Chang'e-1 to Chang'e-6 lunar missions and the Tianwen-1 Mars mission. In these missions, planetary photogrammetry and remote sensing technologies provide timely spatio-temporal information services across all phases of the missions, playing a crucial supporting role in ensuring the mission safety and scientific output. In the current era of artificial intelligence (AI), the deep integration of photogrammetry and remote sensing, geomatics, and artificial intelligence is gradually evolving into Spatio-Temporal Intelligence (STI). This paper presents an overview of the advances and applications of STI in China’s lunar and Mars explorations, and discuss the future directions of STI in deep space exploration.

4:30pm - 4:45pm

Eliminating Latitudinal Bias for Improved Correlation Between Microwave Data and (FeO+TiO₂) Abundance on the Moon

Liansheng Mei¹, Cai Liu¹, Zhiguo Meng¹, Roberto Bugiolacchi²

¹jilin university, China, People's Republic of; ²Macau University of Science and Technology, China, People's Republic of

Based on microwave radiometer (MRM) data from China's Chang'e (CE)-1/2 satellites, the Brightness Temperature Difference (TBD) technique offers a method for probing lunar regolith properties. However, its global application is compromised by systematic latitudinal biases and an unverified link to subsurface deposits.

This study introduces a novel parameter, the effective TBD (TBDeff), to overcome these limitations. The methodology first defines an equivalent TBD (eTBD), simulating the TBD for a location as if it were on the lunar equator to mitigate latitudinal effects. Recognizing inherent limitations in this simulation, a supplementary parameter (sup_TBD) is derived. TBDeff is then developed by integrating sup_TBD with the observed TBD (TBDobs) from CE-2 data.

Results demonstrate that TBDeff successfully removes latitudinal bias on a global scale, enabling clearer discrimination between lunar maria and highlands. Furthermore, extensive low-TBDeff signals in polar regions (>85°) suggest a new potential for detecting subsurface deposits in permanently shadowed areas.

Crucially, correlation analysis with (FeO+TiO₂) abundance reveals that TBDeff exhibits a significantly stronger relationship with regolith composition than traditional TBD or simple brightness temperatures (TB), especially at lower frequencies (reaching a correlation coefficient of 0.86 at 3.0 GHz). This confirms that (FeO+TiO₂) abundance is a key factor influencing the dielectric properties of subsurface materials, a effect previously obscured by latitudinal interference. The TBDeff method thus provides a more reliable tool for interpreting lunar composition from microwave data.

4:45pm - 5:00pm

Spectroscopy of lunar surface：remote sensing, In situ and laboratory measurements

Yunzhao Wu^1,2, Yimin Chai¹, Hengyue Jiao¹

¹Purple Mountain Observatory, Chinese Academy of Sciences, China, People's Republic of; ²Space Science Institute, Macau University of Science and Technology, Macau, China

This study analyzed and compared in situ spectral obtained by the Chang’E-3(CE-3) and Chang’E-4(CE-4) rovers, laboratory spectra of Chang’E-5(CE-5) soils and remote sensing spectra. The remote sensing spectra exhibit significantly darker and shallower absorption features than laboratory or in situ spectra, reflecting highly weathered nature of the undisturbed lunar surface. The spectral upturn even just right >2 μm can be contributed by thermal emission, revealing micro-scale temperature variations and low thermal inertia of lunar soils. CE-5 sample spectra show significantly higher reflectance and absorption depths than in situ and remote sensing, indicating samples are fresh and couldn’t represent pristine/true lunar surface. The CE-5 samples provide a new ground truth for estimating the TiO2 content of young basalts, which have the largest uncertainty in TiO2 content. Contrary to traditional opinion, CE-3 in situ spectra revealed that the effect on the spectral slope caused by space weathering is wavelength-dependent: the visible slope (VS) decreases not increases. The optical effects of space weathering and TiO2 are identical: both reduce albedo and blue the spectra. This suggests that a new TiO2 abundance algorithm is needed.

3:30pm - 5:15pm

WG IV/9A: Spatially Enabled Urban and Regional Digital Twins
Location: 714B

3:30pm - 3:45pm

The UoC Virtual Campus 3D Geospatial Data Infrastructure

Rijin Shaji¹, Calvin Hinzer², Elena Luisa Castrucci¹, Jessica Schuster³, Nils Naatz³, Julia Rauchalles¹, Christian Willmes¹

¹Institute of Geography, University of Cologne, Germany; ²Advanced Media Institute, Cologne University of Applied Sciences, Germany; ³Department for Digital Humanities, University of Cologne, Germany

The Virtual Campus Project at the University of Cologne (UoC) has as a main objective the creation of a highly detailed 3D model of the university campus and its publication and distribution through OGC 3D Tiles. Further objectives include the development of integrated applications leveraging this 3D model, such as a web-based 3D viewer, game engine-driven geospatial augmented reality (GeoAR) and virtual reality (VR) experiences, and an indoor positioning system utilizing 3D building models indoor geometries. This paper focuses on and details the methodology for developing and implementing the georeferenced 3D model and establishes an Open Geospatial Consortium (OGC) 3D Tiles-compliant Spatial Data Infrastructure (SDI). The main result is a Tool Suite or Software Framework and the description of the tool pipeline or workflows for collecting, creating and modelling the 3D geospatial data and publishing it as OGC 3D Tiles data. This framework ensures campus-wide 3D data accessibility through 3D Tiles standard clients, including Desktop GIS like ArcGIS or QGIS, game engines like Unity, Unreal or O3DE and Webmapping libraries like MapLibre, three.js or CesiumJS.

3:45pm - 4:00pm

BirdCV-LiDAR: A Multi-Modal Data Fusion Framework for Automated Sidewalk Infrastructure Assessment

Chan Young Koh¹, Marwan Abdelatti², Abdeltawab Hendawi¹

¹University of Rhode Island, United States of America; ²Providence College, United States of America

The assessment of sidewalk infrastructure for accessibility compliance is an important task in urban planning; however, traditional methods are often manual, subjective, and resource-intensive. This paper introduces BirdCV-LiDAR, a multimodal data fusion framework for an automated assessment of sidewalk infrastructure. The proposed approach integrates high-resolution bird's-eye-view (HR-BEV) imagery with aerial LiDAR point cloud (ALPC) data to automatically detect, measure, and assess sidewalk features for compliance with accessibility standards. By combining YOLO-oriented bounding box (OBB) models with precise LiDAR-based elevation data, the framework enables accurate dimensional and slope evaluations of sidewalk features, such as crosswalks and truncated domes. Validation with a 12-inch inclinometer shows that LiDAR-based slope measurements achieve 84.7% accuracy, with a root-mean-square error (RMSE) of 0.1152 meters for crosswalk width measurements. The framework achieves 81.0% accuracy in determining ADA-PROWAG compliance, providing an adaptable, expandable solution for improved urban accessibility assessments.

4:00pm - 4:15pm

A Micro-Scale Walkability Metric for Pleasant Pedestrian Route Planning

George Taylor¹, Kaloyan Karamitov¹, Dessislava Petrova-Antonova^1,2

¹GATE Institute; ²Faculty of Mathematics and Informatics, Sofia University "St. Kliment Ohridski"

This paper proposes a micro-scale walkability metric based on harmonised indicators that supports pedestrian route planning, which prioritises pleasant environments alongside distance efficiency. The employed method quantifies street segments and crossings using geospatial indicators, including pavement width, slope, shade, adjacency to traffic, park context, and crossing type and width. Indicator values are transformed to percentile ranks to harmonise heterogeneous inputs and are aggregated into a single edge-level walkability score on a 0 to 1 scale. The score is integrated into a routing cost function that reduces edge cost with higher walkability, which favours calmer, greener, and wider links while bounding detours relative to the shortest path. The method also accommodates the incorporation of street-level perceptions through a structured survey instrument and a confidence-weighted fusion scheme.

The results show various spatial patterns. Central areas and park-adjacent segments exhibit higher scores, while steep, narrow, and traffic-exposed links score lower, and several suburban and foothill districts display reduced walkability. The comparison with a distance-only baseline shows selection of quieter alignments with modest length increases, indicating potential gains in perceived pleasantness.

4:15pm - 4:30pm

Building upward, dividing deeper: Three-dimensional urban expansion assessment reveals regional heterogeneity of preferential developments worldwide

Yifan Teng¹, Liqun Sun², Xuefeng Guan¹, Xiaozhen Ren¹, Tao Zhou², Qingyang Xu¹, Chang Liu¹, Zhangyan Xu¹, Xu Li¹

¹State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China; ²Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China

Urban areas are continuously expanding outward with economic development and demographic growth, while simultaneously growing higher vertically. However, few efforts have been made to evaluate the impact of development priorities in different regions on urban sustainability, which limited our understanding of how urbanization has been affected by imbalanced evolution rhythm. Here we developed a 3D structure-based approach to assess volumetric urban expansion, as well as a refined evaluation system for assessing urban imbalanced growth trends. Results show that the 3D expansion patterns of urban areas exhibited significant heterogeneity globally. As urbanization accelerates, urban areas in the Global South are showing a trend of faster expansion accompanied by faster vertical growth. In addition, imbalanced growth types across different dimensions are significantly more complicated in the Global South than in the Global North, indicating the variance of development priorities is greater in the Global South. Furthermore, the imbalances are intensifying over time, as indicated by the temporal indices. Our study enhances the understanding of urban 3D patterns and imbalanced urban evolution, providing crucial insights for more balanced urbanization especially in the Global South.

4:30pm - 4:45pm

Quantifying vertical Differences in Green Visibility in High‑Density Cities: A Voxel‑Based Analysis Method

Haoyu Wang¹, Liu Liu¹, Sisi Zlatanova²

¹College of Architecture and Urban Planning,Tongji University, Shanghai, People's Republic of China; ²UNSW Built Environment, Red Centre Building, Kensington NSW 2052, Sydney, Australia

Urban green spaces are important for residents’ physical and mental health, but green visibility is difficult to quantify in high-rise, high-density cities, especially across different height levels. To address this problem, this study proposes a stratified green visibility framework based on airborne LiDAR point clouds and a voxel model. Using the Dutch AHN5 dataset, the study area was converted into a unified 3D voxel space and classified into trees, grass, buildings, ground, and empty space. A voxel-level penetration probability model based on the Beer–Lambert law was introduced to represent the semi-transparent blocking effect of tree canopies, improving upon conventional binary visibility models. Multi-directional line-of-sight (LOS) tracing was then applied to calculate green visibility (GVI) and spatial openness (SOP) at different height layers. The results show that GVI is generally high around parks, large green spaces, and some enclosed courtyards, but its contribution from street trees is limited. Vertically, GVI decreases with height, while SOP tends to increase. Combining the two indicators helps identify different spatial types with distinct visual characteristics. The study demonstrates that airborne LiDAR, combined with voxelization and probabilistic 3D simulation, can effectively capture the vertical variation of urban GVI and support large-area assessment in high-density residential environments.

4:45pm - 5:00pm

Urban Building-Level Positioning using Data-driven Algorithms enhanced by Spatial Variations in Sensor Features

Die Zhang^1,2, Xin Liu³, Yong Ge^1,2,4, Yixi Wei⁵, Mengxiao Liu⁶

¹School of Geography and Environment / Key Laboratory of Poyang Lake Wetland and Watershed Research (Ministry of Education), Jiangxi Normal University, Nanchang, People’s Republic of China; ²Jiangxi Province Key Laboratory of Ecological Intelligent Monitoring and Comprehensive Treatment of Watershed, Jiangxi Normal University, Nanchang, People’s Republic of China; ³2012 Lab, Huawei Technologies Co. Ltd., Shenzhen, People’s Republic of China; ⁴State Key Laboratory of Resources and Environmental Information System, Beijing, China; ⁵Department of Mathematics, Xi’an Medical University, Xi’an, People’s Republic of China; ⁶Chinese Research Academy of Environmental Sciences, Beijing, People’s Republic of China

Accurate building-level mobile device positioning is critical for fine-grained location-based services and human activity analysis, as people spend 80–90% of time indoors. Existing techniques rely on dedicated infrastructure or dense fingerprinting, limiting scalability. This study proposes a lightweight, infrastructure-free framework integrating two core modules: 1) Indoor/outdoor classification via a random forest model trained on a multi-scene sample library, using satellite, Wi-Fi, Bluetooth, and cellular sensor features with similarity-guided training selection; 2) Building matching through a Bayesian inference model leveraging three-scale spatial features (device, building, area) and prior knowledge from anonymous crowdsourced data. Validated in Beijing, Nanjing, and Xi’an, the framework achieves over 90% overall precision for indoor/outdoor classification and ≥70% precision for building matching with satellite or Wi-Fi features alone. It requires no extra infrastructure or extensive labeled data, offering a scalable solution for smart city applications like population analytics, emergency response, and context-aware services across heterogeneous urban regions.

5:00pm - 5:15pm

Detecting Urban Spatial Porosity and Fragmentation from Local Population Patterns

Kiichiro Kumagai, Ayano Shiraishi

Setsunan University, Japan

In Japan, the combined effects of declining birth and marriage rates have accelerated population decline, leading to spatial porosity and fragmentation in urbanised areas: a phenomenon known as “Urban spongification”. This study analyses local population distributions in order to identify localised low-population areas embedded within densely populated urban environments, with the aim of understanding spatial porosity and fragmentation in Osaka Prefecture. A multi-scale spatial autocorrelation approach was applied to detect the spatial extent of localised low-population areas, and results were compared between 1995 and 2020. The analysis further examined how the formation and change of localised low-population areas differ across Use Districts and according to long-term land-use transition histories. The findings reveal pronounced spatial variability within districts that cannot be captured by conventional population density metrics alone. The study demonstrates that the emergence, persistence, and transformation of localised low-population areas are closely related to zoning regulations and historical land-use processes. These results provide insights into the spatial processes contributing to urban porosity and fragmentation and offer a basis for future evaluations of residential inducement areas designated under Location Optimisation Plans.

3:30pm - 5:15pm

WG I/8: Multi-sensor Modelling and Cross-modality Fusion
Location: 715A

3:30pm - 3:45pm

Geometry-aware Subsampling and pole-enhanced Map Constraints for urban Localization of LiDAR-based Systems

Marvin Scherff, Dominik Ernst, Hamza Alkhatib

Leibniz University Hanover, Germany

Urban localization for autonomous driving requires accurate 6-DoF vehicle pose despite GNSS multipath, occlusions, and rapidly changing visibility. We fuse LiDAR, IMU, and GNSS in an error-state Kalman filter against a high-resolution (HR) map, aiming (i) to reduce LiDAR load without degrading accuracy and (ii) to improve robustness in building-sparse areas such as open junctions. The reference trajectory and HR map stem from a dedicated urban measurement campaign; Monte-Carlo simulations use ray-cast LiDAR, synthesized IMU, and GNSS tied to this trajectory so that only sensor noise is varied. A geometry-aware farthest-point sampling scheme prioritizes points informative for building/ground planes and pole-like structures, while an extended functional model introduces poles as additional vertical constraints. A retained-point rate of 10 % preserves trajectory-wide millimetrelevel and sub-milliradian accuracy, meeting in theory automotive requirements. Filter runtime is reduced by about 82 % relative

to the full LiDAR data. Compared with plane-only variants, the planes+poles configuration yields statistically significant but globally modest improvements in longitudinal, lateral, and yaw accuracy. More importantly, a sliding-window analysis reveals that it markedly stabilizes pose in plane-sparse junctions. Overall, the results suggest that task-aware subsampling preserves trajectory-wide performance while pole constraints add local robustness in challenging urban scenes; validation with real sensor logs remains necessary to confirm these accuracy margins, but the proposed filtering scheme shows promising potential for practical deployment.

3:45pm - 4:00pm

Tracking topological relationships and spatiotemporal changes occurring in vague shape phenomena monitored by sensor network: a distributed fuzzy reasoning approach

Roger Ntankouo Njila, Mir Abolfazl Mostafavi

Universite Laval, Canada

Sensor data are increasingly used for monitoring and observation of spatiotemporal phenomena for diverse applications such as in flood management, urban traffic, air quality control, forest fire management, etc. Real time modelling and representation of such evolving phenomena is fundamental for efficient and timeliness decision-making processes. In the context of multisensory systems, where two phenomena (e.g.: air pollution index and windy condition) can both be sensed by networked sensors, analysing the relationship that hold between them is a major issue for decision making. Knowing if the pollution extent is expanding or contracting around a given spot or if it is within a windy zone can help in adopting more appropriate strategies. Sensing system equipped with rule-based reasoning engine to infer on spatiotemporal changes or topological relationship that holds between sensed phenomena with broad boundaries over time will provide decision-maker with adequate and non-ambiguous information. In this paper spatial changes and topological relationship about fuzzy-crisp object modelling the geometry of vague shape phenomena are conceptualized using an Extended Fuzzy Spatiotemporal Change Pattern (FESTCP) and a 5x5 Intersection model (I5x5M) respectively. The rule-based reasoning engine proposed in this paper is based on this conceptualisation. To evaluate our method, a simulated case study of air pollution in Quebec City is carried out. The results reveal that the proposed method captures well the spatiotemporal evolution of a given air pollution episode that served for an on-the-fly decision-making process in real life situations.

4:00pm - 4:15pm

An INS-Centric Locator for Autonomous Vehicles Aided by GNSS, Monocular Visual-Inertial Odometry, and HD Vector Maps

Chin-Chia Hsu, Chi-Hsin Huang, Syun Tsai, Yi-Feng Chang, Kai-Wei Chiang, Meng-Lun Tsai

Dept. of Geomatics, National Cheng Kung University, Tainan, Taiwan

Reliable lane-level localization remains difficult for autonomous vehicles (AVs) when Global Navigation Satellite System (GNSS) observations are degraded by blockage, multipath, and non-line-of-sight reception in urban environments. This paper presents PointLoc, an Inertial Navigation System (INS)-centric locator aided by GNSS, monocular visual-inertial odometry (VIO), and High-Definition (HD) Vector Maps. The proposed method is formulated as an INS-centric error-state extended Kalman filter (EKF), in which the INS provides persistent state propagation, while GNSS, VIO, and map matching are incorporated as aiding updates according to their availability and reliability. This design preserves a unified position, velocity, and attitude solution and enables graceful degradation when some aiding sources become unavailable. The method is validated through real-vehicle experiments in Taichung Shuinan and Tainan Shalun under mixed GNSS conditions. The results show that PointLoc achieves the best overall full-route performance in Taichung Shuinan and remains broadly comparable to GNSS/INS/VIO, while still outperforming GNSS/INS, in Tainan Shalun. In the mapped GNSS-denied segment of Taichung Shuinan, PointLoc effectively suppresses vertical drift and substantially improves three-dimensional positioning. The mapped-road analysis further shows that the INS-centric design avoids the planar instability observed in a vision-centric benchmark and provides a more continuous localization solution.

4:15pm - 4:30pm

Motion Correction for Scanning of Moving Objects using LiDAR: Experimental Validation and Analysis

Salil Goel, Abhishek Yadav

Indian Institute of Technology Kanpur, India

Conventional laser scanning techniques (such as in a Terrestrial Laser Scanner or Mobile mapping), whether used in a static or mobile mode require the object of interest to remain stationary during the scanning stage. Any motion of the object during scanning results in the apparent distortions in the resulting point cloud. The authors in Goel and Lohani (2014b) proposed a motion correction technique to estimate the 3D geometry of a moving object, utilizing a fusion of inertial and GNSS (Global Navigation Satellite Systems) sensors and transformation of the resulting point cloud to an object body coordinate system (OBCS). This paper aims to carry out the experimental validation and performance analysis of the motion correction method. Field experiments are designed and conducted in three phases to verify the correctness of the method. Through this, the paper aims to uncover insights into the performance of the motion correction algorithm and provide the first experimental validation of the proposed technique.

4:30pm - 4:45pm

Multi-sensor Modelling for Temporal Gait Analysis: Evaluating IMU and UWB-Based Approaches

Aditi Sahu, Ashwani Kumar, Salil Goel

Indian Institute of Technology Kanpur, India

Wearable sensors are essential for gait analysis outside of traditional laboratory environments. However, selection of the right sensor technology involves several trade-offs. Inertial Measurement Units (IMUs) offer high temporal resolution which are ideal for detecting gait events but they suffer from drift. Ultra-Wideband (UWB) provides stable spatial data, but are less precise for detecting event timing. This paper presents a comparative study of three distinct foot-mounted sensor methodologies for heel strike detection and cadence estimation: (1) IMU-Only approach, (2) UWB-Only approach, and (3) a multi-sensor IMU+UWB fusion approach. Each method is evaluated against a camera-based ground truth system using data from four subjects. Results show the IMU-Only method is inconsistent, with moderate event precision (Avg. F1: 0.798), temporal accuracy (Avg. MAE: 47.99 ms), and subject-dependent cadence accuracy (Avg. Acc: 89.59%). The UWB-Only method provides robust event detection (Avg. F1: 0.811) with similar temporal error (Avg. MAE: 49.0 ms) but is exceptionally accurate for cadence estimation (Avg. Acc: 96.94%). The IMU+UWB fusion approach achieves the highest event precision (Avg. Precision: 0.835) and the best temporal accuracy (Avg. MAE: 46.51 ms), while also maintaining robust cadence accuracy (Avg. Acc: 95.62%). In conclusion, while the UWB-Only method is ideal for cadence-only applications, the IMU+UWB fusion approach provides the best overall balance of high event precision, superior temporal accuracy, and reliable cadence estimation.

4:45pm - 5:00pm

A Non-rigid Polygon Registration Framework and its Application to Enhancing Building Footprint Accuracy using Aerial LiDAR

Amine Boussik¹, Bruno Vallet¹, Liuyun Duan², Nicolas Girard²

¹Univ Gustave Eiffel, IGN - LASTIG lab, Géodata Paris, France; ²LuxCarta Technology, Mouans Sartoux, France

Accurately registering building footprints from heterogeneous datasets with LiDAR data remains a critical challenge in urban mapping and 3D reconstruction. The objective of this work is to register source data, defined as 2D cadastral vector footprints from structured, regularized, or manually-verified datasets to target building footprints derived from classified aerial LiDAR. LiDAR provides direct 3D information with precise footprint positioning and high spatial resolution, enabling a geometrically reliable representation of dense 3D structures. Conversely, source datasets are not always up-to-date, and may exhibit geometric distortions such as translational offsets, rotational deviations, or local deformations, yet they remain valuable due to their structured organization and metadata content. To enhance geometric fidelity while preserving semantic structure, we propose a practical framework for non-rigid polygon registration that adjusts the geometry of cadastral footprints toward LiDAR-derived targets. The framework consists of two core components: (1) establishing correspondences between source and target polygons, and (2) minimizing a robust distance function that governs the registration process. Three deformation models are introduced: a rigid model allowing translations only, a semi-rigid model allowing deformations while keeping the overall structure of source footprints, and a non-rigid model allowing rotations. We evaluate our method by aligning real cadastral datasets to aerial LiDAR data. The results confirm the effectiveness and robustness of the proposed framework in the context of 2D polygonal cadastral data. This work thus represents the first practical solution for non-rigid polygon registration in this domain.

5:00pm - 5:15pm

Multi-stage mask-aware Depth Enhancement for RGB–IR–stereo Fusion on historic Timber Surfaces

Junquan Pan¹, Maria Chizhova¹, Mona Hess¹, Thomas Luhmann², Ferdinand Maiwald³

¹Digital Technologies in Heritage Conservation, Centre for Heritage Conservation Studies and Technologies (KDWT), University of Bamberg, Bamberg, Germany; ²Institute for Applied Photogrammetry and Geoinformatics (IAPG), Jade University of Applied Sciences, Oldenburg, Germany; ³Chair of Optical 3D-Metrology, Dresden University of Technology, Dresden, Germany

This paper presents a mask-aware multi-stage depth enhancement framework for digital documentation of historical timber surfaces using RGB–Stereo-IR fusion. Accurate geometric recording of aged wood features such as wooden knots remains challenging due to uneven illumination and weak texture. The proposed pipeline, which aims to stabilise depth geometry under uneven illumination and low-texture surface conditions, integrates object detection, instance segmentation and confidence-guided depth refinement across three stages: (A) TV(total variation)-regularized mask-aware refinement, (B) confidence-weighted multi-view fusion, and (C) patch-based stereo reconstruction. Experiments on historical timber beams under varying illumination demonstrate improved depth completeness and geometric consistency, achieving a residual standard deviation below 0.6 mm in bright scenes and stable reconstruction in low-light conditions. The framework offers a practical solution for depth reconstruction of cultural heritage timber, supporting more reliable feature detection and analysis.

3:30pm - 5:15pm

WG II/3B: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

3:30pm - 3:45pm

3D gaussian splatting for large-scale 3D reconstruction: an evaluation and quality analysis

Jiangxue Yu¹, Yueling Liao¹, San Jiang^2,3, Xing Zhang^2,3, Zhijun Wang⁴, Qingquan Li^2,3

¹School of Computer Science, China University of Geosciences, Wuhan 430074, China; ²Guangdong Key Laboratory of Urban Informatics, Shenzhen University, Guangdong Shenzhen, 518060, China; ³MNR Key Laboratory for Geo-Environmental Monitoring of Great Bay Area, Shenzhen University, Guangdong Shenzhen, 518060, China; ⁴Guangdong Laboratory of Artificial Intelligence and Digital Economy (Shenzhen), Guangdong Shenzhen, 518060, China

Large-scale 3D reconstruction has emerged as a key research in the fields of photogrammetry and computer vision. 3D Gaussian Splatting (3DGS) has become a mainstream approach due to its efficient rendering, but it confronts critical challenges in large-scale scenarios: excessive memory overhead and inadequate geometric accuracy. Meanwhile, the traditional Structure from Motion and Multi-view Stereo (SfM-MVS) framework, despite its cumbersome process, continues to exhibit robust performance. Notably, a systematic evaluation comparing these two paradigms in large-scale scenes remains absent. To address this, we develop a unified verification framework to evaluate the texture rendering quality and geometric reconstruction precision of several recent methods using real-world datasets. The results indicate that SfM-MVS methods still maintain an advantage in the completeness and accuracy of geometric reconstruction. In contrast, 3DGS methods have achieved breakthroughs in local accuracy or rendering-geometry synergy, yet their global consistency requires further improvement.

3:45pm - 4:00pm

RobustGauss: Robust 3D gaussian splatting for distractor-free 3D scene reconstruction

Haibing Liu¹, Shihan Chen¹, Huchen Li¹, Wubiao Huang¹, Shuai Zhang¹, Fei Deng^1,2

¹School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China; ²Hubei Luojia Laboratory, Wuhan 430079, China

3DGS-based methods often render transient distractors in 3D scenes as significant floating artifacts. Existing works for removing transient distractors suffer from under-identification or over-identification, resulting in residual transient distractors affecting reconstruction quality or loss of scene information, preventing the reconstruction of fine details. To address these challenges, we propose RobustGauss. We first rely solely on the cosine similarity of DINOv2 features to robustly predict uncertainty masks and accurately identify the main regions of transient disturbances and their corresponding shadows. Due to the limited resolution of DINOv2 features, we use high-resolution image residuals to refine the edges of the initial uncertainty masks, thereby accurately identifying all transient distractors and minimizing their impact on 3D scene reconstruction. Experiments on two challenging datasets demonstrate that our method achieves state-of-the-art performance.

4:00pm - 4:15pm

BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model

Yuci Han¹, Charles Toth¹, John Anderson², William Shuart², Alper Yilmaz¹

¹the ohio state university, United States of America; ²USACE ERDC GRL

N/A

4:15pm - 4:30pm

EMVSNet: Evidential multi-view stereo reconstruction for sampling-free depth and uncertainty estimation

Christian Grannemann, Max Mehltretter

Leibniz University Hannover, Germany

We present EMVSNet, a sampling-free Multi-View Stereo (MVS) method that, to the best of our knowledge, is the first to integrate Evidential Deep Learning into MVS. Given a set of overlapping images, our method predicts a depth value together with its associated uncertainty per pixel of a reference image, incorporating uncertainty from aleatoric and epistemic sources. Specifically, we use an existing convolutional neural network architecture designed for MVS as backbone and extend it to regress evidential parameters per pixel, describing the probability distribution over the depth corresponding to this pixel. In contrast to existing MVS methods that often neglect epistemic uncertainty or obtain it via sampling at inference, our evidential formulation does not require sampling, but enables single-pass inference. We evaluate the uncertainty estimation capabilities of our method using two publicly available datasets and compare the depth predictions against a deterministic variant. The experimental results demonstrate that EMVSNet achieves competitive depth accuracy while, at the same time, providing uncertainty estimates that enable us to reliably rank depth estimates according to their risk of being incorrect and to automatically identify out of distribution data. Our model shows only slightly increased inference time compared to a deterministic baseline while giving comparable uncertainty estimates to an computationally expensive sampling based approach, marking a first step towards real-time capable uncertainty estimation for image-based 3D reconstruction.

4:30pm - 4:45pm

Adaptive Scaling with Geometric and Visual Continuity of completed 3D objects

Jelle Vermandere, Maarten Bassier, Maarten Vergauwen

KU Leuven, Belgium

Object completion networks typically produce static Signed Distance Fields (SDFs) that faithfully reconstruct geometry but cannot be rescaled or deformed without introducing structural distortions. This limitation restricts their use in applications requiring flexible object manipulation, such as indoor redesign, simulation, and digital content creation. We introduce a part-aware scaling framework that transforms these static completed SDFs into editable, structurally coherent objects. Starting from SDFs and Texture Fields generated by state-of-the-art completion models, our method performs automatic part segmentation, defines user-controlled scaling zones, and applies smooth interpolation of SDFs, color, and part indices to enable proportional and artifact-free deformation. We further incorporate a repetition-based strategy to handle large-scale deformations while preserving repeating geometric patterns. Experiments on Matterport3D and ShapeNet objects show that our method overcomes the inherent rigidity of completed SDFs and is visually more appealing than global and naive selective scaling, particularly for complex shapes and repetitive structures.

4:45pm - 5:00pm

MambaPanoptic: a Vision Mamba-based Structured State Space Framework for panoptic Segmentation

Qing Cheng^1,2, Damiano Bertolini^1,3, Wei Zhang⁴, Dong Wang⁵, Niclas Zeller⁶, Daniel Cremers^1,2

¹Technical University of Munich, Germany; ²Munich Center for Machine Learning; ³Polytechnic University of Milan; ⁴University of Stuttgart; ⁵Wuhan University; ⁶Karlsruhe University of Applied Sciences

Panoptic segmentation requires the simultaneous recognition of countable thing instances and amorphous stuff regions, placing joint demands on long-range context modelling, multi-scale feature representation, and efficient dense prediction. Existing convolutional and transformer-based methods struggle to satisfy all three requirements concurrently: convolutional architectures are limited in their capacity to model long-range dependencies, while transformer-based methods incur quadratic computational cost that is prohibitive at high resolutions. In this paper, we propose MambaPanoptic, a fully Mamba-based panoptic segmentation framework that addresses these limitations through two principal contributions. First, we introduce MambaFPN, a top-down feature pyramid that leverages Mamba blocks to generate globally coherent, multi-scale feature representations with linear computational complexity. Second, we adopt a PanopticFCN-style kernel generator that produces unified thing and stuff kernels for proposal-free panoptic prediction, enhanced by a QuadMamba-based feature refinement module applied at multiple network stages. Experiments on the Cityscapes and COCO panoptic segmentation benchmarks demonstrate that MambaPanoptic consistently outperforms PanopticDeepLab and PanopticFCN under comparable model sizes, and matches or surpasses Mask2Former on Cityscapes in PQ and AP while requiring fewer parameters.

5:00pm - 5:15pm

GeoPrior-Diff: Using Stable Diffusion as a geometric Prior for single-view 3D Point Cloud Reconstruction

Youssef Korny¹, Sunghwan Yoo¹, Mohammad Moein Sheikholeslami¹, Daniel Panangian², Ksenia Bittner², Andreas Wichmann³, Gunho Sohn¹

¹Dept. of Earth and Space Science and Engineering, York University, Canada; ²Remote Sensing Technology Institute, German Aerospace Center (DLR), Germany; ³Institute for Applied Photogrammetry and Geoinformatics (IAPG), Jade University of Applied Sciences, Germany

Single-view 3D reconstruction from monocular aerial imagery presents a fundamental challenge in remote sensing due to the inherent scale ambiguity and the complex geometry of urban environments. Traditional regression-based methods often struggle to recover high-frequency structural details, leading to over-smoothed or noisy outputs. To address this, we introduce GeoPrior-Diff, a novel two-stage framework that leverages the generative capabilities of Latent Diffusion Models to reconstruct high-fidelity 3D point clouds.

Unlike direct generation approaches, our method explicitly bridges the domain gap between 2D texture and 3D structure by utilizing an intermediate geometric prior. In the first stage, we predict an oblique normal map from the input satellite imagery, capturing essential surface orientation and structural boundaries. In the second stage, this normal map serves as a strong conditioning signal for a probabilistic diffusion model, guiding the denoising process to synthesize accurate 3D point clouds. Preliminary results demonstrate that decoupling geometric estimation from point generation significantly enhances structural consistency and reduces artifacts compared to baseline methods. This work highlights the potential of using generative priors for robust 3D urban modeling from limited data.

3:30pm - 5:15pm

IvS4: Operationalizing Earth Observation for Sustainable Resource Development
Location: 716A

3:30pm - 3:45pm

Supporting Canada’s Ring of Fire Regional Assessment Through Earth Observation

Darren Janzen, Wenjun Chen, Sylvia Thomas, Sylvain Leblanc

Natural Resources Canada, Canada

This presentation examines the integration of Earth Observation (EO) data into Canada’s impact assessment (IA) processes, highlighting the progress and applications of the Earth Observation for Cumulative Effects – Phase 2 (EO4CE-2) program. Despite rapid growth in EO data acquisition, analytics, and delivery systems, uptake by IA practitioners has been limited due to persistent barriers such as awareness of EO capabilities, data accessibility, and analytical capacity. EO4CE-2, led by Natural Resources Canada’s Canada Centre for Mapping and Earth Observation (CCMEO), aims to address these challenges by providing high-quality, standardized EO datasets and operational frameworks to support transparent, data-driven IA processes.

EO4CE-2 has produced a library of EO-derived products leveraging decadal satellite records and advanced machine learning, enabling comprehensive analysis of land use, water resources, vegetation, lake and river ice, and terrain stability. These datasets allow decision-makers to evaluate environmental status and trends.

A key application is the Regional Assessment of the Ring of Fire (ROF) area in northern Ontario, where eleven EO-based indicators—covering water systems, wildlife habitat, forest ecosystems, permafrost, and terrain deformation—support assessment priorities such as environmental health, social equity, and community well-being. Indigenous communities have played a central role in validating these indicators and contextualizing EO data. The results demonstrate that combining satellite observations with local knowledge enhances regional assessments, supports sustainable resource management, and informs evidence-based decision-making. This presentation highlights EO4CE-2’s achievements, challenges, and lessons learned in advancing the use of EO for cumulative effects assessment in Canada.

3:45pm - 4:00pm

Forest Biomass Estimation in Québec with Multi-Source Earth Observation and Machine Learning in Google Earth Engine

Maryam Rahimzad, Anne Ola, Saeid Homayouni

INRS, Canada

Forest biomass plays a central role in carbon accounting, climate modeling, and sustainable forest management. However, large-scale biomass estimation remains challenging due to the limited spatial coverage of field inventories and the inherent spectral saturation issues of optical remote sensing in dense forest canopies. This study presents an operational workflow for mapping above-ground biomass (AGB) across southern Québec using multi-source Earth observation data and machine learning implemented in Google Earth Engine. The approach integrates Sentinel-2 optical composites, Sentinel-1 dual-polarization SAR metrics, and a high-resolution 1-m canopy height model with detailed plot-level biomass derived from Québec’s Placettes-Échantillons Permanentes (PEP) network. A Gradient Tree Boosting model was trained on 4,083 quality-controlled field plots to capture species, structural, and spectral variability. Validation results show strong agreement between predicted and observed biomass (R² ≈ 0.76, RMSE ≈ 14.4 Mg ha⁻¹), demonstrating the value of fusing optical, radar, and structural predictors. The resulting biomass and carbon maps provide actionable information for forest monitoring, regional reporting, and environmental decision-making. This contribution highlights the effectiveness of cloud-based multi-sensor fusion for operational AGB estimation and offers a scalable methodology applicable to broader Canadian forest regions.

4:00pm - 4:15pm

The Terrestrial Snow Mass Mission (TSMM) Academic Consortium: Ku-Band SWE Retrieval Advances and Validation from Mountainous and Arctic Field Campaigns

Nicolas Marchand, Alexandre Langlois

Université de Sherbrooke / CARTEL, Canada

Seasonal snow remains a critical component of Canada’s water cycle, yet consistent, high-resolution monitoring of snow water equivalent (SWE) is still lacking at national and hemispheric scales. The Terrestrial Snow Mass Mission (TSMM) proposes a dedicated dual-frequency Ku-band radar satellite designed to deliver spatially continuous SWE estimates at 500 m resolution with a 5–7 day revisit rate. To prepare the scientific foundations of this mission, the TSMM Academic Consortium has expanded to 16 Canadian institutions and now integrates data from over 40 long-term snow research sites.

Between 2024 and 2026, the consortium conducted ten coordinated field campaigns across mountainous and Arctic environments, in collaboration with Environment and Climate Change Canada, the Canadian Space Agency, and the European Space Agency. These campaigns combined ground-based and airborne Ku-band radar, detailed snowpit measurements, microstructure characterization, UAV surveys, and GNSS mapping. Joint ESA–TSMM activities at Cambridge Bay further enhanced Ku-band validation in deep Arctic snow.

Recent advances include improved dual-frequency Ku-band inversion methods, refined radiative transfer models, enhanced wet/dry snow classification, and integration of radar-derived SWE into snow model simulations and CaLDAS assimilation frameworks. Together, these developments confirm TSMM’s feasibility and scientific readiness.

This contribution summarizes the consortium’s field results and retrieval advances, demonstrating the mission’s potential to provide operational SWE monitoring essential for hydrology, climate science, wildfire preparedness, and Arctic environmental security.

4:15pm - 4:30pm

Gaussian Process Regression-Based Geospatial Framework for Emergency Shelter Suitability Assessment

Sanmitha V S, Shanjanaa G, Mukilarasan V, Nijanthanathan M, Vani K

College of Engineering Guindy, India

The disaster resilience in urban environments remains a critical yet often underexplored component of sustainable development, particularly in densely populated regions where schools and community shelters serve as vital emergency infrastructure. Despite their importance, the systematic assessment of these shelters’ suitability is frequently overlooked, leading to disparities in safety, accessibility, and preparedness during crisis events. This research introduces a comprehensive, data-driven framework for evaluating the suitability of educational institutions and community shelters using Gaussian Process Regression (GPR). The proposed model integrates multiple geospatial and infrastructural parameters, including environmental risk exposure, proximity to fault lines and water bodies, structural integrity, road connectivity, and population density. By modeling the complex nonlinear relationships among these significant factors, the Gaussian Process Regression (GPR)-based approach predicts shelter safety scores that reflect the relative resilience and accessibility of each location. The predicted scores are spatially visualized using interactive geospatial mapping tools, allowing decision makers to easily identify safer zones or shelters and high-risk clusters across Delhi. The areas with higher scores correspond to shelters with strong infrastructure and better access to emergency resources and open spaces, whereas lower-scoring regions indicate vulnerable areas in need of immediate policy attention and structural reinforcement. The outlier detection techniques further enhance the interpretability of results by identifying anomalous schools with unusually high or low suitability for deeper investigation. The model’s performance, evaluated through five-fold cross-validation, reveals variability in Mean Squared Error (MSE) across folds, indicating sensitivity to spatial heterogeneity and highlighting potential improvements through hyperparameter tuning and ensemble learning strategies.

4:30pm - 4:45pm

Earth Observation–Based Geospatial Analysis of Population–Air Quality Interaction

Xuejia Wei¹, Zechang Li², Liang Huo¹, Tao Shen¹, Jiahui Wang¹, Jinlong Wang¹

¹Beijing University of Civil Engineering and Architecture, China, People's Republic of; ²Hebei Provincial Coalfield Geological Bureau New Energy Geological Team

Rapid urbanization has profoundly reshaped the spatial dynamics of population distribution, environmental quality, and resource utilization, particularly in megacities such as Beijing. As population density, industrial activity, and transportation intensity continue to rise, air pollution has become a key constraint on sustainable resource development and urban livability. In recent years, the integration of Earth Observation (EO) with geospatial analytics has provided new opportunities for monitoring, modeling, and managing urban environmental systems.

For instance, Liu et al. employed complex network theory to analyze regional air quality variations in the Yangtze River Delta[1], while Rabie et al. developed a CNN–Bi-LSTM hybrid framework to predict spatially resolved AQI in megacities[2]. Similarly, Similarly, Ma et al. used a temporal-encoded Informer model to forecast AQI in northern China[3],and Ahmed et al. demonstrated that EO-derived hydro-climatological variables can substantially enhance AQI prediction accuracy when combined with deep learning models[4].Moreover, Sarkar et al. proposed an effective hybrid deep learning model for AQI prediction, which further validates the potential of hybrid approaches in capturing complex urban air pollution patterns[5].However, most existing studies emphasize temporal forecasting or algorithmic improvement, while the spatial interaction between population distribution and air quality remains insufficiently explored. To bridge this gap, this study develops an EO-supported geospatial framework that integrates demographic and environmental data to analyze spatial heterogeneity and exposure inequality in Beijing, providing data-driven insights for sustainable resource and environmental governance.

4:45pm - 5:00pm

Comparing PlanetScope and Sentinel 2 for mapping water quality using machine learning models in Fanshawe Lake, Ontario, Canada

Kathy Tang, Jinfei Wang

Western University, Ontario, Canada

In this study, we compared the performance of four machine learning (ML) models, including Multiple Linear Regression (MLR), Random Forest (RF), XGBoost, and Support Vector Regression (SVR), for predicting and mapping key water quality parameters, namely dissolved oxygen (DO), total phosphorus (TP), total nitrogen (TN), and turbidity in Fanshawe Lake using two distinct satellite datasets: the high-resolution PlanetScope and the Sentinel-2 imagery.

Eleven commonly used spectral indices sensitive to suspended particles and algae were derived from PS and Sentinel-2 imagery, combined with in situ measurements collected from 2018 and 2024 to train and validate the models. We evaluated the ML models using R², mean absolute error (MAE), and root mean squared error (RMSE). Our study shows that using machine learning with satellite imagery can provide encouraging predictions of key water quality indicators in Fanshawe Lake. There are certain benefits of using high spatial and temporal resolution PS satellite imagery instead of Sentinel-2 datasets to capture localized changes in water quality parameters. The Upper Thames River Conservation Authority can use these results to predict when algal blooms might occur in Fanshawe Lake. Future research may investigate the capture of seasonal trends through the integration of additional field and satellite datasets with time-series models, such as Long Short-Term Memory.

5:00pm - 5:15pm

Climate-Induced Changes in Glacier and Snow Dynamics Using Integrated Remote Sensing for Water Resource and Ecosystem Resilience

Dr. Hania Arif

LCWU, Pakistan

Climate change is rapidly affecting glaciers and seasonal snow in high-altitude regions, which in turn threatens water resources and mountain ecosystems. In this study, I aim to understand and quantify these climate-driven changes by combining data from Sentinel-1 radar and Sentinel-2 optical satellite imagery. By analyzing datasets collected over multiple years, I can observe how glaciers are retreating, snow cover is changing, snow grain size is evolving, and seasonal melt patterns are shifting. To achieve this, I use a combination of advanced spectral and radar indices along with machine learning techniques to extract detailed information about snow and glacier characteristics and track their changes over time. These results allow me to evaluate when snow melts and how it may affect downstream water flow, which is essential for sustainable water management and maintaining ecosystem health. I also make use of cloud-based platforms like Google Earth Engine to efficiently process large volumes of satellite data. By integrating AI-driven analysis with remote sensing, I can produce accurate, large-scale maps and insights that help predict future trends. The outcomes of my study are not only important for understanding how climate change is impacting glaciers and snow in my study area but also provide a framework that can be applied to other mountain regions around the world. Ultimately, my research offers valuable information for planning climate adaptation strategies and ensuring the resilience of both water resources and mountain ecosystems.

3:30pm - 5:15pm

Forum1B: Observing the Earth as One: Making space for everyone in Remote Sensing, Photogrammetry, and Spatial Information Science
Location: 716B

3:30pm - 5:15pm

Forum6: UN-IGIF: Capacity Building and Education Opportunities
Location: 717A

3:30pm - 5:30pm

InS2: Industry Tech Session
Location: 717B

3:30pm - 5:30pm

P1: Poster Session 1
Location: Exhibition Hall "E"

Denoising microwave interferometry data for high-Rise buildings with CEEMDAN energy-Correlation dual Criteria

Haiqian Wu¹, Runjie Wang¹, Xianglei Liu¹, Zhao Lu¹, Yuqi Zhang¹, Songxue Zhao²

¹Beijing University Of Civil Engineering And Architecture, China, People's Republic of; ²School of Land Science and Technology, China University of Geosciences, Beijing 100083, China

High-rise buildings are fundamental components of modern urban infrastructure, and their structural safety under dynamic loads such as wind and earthquakes is critically important In recent years, ground-based radar interferometry has been widely employed for monitoring vibrations and deformations in tall structures, owing to its high precision, non-contact operation, and full-field measurement capability. In practical monitoring, displacement signals are affected by various types of noise, leading to unstable and nonlinear variations in the signal. This makes it difficult to accurately obtain structural vibration characteristics (such as frequency and damping ratio) and micro-deformation data with precision. Conventional denoising techniques are often applied for noise reduction. Nevertheless, these methods exhibit notable limitations. Bandpass filtering requires a predefined frequency passband, becoming ineffective in cases of spectral overlap between signal and noise. Wavelet denoising lacks adaptability due to its strong dependence on the selected wavelet basis and decomposition level, often introducing signal distortion. Kalman filtering, meanwhile, relies on an accurate state-space model, the construction of which is challenging for complex high-rise structures, thereby limiting its practical utility. In response to these challenges, this paper proposes a fully adaptive denoising method based on CEEMDAN, incorporating dual criteria of energy distribution and correlation. The proposed approach effectively processes non-stationary and nonlinear signals while avoiding the subjectivity associated with basis selection in traditional methods. It significantly improves both mode separation accuracy and denoising reliability, establishing a robust foundation for structural state assessment and safety early warning based on radar monitoring data.

Precision Increase for LiDAR-based Localisation using a predefined global Map

Martin Hesse, Andreas Nuechter

Julius-Maximilians-Universität Würzburg, Germany

Localisation remains a crucial aspect of robotic design. It forms the basis of any kind of autonomous navigation for drones, cars and other specialized robots. This is usually achieved using a Simultaneous Localisation and Mapping (SLAM) algorithm, which uses an input sensor to localise the robot within a map that is created simultaneously. The input sensors are either cameras, which provide visual data, or Light Detection And Ranging (LiDAR) sensors, which automatically deliver a point cloud up to surveying quality. In recent years, LiDAR inertial odometry (LIO) algorithms, which combine measurements from a LiDAR sensor and inertial measurements from an IMU, have become more popular. These algorithms do not use a previously recorded map, but rather create their own map during runtime. This paper contributes an improvement to the precision by integrating a predefined 3D global point cloud map into the localisation algorithm. Over the course of multiple experiments in different testing scenarios, we have achieved a 71% reduction of the distance error for localisation, while there was no significant change regarding the orientation error. This makes the presented system a suitable localisation option for real-world robotic operations at construction sites.

Monocular ORB-SLAM3 Evaluation for Multi-Altitude VTOL UAV Mapping

Ming-Jyun Yang¹, Jyun-Ping Jhan², Ru-Shan Tang³

¹Graduate Institute of Artificial Intelligence Cross-disciplinary Technology, National Taiwan University of Science and Technology, Taipei, Taiwan; ²Graduate Institute of Artificial Intelligence Cross-disciplinary Technology, National Taiwan University of Science and Technology, Taipei, Taiwan; ³Systems Development Center, National Chung-Shan Institute of Science and Technology, Taiwan

Reliable visual localization is essential for long-range VTOL UAV mapping in GNSS-degraded environments. This paper presents a quantitative evaluation framework for monocular ORB-SLAM3 using a 66.48 km multi-altitude UAV mission and aerial-triangulation-derived camera poses as reference data. The workflow associates SLAM and reference trajectories by image key, applies Sim(3)-based metric alignment, corrects coordinate-axis inconsistency, and refines attitude by a global rotation offset, enabling full-mission and segment-level comparison in a common metric frame. The evaluation covers four altitude segments, namely 100, 150, 200, and 250 m AGL, under three protocols: No-Loop (NL), With-Loop Global Slice (GS), and With-Loop Local Re-Sim(3) (LR). For the full mission, the proposed alignment achieves a 3D position RMSE of 7.41 m over 5330 matched frames and substantially reduces the geometric deformation observed in the S+T baseline. Segment-level results show a strong altitude dependency in the isolated NL runs, with 3D RMSE decreasing from 22.95 m at 100 m to 5.49 m at 250 m. Among the three protocols, LR consistently yields the best segment-level position accuracy, reaching 4.00, 8.26, 3.94, and 3.92 m at 100, 150, 200, and 250 m, respectively. Long-range analysis further shows that the trajectory remains globally bounded, while cumulative 3D endpoint drift increases from 0.35 m at 50 m to 10.66 m at 25.6 km. These results indicate that ORB-SLAM3 can support large-scale trajectory estimation for UAV mapping, but its evaluated quality depends strongly on alignment, segmentation, and evaluation strategy.

Mitigating InSAR Tropospheric Delays via Least Squares Collocation: GNSS-Based Correction and Data-Driven Filtering

Hailu Chen, Yunzhong Shen, Lei Zhang

Tongji university, China, People's Republic of

Tropospheric delays significantly hinder accurate InSAR deformation mapping, and their complex spatiotemporal vari-ability makes effective mitigation challenging. When GNSS are available, conventional functional models interpolate GNSS-derived delays to unobserved locations, but their low-order form mitigates only long-wavelength errors and neglects the stratified component. In phase-based correction, temporal low-pass filters such as the Gaussian filter suppress high-frequency turbulence but ignore the strong distance/elevation-dependence of tropospheric delays, making the results highly sensitive to the chosen time window .

In response, we adopt a Least Squares Collocation (LSC) scheme, an effective approach that treats spatially correlated turbulence as a stochastic variable, characterizes it through a variance–covariance model , and estimates it jointly with other deterministic parameters. With external GNSS data, a joint correction that accounts for both the strati-fied and turbulent components is constructed, and are simul-taneously estimated using LSC. For the data-driven case, the deformation phases are parameter-ized by a time-domain polynomial, while the turbulence are treated as spatially correlated stochastic variables defined by spatial variance-covariance functions. LSC is employed to estimate the deformation model parameters through sliding time-window filtering process .

Multi-sensor fusion

Wang Yankun¹, Fan yong², Wang Weixi³, Li Xiaoming⁴, Guo Renzhong⁵

¹Shenzhen Polytechnic University, China, People's Republic of; ²School of Artifcial Intelligence, Shenzhen Polytechnic University, Shenzhen; ³School of Artifcial Intelligence, Shenzhen Polytechnic University, Shenzhen; ⁴School of Artifcial Intelligence, Shenzhen Polytechnic University, Shenzhen; ⁵School of Artifcial Intelligence, Shenzhen Polytechnic University, Shenzhen

We design a voice-interactive indoor positioning method that jointly utilizes spatial “near” relationships extracted from verbal descriptions and multiple sensor sources

Hybrid Explicit–Implicit Dense Mapping with Quality-Guided Refinement and Residual Feedback

Xing Lin, Leyang Zhao, Weixi Wang, Yizhen Yan, Rui Yuan, Zirui Liu, Junjie Huang, Hao Zhang

Research Institute for Smart Cities, School of Architecture and Urban Planning, Shenzhen University, Shenzhen, 518060, PR China

Real-time dense SLAM must balance geometric fidelity and computational efficiency for autonomous navigation. Explicit mapping methods provide stable global structure and fast updates, but suffer from discretization artifacts and memory overhead. Implicit neural representations capture continuous surfaces and fine details, yet require expensive optimization and are sensitive to initialization. Existing hybrid approaches combine both paradigms, but often allocate neural refinement inefficiently and remain vulnerable to pose errors. To address these limitations, we propose a selective hybrid dense mapping framework that couples a scene-wide TSDF backbone with quality-guided implicit local refinement and residual-guided sliding-window pose feedback. Neural refinement is activated only in low-quality regions identified by multi-indicator assessment, while keyframe poses are re-optimized using residuals from explicit raycasting and implicit rendering. Experiments on TUM RGB-D and Replica demonstrate improved mapping accuracy, localization robustness, and real-time efficiency.

Simultaneous Calibration of Boresight and Lever Arm for mobile LiDAR Systems on hydrographic Platforms using synthetic and real Data

Jordan McManus¹, Christian Larouche¹, Mohsen Hassanzadeh Shahraji²

¹Laval University, Canada; ²Quebec Geomatics Center

We present a simultaneous calibration of the 6 installation parameters (3 boresight angles and 3 lever arm offsets) for a mobile LiDAR system on a hydrographic platform using spherical targets. This algorithm finds the boresight angles and lever arm offsets that minimize the sum of positive distances from points to their corresponding sphere surfaces. The calibration method is first developed and tested using synthetic data generated by a ray-tracing algorithm using a line-sphere intersection model and is subsequently validated using real scan data. The spherical targets are installed on tripods at the calibration site, where their centers are surveyed using postprocessed GNSS observations. The RMS error for the distance between the surveyed sphere centers and the fitted sphere centers is 3.6 cm, which we attribute to the propagation of GNSS and LiDAR scanner errors. The MATLAB code developed for the simultaneous estimation of a complete set of 6 LiDAR installation parameters using spherical targets is available as open-source software on GitHub.

Road Surface Condition Evaluation Using Multi-Grade Accelerometers

Aser Eissa, Mona Hodaei, Raja Manish, Ayman Habib

Lyles School of Civil and Construction Engineering, Purdue University, West Lafayette, USA -

Efficient road surface condition evaluation is critical for ensuring transportation safety, maintaining ride quality, and supporting informed pavement management decisions. Conventional methods such as laser profilers and specialized inertial systems provide highly accurate measurements but are expensive, limited to specialized vehicles, and difficult to scale for network-level monitoring. To address these limitations, this study presents an accelerometer-based framework that leverages survey-, mapping-, and consumer-grade GNSS/INS units to detect pavement surface anomalies in a cost-effective and scalable manner.

Vertical acceleration data were collected using three inertial systems mounted on the Purdue Wheel-based Mobile Mapping Systems: the high-accuracy PWMMS-HA, the ultra-high-accuracy PWMMS-UHA, and a compact low-cost OpenIMU paired with a SparkFun GPS-RTK2 unit. All systems were driven along a 59 km closed-loop roadway network in West Lafayette, Indiana, capturing diverse pavement conditions under identical driving trajectories. The proposed pipeline includes two complementary anomaly detection approaches. The first applies an Isolation Forest model, an unsupervised machine learning technique that identifies abnormal vibration patterns using statistical window-based features. The second employs an Adaptive Threshold method that flags acceleration windows exceeding a dynamic statistical threshold. Both methods categorize detected anomalies into mild, moderate, and severe levels.

Across all IMU grades, the Isolation Forest detected 962–965 anomalies, while the Adaptive Threshold identified 989–996 anomalies, with more than 91% spatial agreement between sensors and over 96% consistency between detection approaches. Results demonstrate that even low-cost inertial sensors reliably capture pavement disturbances.

Design and Development of a Livox-Based Indoor Surveying System for Floor Mapping Applications

Walid Darwish^1,2, Mostafa G. Abdelmageed³, Moustafa Baraka⁴, Adel Elshazly¹, Ahmad M. Senousi^1,2, Wael Ahmed^1,2

¹Geomatics Engineering Lab, Public Works Department, Cairo University, Giza 12613, Egypt;; ²NAMAA for Engineering Consultations, Dokki , Giza 12612, Egypt; ³Mechanical Design & Production Department, Faculty of Engineering, Cairo University, Giza 12613, Egypt.; ⁴Civil Engineering Program, German University in Cairo 11835, Egypt

Accurate three-dimensional (3D) data acquisition of indoor environments remains a challenging and resource-intensive task, particularly for fully furnished spaces. This study presents the development and implementation of a low-cost wearable surveying system for efficient 3D indoor data acquisition. The proposed system integrates a Livox Mid-360 LiDAR sensor and an RGB camera mounted on a helmet, both controlled via a min-PC unit for synchronized data collection. The captured LiDAR frames and inertial measurement unit (IMU) data are fused with RGB imagery using a Simultaneous Localization and Mapping (SLAM) framework to generate 3D reconstruction of interior structures. The resulting point clouds and wall models are evaluated based on RANSAC line fitting method to assess their geometric accuracy and structural consistency. Moreover, a ground truth measurements were collected to verify the absolute accuracy of the resulting point clouds. The proposed approach demonstrates the potential of cost-effective, portable solutions for indoor 3D mapping and documentation with a cm-level of accuracy.

The Potential of HT-1 Spaceborne InSAR for Forest Vertical Structure Inversion

Jiawei Ye¹, Lijun Lu¹, Xu Zhang², Shibo Guo³, Ruiyun Yang³, Hanchao Zhang¹

¹State Key Laboratory of Spatial Datum, Chinese Academy of Surveying and Mapping; ²State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University; ³Zhuzhou Space Interstellar Satellite Technology Co., Ltd.

Hongtu-1 (HT-1) InSAR satellite, as an innovative multi-baseline X-band InSAR system, employs a four-satellite cartwheel formation to achieve single-pass interferometry. HT-1 has been extensively used for topographic mapping, disaster monitoring, and assessment. This research focuses on assessment of HT-1's capability for high-precision forest vertical structure inversion, especially for forest height estimation (X-band SAR is sensitive to canopy of forest due to the higher frequency). This paper develops a complete interferometric processing for HT-1 multi-baseline data. By applying two representative InSAR techniques to HT-1 multi-baseline InSAR data, this results demonstrate the forest vertical structure profile and canopy height map derived over the test site, which shows good agreement with the LiDAR data. The results confirm HT-1's feasibility for tomography and demonstrate the potential of multi-baseline satellites for future missions.

An integrated HSI Reconstruction Model combining supervised and unsupervised Learning

Dongyi Li, Xiaobin Guan, Zhonghang Qiu, Huanfeng Shen

wuhan university, China, People's Republic of

Hyperspectral images (HSIs) provide rich spatial–spectral information for applications such as environmental monitoring, land cover mapping, and mineral exploration. However, their practical utility is often severely degraded by mixed noise (Gaussian, impulse, and other unstructured components), striping artifacts, and partially missing data, especially in bands affected by strong water vapor absorption. Existing methods typically treat these degradations separately and struggle to jointly correct them within a unified framework.

This work presents an integrated HSI reconstruction method that couples a low-rank decomposition model with a hybrid supervised–unsupervised deep architecture. The HSI is factorized into spatial abundance maps and spectral endmember signatures, which are respectively modeled by a Transformer-based abundance reconstruction network and a 1D convolutional endmember smoothing network. The abundance network is first supervisedly pre-trained on large-scale natural image datasets and then fine-tuned, together with the spectral network, using unsupervised loss terms tailored to spatial and spectral fidelity. A weighted group-sparse regularization is further introduced to explicitly capture striping noise and constrain the learned subspaces.

Extensive experiments on simulated Washington DC Mall data and real Gaofen-5 (GF-5) satellite imagery demonstrate that the proposed method effectively suppresses unstructured noise, removes striping artifacts, and recovers missing information, achieving superior visual quality, higher spectral fidelity, and fewer artifacts compared with state-of-the-art baselines.

Evaluation of TLS and PMLS sensors for cultural heritage documentation and HBIM modelling: the case study of San Giacomo Church in Como, Italy

Manuel Garramone, Luigi Barazzetti, Marco Scaioni

Dept. of Architecture, Built Environment and Construction Engineering, Politecnico di Milano, via Ponzio 31, 20133 Milano, Italy

Static Terrestrial Laser Scanning (TLS) and SLAM-based Portable Mobile Laser Scanning (PMLS) are increasingly adopted in Cultural Heritage (CH) documentation, but their suitability for Historic/Heritage Building Information Modelling (HBIM) depends on both data quality and acquisition conditions. This paper compares a 2022 TLS survey and a 2025 handheld PMLS survey of San Giacomo Church (Como, Italy) to assess whether the latter can reliably support HBIM-oriented documentation. The methodology combines dataset-level comparison and ROI-based analysis on four stable architectural elements: apsis, pillar, timber roof truss, and central dome. Three complementary metrics were used: a local density proxy, a scale-dependent coverage ratio, and M3C2 distance statistics for geometric agreement. Results show that PMLS is consistently less dense than TLS but remains effective for 1:100 scale documentation and, in several cases, also for 1:50. Statistics on M3C2 distance remain generally within centimetric ranges, indicating good local agreement where surfaces are effectively observed. The study demonstrates that sensor suitability not only depends on the geometric complexity but also on sensor-to-surface distance, visibility, and acquisition geometry, supporting hybrid TLS–PMLS workflows for CH HBIM.

Evaluating RTK GNSS-Assisted Close-Range Photogrammetry for Cultural Heritage Applications without GCPs

Wojciech Ostrowski^1,2, Kinga Węzka¹, Aleksandra Brzozowska-Jawornicka³

¹Warsaw University of Technology, Poland; ²Jagiellonian University, Poland; ³Wrocław University of Science and Technology, Poland

This study examines the potential of RTK GNSS-integrated close-range photogrammetry for documenting cultural heritage without the need for ground control points (GCPs). The research focuses on evaluating the GEOSTIX-X5 GNSS receiver, which enables direct synchronisation with a camera via the flash hot shoe, providing precise time-stamping of image capture events. The case study was conducted at Tomb 8 of the Tombs of the Kings in Paphos, a UNESCO World Heritage Site, and compares two datasets: a conventional photogrammetric survey from 2022 using GCPs and a 2025 survey employing GNSS-assisted photogrammetry. Both terrestrial and UAV imagery were acquired and processed in Agisoft Metashape, with accuracy assessment performed through cloud-to-cloud comparison in CloudCompare. Results indicate that the GNSS-integrated approach achieved single centimetre-level accuracy and no systematic scale errors. The findings demonstrate that RTK GNSS-assisted photogrammetry can significantly reduce fieldwork complexity while maintaining high accuracy, offering a promising alternative for heritage documentation where GCP placement is impractical or undesirable.

Comparative Analysis of UAS Photogrammetric Accuracy: Influence of Flight Altitude on Accuracy and Operational Efficiency in Urban Mapping

Emanuel Amorim, Diogo Inojosa, Jailson Rodrigues Júnior, Eveliny Matias, Leandro França, Simone Sato

Universidade Federal de Pernambuco, Brazil

UAS photogrammetry has become an efficient solution for acquiring high-resolution geospatial data for urban mapping, environmental monitoring, and 3D modelling. However, mission planning still involves a trade-off between data quality and operational efficiency, particularly regarding flight altitude, which directly affects ground sample distance (GSD), point cloud density, and positional accuracy. This study evaluates the influence of flight altitude through a controlled comparison of two urban photogrammetric surveys: a low-altitude flight at 61.2 m (GSD = 1.56 cm/pix, 420 images) and a higher-altitude flight at 121 m (GSD = 3.11 cm/pix, 116 images). Both surveys used RGB cameras with equivalent image resolution mounted on different platforms, which constitutes an experimental limitation, while overlap and processing parameters were kept constant. The results show that the lower-altitude flight produced denser data and better geometric performance, with lower reprojection error and lower check point RMSE. In contrast, the higher-altitude flight provided greater operational efficiency, covering a larger area with fewer images and lower computational demand. These findings indicate that both strategies are technically viable but suited to different objectives: lower altitudes favour geometric detail and positional accuracy, whereas higher altitudes improve productivity and area coverage. Therefore, flight altitude should be selected according to project requirements, balancing geometric quality and operational efficiency.

The concept of metrological validation of active measurement sensors - CENAGIS-MET

Jakub Markiewicz¹, Sławomir Łapiński¹, Adrian Macek¹, Mariusz Wiśniewski²

¹Warsaw University of Technology, Faculty of Geodesy and Cartography, Plac Politechniki 1, 00-661 Warsaw, Poland; ²Central Office of Measures, Ul. Elektoralna 2, 00-139 Warsaw, Poland

Advances in optical measurement technologies have increased demands for accuracy, speed, and automation in coordinate metrology. This contribution introduces CENAGIS-MET, a metrological verification standard developed at the Warsaw University of Technology for assessing active range-based systems such as terrestrial laser scanners (TLS). Unlike traditional calibration fields designed for small ranges, CENAGIS-MET enables evaluation over large measurement areas using modified VDI/VDE guidelines.

The methodology incorporates probing error, sphere-spacing error, and flatness assessment using high-precision ceramic artefacts. Tests were conducted on Leica RTC360, Leica Nova MS60, Z+F 5006h, and a handheld Livox-based scanner (Mendeye). Results show RTC360 and MS60 fully meet the 1/5 relative error criterion, confirming suitability for engineering-grade applications. Z+F 5006h achieves partial compliance, requiring careful configuration, while Mendeye exceeds permissible thresholds, limiting its use to qualitative documentation.

In the full version of the article, broader analyses will be provided regarding the accuracy of 3D shape reconstruction used for the probing error, as well as roughness and planarity assessment for evaluating the overall distribution of the reference plane

Fathom Topo-bathymetric Airborne System for Shoreline Mapping: Preliminary Results

Hongxin Yang¹, Ziwen Kang¹, Wenhan Mi¹, Bingnan Wei¹, Mingming Guo², Dedong Zhang², Jonathan Li²

¹Hinton STAI Institute and Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, Minhang, Shanghai 200241, China; ²Department of Geography and Environmental Management, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada

Accurate topo-bathymetric shoreline mapping and semantic segmentation of remote sensing imagery are fundamental to monitoring dynamic coastal systems, with significant implications for sustainable management, ecological preservation, and climate resilience planning. In vulnerable regions such as Lake Huron, Ontario — where sensitive ecosystems face growing anthropogenic pressures—precise delineation of land-water interfaces enables critical applications including coastal habitat mapping, sediment flux quantification, and erosion vulnerability assessment. This study presents a training-free, open-vocabulary segmentation framework that adapts frozen vision-language models (VLMs) to automatically extract shoreline features from near-infrared (NIR) imagery. By harnessing the inherent semantic reasoning abilities of VLMs, the method achieves accurate segmentation without relying on large, annotated datasets. Extensive evaluation on the Fathom Topo Bathymetric Dataset demonstrates the model's robustness across diverse nearshore environments, highlighting its applicability as a scalable solution for coastal mapping. This research underscores the potential of integrating foundational vision language models into geospatial workflows to enable automated, high-resolution environmental monitoring in data-limited settings.

Enhancing hyperspectral VNIR spatial resolution on the coastal landscape: getting 63 bands at 3 m through the PRISMA VNIR and PlanetScope Dove-R fusion

Antoine Collin^1,2, Laurine Nogue^1,2, Manuel Salgueiro Simon³, Gwenal Hervouët⁴, Tony Durozier⁵

¹Coastal GeoEcology Lab, EPHE-PSL University, France; ²Laboratory of Biology of Aquatic Organisms and Ecosystems, France; ³Laboratory of Geoarchitecture – University of Western Brittany, France; ⁴Délégation Bretagne, Conservatoire du Littoral, France; ⁵Délégation Normandie, Conservatoire du Littoral, France

The coastal zones consist of the interfaces between land and sea, undergoing the mobility of the shoreline at an unprecedented pace over the last centuries. Such a trajectory, at the global scale, exacerbates the coastal risks (intersecting hazards, exposure and vulnerability), calling upon a scalable methodology to ensure the precise and accurate monitoring. One of the observation solutions resides in the satellite platform provided with the finest spatial and spectral resolutions. Because remote sensing is a science of trade-offs, no sensors can be both excellent in spatial and spectral specificities.

We propose an original research study to create an imagery endowed with both high spatial and spectral characteristics, purposed to classify a representative coastal zone (12 habitat classes) in a temperate area in Brittany, France. The methodology highlights a transferable fusion procedure based on the simultaneous acquisition (10-min difference) of the 30-m hyperspectral PRISMA satellite imagery and the 3-m PlanetScope (Dove-R) imagery, made possible given the very high temporal resolution of the PlanetScope constellation. The spatial resolution of the hyperspectral PRISMA imagery, in the visible and near-infrared spectrum (63 bands), was successfully upscaled at 3 m, using a bandwise linear prediction from the 4 PlanetScope Dove-R bands (collected at 3 m). The model residuals showed that the pansharpened PRISMA imagery (5 m) was better enhanced (absolute deviation of 0,011) than the original PRISMA imagery (30 m, absolute deviation of 0,015). Seawater and mudflat were the best habitats upscaled, whereas the road and the roof were the worst classes predicted.

Calibration and Georeferencing for Consumer - Tesla Model Y (HW4) Video Mapping

Rami Tamimi, Charles Toth

The Ohio State University, United States of America

The evolution of mapping platforms has followed a consistent pattern: professional instruments are complemented by consumer devices that trade precision for scalability. Unmanned aerial systems transformed aerial photogrammetry by making it accessible beyond traditional aircraft, and smartphones equipped with RTK have demonstrated viable terrestrial mapping. This paper extends that progression to vehicle-based mapping by presenting SurveyXR, a web-based calibration and georeferencing framework that converts consumer vehicle dashcam video into photogrammetry-ready georeferenced imagery. The system addresses two technical problems: determining the geometric relationship between uncalibrated consumer cameras and a known navigation trajectory and producing per-frame exterior orientation parameters suitable for Structure-from-Motion processing. This pipeline implements checkerboard-based intrinsic calibration with automated quality diagnostics, Perspective-n-Point exterior orientation solving, and GNSS-synchronized frame extraction with lever arm correction. All computation runs in a browser or lightweight cloud backend, requiring no local software installation. The framework was tested on a 2026 Tesla Model Y equipped with PPK GNSS on the Ohio State University campus. Georeferenced frames were verified against the GNSS trajectory, confirming correct spatial positioning. The paper documents the calibration methodology, time synchronization model, and coordinate geometry, and discusses error sources and the path toward quantitative accuracy validation.

Insights into the PAS Pana Stitching Algorithm

Richard Ladstaedter, Karlheinz Gutjahr, Erich Riegler, Roland Perko

Joanneum Research Forschungsgesellschaft mbH, Austria

In this paper, we describe a modern, efficient, accurate and reliable stitching algorithm that JOANNEUM RESEARCH (JR) has developed for the PhaseOne PAS Pana multi-camera system. We present a new "constraint" projective transformation (CPT) approach, reducing the eight parameters of a standard projective transformation to only six, physically meanigfull parameters: Correction scale, parallax in x- and y-direction and three relative orientation angles. Based on the CPT, tie point measurements of all available image overlaps (NIR/NIR, RGB/NIR and RGB/RGB) are adjusted simultaneously within a common virtual image plane. As the CPT contains no over-parametrization any more for modelling the relative orientation of the (calibrated) camera modules we expect a more accurate and stable stitching result which will be evaluated by analysing the stitching parameters of consecutive PAS Pana shots of a flight line.

Lidar-Camera Integration for High Precision Airborne Mapping

Bernhard Schachinger¹, Mohamed Mostafa², Nico Jager², Armin Borovcnik¹, Jonathan Hall Riaza¹, Marc Muick¹

¹Vexcel Imaging GmbH, Austria; ²Trimble Applanix

This paper presents tests of a new fully integrated multi-sensor airborne system that comprises LiDAR, multiple cameras, inertial measuring unit (IMU), GNSS, and their associated software for data acquisition, processing, integration, calibration, and map production. The technical analysis presented in this paper focuses on multi-sensor system integration that statistically addresses a multi-stream of LiDAR ranges, pixels from multiple cameras, position and orientation of each LiDAR range and each photo center derived from the GNSS/IMU trajectory. The impact of processing the trajectory in two different ways, namely: Post-Processed Kinematic (PPK using Single Base Station) and Trimble Post-Processed RTX (PP-RTX) is evaluated. Real-world data sets acquired with the Vexcel UltraCam Dragon in Austria and USA are used in this paper to address system performance in a real-world environment. Test results confirm the suitability of both approaches for trajectory processing, Single Base and PP-RTX, as well as the consistent positional accuracy of georeferencing solutions for imagery and lidar.

Radiometric features and ground processing for high-resolution Earth observation satellites Bayer matrix-based images like CO3D

Sylvain Lucas¹, Olivier Amram², Christophe Latry¹, Colin Thomas³, Marc Schillinger³

¹Centre National d'Etudes Spatiales (CNES), France; ²Magellium Artal Group, France; ³Airbus Defense and Space, France

Matrix detectors and colour filters arrays are more widely used for satellites and rover missions in the past years. Recently, four CO3D (from “Constellation Optique 3D” in french) satellites equipped with COTS matrix Bayer sensor were launched and calibrated. Both the sensor sampling distinctive features and the new Step & Stare guidance mode are leading to new calibration and processing paradigms. In this paper, we delve into techniques dedicated for such Bayer matrix-based system, mainly but not limited to high-resolution (HR) Earth-observation (EO) satellite missions. We first describe dedicated techniques for in-orbit radiometric performance assessment like signal-to-noise ratio (SNR) and modulation transfer function (MTF). Then we address ground processing dedicated to Bayer acquisitions. Finally, we demonstrate the validity of our approach with CO3D in-orbit measurements. We also apply the radiometric ground processing on real images and provide a comparison with Pléiades-HR imagery, demonstrating the many benefits of the CO3D mission and all its novelties. CO3D in-orbit testing (IoT) is still ongoing eight months after launch, the in-flight performances are not presented in this paper due to confidentiality agreement.

Relative Accuracy Evaluation of UAV Photogrammetry for Drifting Arctic Sea Ice

Zhiqi He¹, Shuhang Zhang^1,2, Daikun Yang¹, Wuming Zhang^1,2

¹School of Geospatial Engineering and Science, Sun-Yatsen University, Zhuhai, China; ²Key Laboratory of Comprehensive Observation of Polar Environment (Sun Yat-sen University), Ministry of Education, China

This study presents a systematic evaluation of the relative geometric accuracy of UAV photogrammetry over drifting Arctic sea ice, addressing critical challenges posed by textureless surfaces and dynamic motion. Utilizing data from 18 shipborne UAV flights during the FACE2024 expedition, the research quantifies the impact of sea ice drift on orthomosaic horizontal accuracy. A methodological framework is established that incorporates shipborne GNSS data for drift correction, aligning image positions to a common reference frame under the assumption of consistent icebreaker–ice motion. Accuracy assessment is performed using onboard control and check lines of known lengths, enabling reliable relative error measurement without traditional ground control points (GCPs), which are infeasible on drifting ice.

Results demonstrate that drift velocity and total drift distance have a strong positive correlation with root mean square error (RMSE) before correction (r = 0.70 and r = 0.79, respectively), while the flight-drift angle has minimal influence (r = –0.13). The application of ship-position-based drift correction significantly improves accuracy, reducing RMSE by an average of 0.23 m and achieving a high relative accuracy of approximately 10 cm for imagery with 2–4 cm ground sampling distance. The use of control lines alone also substantially enhances results. This work validates the efficacy of drift correction and provides practical guidance for mission planning and data processing, confirming that standard UAVs and commercial photogrammetric software can produce reliable results in challenging polar environments when appropriate corrections are applied.

Radiometric Intercalibration Methodologies for High-Resolution Satellite Imagery in Precision Agriculture

Christian Massimiliano Baldin, Vittorio Marco Casella

Università degli Studi di Pavia, Italy

This paper examines how to align PlanetScope and Sentinel-2 vegetation indices, focusing on the Normalized Difference Red Edge (NDRE) index, which is commonly used in precision agriculture for prescription maps. While Sentinel-2 is popular for crop monitoring, its low spatial resolution limits use in small or irregular fields. PlanetScope provides higher-resolution, more frequent imagery, but its sensor differs from the Sentinel-2, limiting compatibility with current research and tools. By testing three adjustment methods, the study shows that it is possible to align PlanetScope NDRE values with Sentinel-2: M1 (Linear Regression + Histogram Shifting + Histogram Matching), M2 (Histogram Matching), and M3 (per-band linear regression before index calculation). Two dates from 2022 were selected as representative seasonal extremes from the broader 2021–2023 dataset of 56 image pairs (Baldin, 2025), which was further analyzed through time-series methods. Resampling direction (PS→10 m, S2→3 m) minimally affects RMSE/MAE but significantly alters spatial structure and Moran’s I values; downscaling PS to 10 m decreases Moran’s I. M2 is suitable for standard applications, whereas M3 is preferable when preservation of spatial structure is important. Across the four examined scenarios, all methods reduce RMSE below the 0.07 agronomic threshold, with calibrated RMSE ranging from 0.02 to 0.05 (up to 0.06 across the full 56-pair dataset). M3’s advantage lies in how effectively it reduces spatial autocorrelation mismatch: a 43.4% reduction in Moran’s I (versus ~18.2% with M1 and M2) in the four example scenarios, and 39.5% versus 28.4% (M1) and 28.2% (M2) reduction over the full dataset.

Integrated Airborne Sensor System for MWIR–Aerial Camera–GNSS/IMU Synergy in High-Resolution Remote Sensing

Yanning Wang

Beijing University of Civil Engineering and Architecture, China, People's Republic of

This study introduces an integrated airborne sensor system that combines mid-wave infrared (MWIR) imaging, a high-resolution aerial camera, and GNSS/IMU navigation for all-day, high-precision remote sensing. The MWIR subsystem adopts a Frame-scanning mechanism to achieve wide-swath and efficient thermal data acquisition. A unified calibration and synchronization framework was developed to ensure temporal and spatial consistency among sensors, including precise time synchronization, lever-arm and boresight calibration, and radiometric correction. The refined GNSS/IMU trajectory supports accurate co-registration between MWIR and optical imagery. Field experiments in China demonstrated stable system performance and consistent geometric–radiometric alignment under various illumination conditions. The integrated dataset enables detailed thermal–optical reconstruction, revealing thermal features and material contrasts not observable in visible imagery. The system supports applications such as infrastructure inspection, environmental monitoring, and emergency response. With its compact structure and modular design, the proposed platform provides a practical reference for next-generation airborne sensor integration and real-time data fusion in high-resolution mapping missions.

An Integrated Multi-Mode Imaging Task Scheduling Framework for Remote Sensing Satellite in Diverse Observation Scenarios

Wei Yao¹, Xin Shen¹, Guo Zhang¹, Zezhong Lu², Jiaying Wang¹

¹State Key Laboratory of information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China; ²Urban and Environment Sciences, Hubei Normal University, Huangshi, China

Existing satellite mission-planning algorithms are primarily designed for homogeneous single-payload constellations, making them insufficient for coordinating heterogeneous satellites such as optical and SAR systems. Moreover, most current approaches rely on highly abstract task models that neglect the fact that a single satellite may operate under multiple observation modes, each imposing distinct constraints on geometry, attitude maneuvering, and resource utilization. In addition, few studies have addressed the integrated scheduling of point-target and area-target missions, which is essential for scenarios combining discrete and continuous observation demands.

This study proposes an integrated scheduling algorithm for multi-mode, multi-scenario, and multi-task Earth-observation constellations. The algorithm formulates mission planning as a unified spatiotemporal optimization problem, jointly considering visibility, sensor compatibility, attitude feasibility, and onboard resource limits. A CDCL-enhanced constraint-programming solver is employed to enable coordinated scheduling across different observation modes and target types.

Experimental validation on hydropower and disaster-monitoring scenarios shows that the proposed method significantly improves coverage, cross-sensor synergy, and responsiveness compared with traditional homogeneous schedulers. The results establish a new paradigm for integrated and intelligent mission planning of heterogeneous, multi-mode satellite constellations.

UAV Visual Localization in GNSS‑Denied Environments

Tai-Cyuan Wang¹, Lai-Han Tsou¹, Jyun-Ping Jhan¹, Ru-Shan Tang²

¹NTUST, Chinese Taipei; ²NCSIST, Chinese Taipei

Navigating Unmanned Aerial Vehicles (UAVs) in Global Navigation Satellite System (GNSS)-denied environments requires reliable autonomous localization techniques. This study proposes a vision-based localization framework utilizing satellite true orthophotos and Digital Surface Models (DSMs) as absolute geospatial references. The algorithmic pipeline integrates deep learning architectures—specifically SuperPoint and LightGlue—to establish robust image-to-map feature correspondences. The matched correspondences are used to estimate camera exterior orientation parameters through collinearity-based spatial resection with an Iteratively Reweighted Least Squares (IRLS) approach. To validate the proposed methodology, a multi-altitude dataset (100–250 m) was acquired across structurally diverse terrains, including dense building, high vegetation, and bare ground areas. Experimental evaluations demonstrate that the framework achieves meter-level absolute positioning accuracy and stable pose estimation. Analyses further reveal that matching robustness and localization success rates depend heavily on terrain texture and flight altitude; geometrically structured urban scenes at moderate-to-high altitudes consistently yield reliable correspondences, whereas low-texture environments and lower flight altitudes present persistent challenges for continuous visual tracking.

Geometric and Visual SLAM: The accuracy of modern handheld LiDAR scanners

Christoph Strecha¹, Ryan Hughes¹, Andrea Dotti¹, Davide Antonio Cucci¹, Marta Coelho Lopes², Pierangelo Rothenbühler¹

¹Pix4D SA, Switzerland; ²École Polytechnique Fédérale de Lausanne (EPFL), Switzerland

Recent handheld scanners increasingly integrate geometric (LiDAR-based) and visual (image-based) SLAM (Simultaneous Localization And Mapping), promising low-cost and flexible solutions for surveying tasks. This paper evaluates the accuracy of three such systems: the XGRIDS Lixel K1, the SHARE S20, and a Pix4D solution pairing an iPhone Pro with an Emlid Reach RX GNSS (Global Navigation Satellite System) antenna. We conducted experiments in two distinct environments: Scene 1, with continuous, high-quality RTK (Real-Time Kinematic) coverage, and Scene 2, which included an indoor trajectory resulting in a temporary loss of the RTK fix. Accuracy was validated against independent GNSS check points. In Scene 1, the Pix4D solution delivered survey-grade results, achieving a RMSE (Root Mean Square Error) below $3~\text{cm}$ in the $X, Y$, and $Z$ directions. The XGRIDS and SHARE scanners yielded larger maximum errors, around $10~\text{to }15~\text{cm}$. In Scene 2, accuracy degraded; the Pix4D solution's maximum error increased to approximately $12~\text{cm}$ , while the Share S20's maximum error exceeded $25~\text{cm}$. We conclude that while the fusion of visual and geometric SLAM is powerful, a stable RTK fix remains critical for achieving consistent survey-grade accuracy with current low-cost handheld scanners

An integrated workflow for urban tree DBH estimation from handheld mobile laser scanning (HMLS) data

George-Catalin Males¹, Valeria-Ersilia Oniga^1,2, Iosif Lavric¹, Norbert Pfeifer³, Markus Hollaus³

¹Technical University of Civil Engineering Bucharest, Romania; ²quot;Gheorghe Asachi" Technical University of Iasi, Romania; ³Technische Universität Wien, Austria

Stem diameter is a key parameter for assessing woody vegetation growth and its ecological and economic benefits, including biomass production, carbon sequestration, and urban ecosystem services. Recent advances in handheld mobile laser scanning (HMLS) enable efficient acquisition of high-density point clouds for deriving tree structural attributes in complex environments. This study presents an automated workflow for tree detection and diameter at breast height (DBH) estimation in an urban park, using two HMLS systems: the GoSLAM RS100i and the FJD Trion S1. The influence of point cloud density and subsampling resolution (2 - 4 cm) on detection and accuracy was evaluated. Reference data for 69 trees were collected using a forestry caliper and total station, while HMLS datasets were georeferenced with RTK-GNSS. The workflow included point cloud filtering, terrain modelling, stem extraction, and DBH estimation through cylindrical fitting. Detection performance differed between systems and was strongly affected by point density. The GoSLAM RS100i detection rate decreased from 97.1% at 2 cm to 53.6% at 4 cm spacing, whereas the FJD Trion S1 maintained stable performance (~87%) across all resolutions, likely due to higher point density. DBH estimation accuracy was similar for both systems, with RMSE values of 3.3–3.6 cm for filtered data and up to 4.9 cm when all detections were included, alongside a consistent positive bias (1.7–2.5 cm). Subsampling had no significant effect on DBH accuracy, indicating robustness to moderate density reductions. Overall, HMLS systems provide reliable DBH estimates in urban environments, with performance mainly influenced by point cloud quality.

Real-Time Mapping and Planning Intelligent Paths using Optical Lidar and Quadruped Robot

Wen-Yang Chang^1,2, Chen-Hsiang Hung¹, Zheng-Xun Huang³, Hsuan-Jui Chang⁴

¹Department of Mechanical and Computer-Aided Engineering, National Formosa University; ²Smart Machine and Intelligent Manufacturing Research Center, National Formosa University; ³Doctoral Degree Program in Smart Industry Technology Research and Development, National Formosa University; ⁴Department of Bioscience and Biotechnology, National Taiwan Ocean University

In general, the obstacle detection systems mainly rely on depth cameras or AI-based vision approaches; however, these methods are often constrained by limited fields of view and the need for continuous model retraining to adapt to complex and dynamic industrial scenes. To overcome these limitations, this study proposes a LiDAR-based obstacle detection and field monitoring system integrated with a quadruped robot. The proposed system focuses on three main components: real-time field mapping, intelligent path planning with obstacle avoidance, and field change detection. LiDAR point cloud data are pre-processed using pass-through and voxel grid filters, followed by coordinate transformation into the robot reference frame. The Cartographer simultaneous localization and mapping (SLAM) algorithm are employed to generate high-resolution occupancy grid maps for navigation. For autonomous operation, erosion processing and connected component labelling are used to define safe regions, while the A* algorithm enables efficient path planning and adaptive obstacle avoidance in complex environments. To detect unknown obstacles and environmental changes, Gaussian filtering and map differencing are applied, and map similarity is evaluated using histogram analysis and SIFT-based feature matching. Experimental results demonstrated that the system achieves a mapping resolution of 0.05 m and satisfies the Taiwan Association of Information and Communication Standards (TAICS) requirements, including 0.2 m planimetric accuracy and less than 0.1 m positional error. The proposed approach effectively identifies unknown obstacles and visually highlights risk areas, providing a reliable solution for intelligent workplace safety monitoring.

A Multi-Strategy Adaptive Error Modeling and Compensation Method for Star Point Centroid Extraction

Yifan Lou¹, Mi Wang¹, Tiyou Zhou¹, Haonan Zhang¹, Huang Zhang²

¹State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, 430079, China; ²Hubei Subsurface Multi-scale Imaging Key Laboratory, School of Geophysics and Geomatics, China University of Geosciences, Wuhan, 430074, China

Centroid extraction from star images is a critical component in achieving high-precision satellite attitude determination. Prevailing approaches primarily focus on suppressing a single type of error or depend on fixed filtering and compensation parameters, often lacking a multidimensional and fine-grained analysis and handling of diverse error sources. To address these limitations, this paper proposes a compensation method for centroid extraction based on error classification and modeling, coupled with an adaptive strategy selection mechanism to improve accuracy. Experimental results demonstrate the efficacy of the proposed method: on a set of 30 to 300 laboratory-simulated star images, it enhanced the average centroid extraction accuracy from a baseline of 0.31–0.45 pixels to 0.11–0.19 pixels when using a Static model Unscented Kalman Filter (UKF) integrated with four sub-pixel interpolation techniques. Furthermore, for a larger dataset of 300 to 600 star images simulated at a 300 Hz frame rate, the method achieved an accuracy improvement exceeding 50% across five different motion model UKF methods, demonstrating robust performance.

Integration and Intelligent Monitoring Technology System of Space-Air-Ground Remote Sensing and Its Applications

Lirong Liu¹, Shucheng You¹, Weiyi Jiang^2,1, Xinming Tang¹, Ke Liu¹, Zhengyu Luo¹, Xue Li³

¹Land Satellite Remote Sensing Application Center, MNR, Beijing100048,China; ²School of Geography and Ocean Science, Nanjing University, Nanjing 210023,China; ³Changchun Institute of Technology-College of Exploration and Surveying Engineering, Changchun 130021, China

In recent years, Space-Air-Ground sensing data has become increasingly abundant. This paper focuses on the technology system of integrated intelligent sensing in Space-Air-Ground remote sensing, aiming to integrate data from different platforms and sensor types through deep collaboration to meet the growing demand for high-precision, high-frequency, and near-real-time monitoring in scenarios such as land change detection, natural resource development, and disaster emergency response. This paper constructs a technical framework for the integrated intelligent sensing technology system for Space-Air-Ground remote sensing,focusing on core technical methods such as multi-source data governance and correlation, component-based AI interpretation model development, and the construction of application agents based on multi-modal large models.

This study validated the application of an integrated space-air-ground intelligent monitoring system through a typical ecological restoration project monitoring and supervision case. A test area of approximately 5.61 hectares in Hunan Province, China, was selected to construct an ecological restoration monitoring agent. This agent comprehensively utilized multi-temporal satellite imagery spanning eight years, UAV image data, and tower-based videos.Driven by natural language instructions, the agent autonomously planned task chains, coordinated multi-source data, and triggered models. After the implementation of the ecological restoration project, the results showed 4.87 hectares of new grassland and 0.74 hectares of new forest within the area, achieving intelligent identification and quantitative, automated assessment of land cover types, restoration progress, and ecological recovery outcomes. The experiment demonstrated the system’s advantage in "rapid identification and early warning",forming an intelligent operational closed loop of "monitoring-analysis-decision-feedback."

Dynamic Shadow Removal and Quality Assessment of High-Resolution Orthophotos for Pavement Inspection

Yueh-Che Li¹, Juin-Ping Jhan²

¹Graduate Institute of A.I. Cross-disciplinary Technology, NTUST, Taiwan; ²Graduate Institute of A.I. Cross-disciplinary Technology, NTUST, Taiwan

Traditional pavement inspection and data collection are often constrained by traffic conditions, operational safety, and equipment costs, making it difficult to achieve both efficiency and large-scale coverage. To address these limitations, this study employs a Pavement Roughness Index and Distress Extraction System (PRIDEs), which integrates high-resolution industrial cameras, high-precision global navigation satellite system (GNSS), wheel pulse sensors, and an onboard computer to acquire high-quality images under high-speed driving conditions. Using photogrammetry and computer vision techniques, camera poses are reconstructed to generate dense point clouds, digital surface models (DSMs), and orthophotos for detailed pavement distress analysis. However, the acquired imagery is affected by dynamic shadows and lens-focusing induced blur, resulting in ghosting artifacts and inconsistent orthophoto quality. To mitigate these issues, this study proposes a masking strategy during orthophoto generation, where U-Net is employed to detect shadow regions and Laplacian variance is used to identify blurred areas. By integrating these masks, more uniform and higher-quality orthophotos can be produced. Experimental results demonstrate that the proposed approach effectively reduces false positives and false negatives of crack detection caused by shadows and blur, thereby improving the reliability of orthophotos for automated pavement condition assessment.

Enhancing UWB Indoor Positioning using Bias- Aware EKF and Anchor Self-Localization

Shreya Todmal, Salil Goel

Indian Institute of Technology Kanpur, India

Ultra-Wideband (UWB) technology is gaining

attention for indoor positioning due to its high accuracy, low

latency and resilience to interference, making it ideal for

environments where GNSS (Global Navigation Satellite System)

signals are unavailable—such as warehouses, hospitals, and

underground facilities. However, UWB systems can suffer from

reduced accuracy under Non-Line-of-Sight (NLOS) conditions

and dynamic deployments. This paper proposes a novel bias

aware EKF (Extend Kalman Filter) model, combined with

Anchor Self-Localization method for localization in indoor

environments, and enhancing the flexible deployment of

anchors. The proposed model demonstrates an overall

improvement of 32% and 41% in positioning accuracy

compared to traditional methods across both indoor and

outdoor environments respectively. The paper demonstrates the

proposed ASL method, it performs at par with conventional pre

calibrated methods where anchors are to be localized manually.

Together, the Bias-Aware filtering and ASL approach enhance

the scalability and reliability of UWB-based Indoor Positioning

Systems (IPS) for real-world applications.

Geometrical Accuracy Investigations of Handheld 3D Scanners in Comparison: Low-Cost vs. High-End

Thomas Kersten¹, Malte Marsmann¹, Heinz-Jürgen Przybilla²

¹HafenCity University Hamburg, Germany; ²former Bochum University of Applied Sciences, Germany

Handheld 3D scanners have gained increasing importance in recent years due to their flexibility and declining acquisition costs. While high-end systems provide standardized accuracy specifications, affordable devices often lack reliable and comparable benchmarks. This paper evaluates the geometric accuracy of three low-cost handheld 3D scanners (Revopoint Pop 3 Plus, Revopoint MetroX, 3DMakerPro Moose 3D scanner) compared to two high-end systems (Hexagon MARVELSCAN, Hexagon Absolute Arm with AS1 scanner), using the ZEISS Atos 5 structured light system as reference. Five different test objects with varying material and geometric properties were used for practical assessment. Results reveal significant differences regarding flatness, detail fidelity, and robustness: while some low-cost scanners achieve remarkable accuracies, their performance is less stable under varying conditions. High-end systems, in contrast, consistently provide high precision and reproducibility. This study provides a well-founded classification of current handheld 3D scanners and practical guidance for their application in science, industry, and education.

Loose Coupling Modeling of LiDAR-based Localization and SLAM

Chenglu Wen^1,2, Huanjia Zhang^1,2, Cheng Wang^1,2

¹Fujian Key Laboratory of Urban Intelligent Sensing and Computing, Xiamen University, China; ²Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Xiamen University, China

In recent years, LiDAR-based localization has been widely explored. Among them, Scene Coordinate Regression (SCR)-based

methods have demonstrated outstanding accuracy and robustness in city scenes. Integrating these models with traditional Simultan

eous Localization and Mapping (SLAM) methods is expected to enhance localization accuracy and reliability further. This paper

proposes loosely coupled fusion methods integrating an SCR model with SLAM to improve localization accuracy and robustness.

The approach addresses the information loss problem in high-level sensor fusion while maintaining computational efficiency. The

method achieves tighter data association and complementary performance advantages by strategically combining LiDAR-based

localization results with SLAM pose estimates. Experimental results in the NCLT and HeLiPR datasets demonstrate that the pro

posed fusion framework effectively corrects SLAM drift and maintains stable pose estimation accuracy under diverse environmental

conditions. Furthermore, the sparse-frame coupling strategy significantly reduces computational overhead without degrading local

ization performance, making the method suitable for practical applications. The system exhibits improved robustness across regions

and LiDAR configurations while preserving real-time operation capabilities.

Line Of Sight Calibration For Satellite Imagery Based On Matrix Detector

Guillaume Laurent¹, Alice Latourte², Fabrice Buffe², Bruno Vidal²

¹Thales Services Numeriques, France; ²CNES, France

Matrix detector are becoming increasingly common in optical imaging satellite. To maintain good geometric quality of the images, the line of sight (LOS) of each pixel must be known precisely. This paper aims to estimate the performance of our method of LOS calibration on Co3D datas, which requires a precise geometric model for altimetric reconstruction.

A Data-Driven Framework for Structural Crack Identification in 3D Mobile LiDAR Scans Using Deep Learning Classification Models

Yasmine Megahed¹, Omar Elsisi², Ahmed Shaker³

¹Toronto Metropolitan University, Canada; ²Toronto Metropolitan University, Canada; ³Toronto Metropolitan University, Canada

In cold-climate regions like Canada, pavement infrastructure deteriorates rapidly due to extreme freeze-thaw cycles and heavy use of de-icing salts, accelerating the formation of structural cracks and imposing a financial burden on municipal budgets. By providing an automated LiDAR (Light Detection and Ranging)-based detection framework, this research offers a cost-effective, high-precision monitoring tool that enables early intervention, reducing long-term repair costs and enhancing road safety across Canadian provincial networks. This study evaluates the performance of Support Vector Machines (SVMs) and Multi-Layer Perceptrons (MLPs) for crack classification in a multi-dimensional feature space. We propose integrating geometric height (H) with a novel set of radiometric indices, including the Normalized Difference Intensity Index (NDII) and the Green Ratio (GR), to enhance classification stability. Results demonstrate that both SVM and MLP achieved comparable accuracies of 87% and 86%, respectively, in low-dimensional feature spaces. A critical analysis of the MLP learning curves reveals that the introduction of NDII acted as a numerical stabilizer, mitigating the oscillations caused by raw brightness fluctuations. Furthermore, the study identifies an information ceiling, as architectural expansion of the MLP improved convergence stability but did not exceed the 87% accuracy threshold. These findings provide a robust framework for automated road maintenance using stabilized radiometric features in LiDAR-based distress identification.

Integration of multi-source point clouds for bridge inventory – case study

Paulina Jaczewska, Klaudia Pasternak, Anna Fryskowska-Skibniewska

Military University of Technology, Poland

The aim of the study was to propose a procedure enabling accurate mapping of the above water and underwater areas of the bridge. The object of the study was a road bridge located approximately 30 km north of Warsaw, Poland. The bridge is 332 m long and 13.5 m wide. The bridge is located over Lake Zegrze. A mobile topographic Norbit iLIDAR system was used to measure the bridge structure above the water surface. Bridge pillars measurement and the shape of the bottom of the water reservoir in the immediate vicinity of the bridge were performed using a Norbit Winghead i77h multibeam echo sounder. In order to bridge point clouds integration, a workflow has been proposed: LIDAR and MBES data filtering, consideration of the speed of sound in water, LIDAR and MBES data calibration including Patch Test and MBES cross check. As a result, the integrated point cloud of the bridge was created. The LIDAR point cloud resolution was 1 cm and the MBES point cloud resolution was 0.02 m. The created point cloud of bridge provides could be useful for monitoring erosion and accumulation phenomena, analyzing the stability of bridge pillars and verifying hydrodynamic models.

Estimating laser scanner's effective beam shape using line spread function

Omar AbdelGafar¹, Selin Palaz¹, Yihui Yang¹, Derek Lichti², Christoph Holst¹

¹Chair of Engineering Geodesy, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany; ²Department of Geomatics Engineering, University of Calgary, Calgary, Canada

Accurate characterization of the terrestrial laser scanner (TLS) beam footprint is essential for understanding the stochastic behavior of the scanner. Estimating the laser footprint width is crucial for determining the correlation length between neighboring observations, and thus for providing a realistic estimation of the correlation elements within the variance-covariance matrix of the measurement uncertainties. In this contribution, an intensity-based workflow is introduced to estimate the laser beam footprint by deriving the line spread function (LSF) from the edge spread function (ESF). The proposed method applies no assumptions regarding the geometric or physical behavior of the beam, allowing the footprint to be determined directly from the measured intensity values. Using the BOTA8 target, the Z+F Imager 5016A was investigated at two different distances and two scanning rates with varying mirror rotation speeds. The results provide insights into the influence of distance and scanning rate on the laser beam footprint.

Vectorized Grid Detection and Color Rectification for 3D Point Clouds of Photovoltaic Panels

Yuezhen Cai², Ting On Chan¹

¹Sun Yat-sen University, China, People's Republic of; ²Southern Power Grid General Aviation Services Co., Ltd.

UAV-borne PV point clouds often suffer from severe shadow artifacts and color dropouts, limiting their use for reliable inspection and digital twin construction. We introduce a fully vectorized color rectification framework that exploits panel symmetry and 1D signal processing to restore a consistent radiometric appearance. Starting from a segmented solar-panel point cloud, the method first normalizes panel geometry via RANSAC-based plane segmentation and rotation to a canonical xy-plane, then extracts base colors by clustering RGB values to identify “panel blue” and “grid white” regions. It subsequently detects grid parameters by projecting filtered grid and panel points into 1D spatial density histograms along the x- and y-axes to estimate spacing, offset, and grid-line thickness, and finally performs vectorized recoloring and color remapping of grid and panel points using the recovered parameters. By decoupling periodic grid structure from illumination noise, our approach achieves visually near-perfect color restoration while eliminating intra-semantic variance across modules. The resulting high-fidelity, shadow-free point clouds provide a mathematically consistent foundation for PV digital twins and automated asset evaluation.

EarthDaily Constellation: Systematic, AI‑Ready Daily Change Detection Superspectral Visible, Near-Infrared, Shortwave Infrared, and Thermal Mission

William Parkinson, Chris Rampersad

EarthDaily, Canada

EarthDaily Constellation (EDC) is a ten-satellite, sun-synchronous mission optimized for persistent, daily monitoring of global land and designated coastal waters. Each spacecraft carries co-aligned VNIR, SWIR, and TIR imagers and acquires nadir-only imagery at ~10:30 LTAN to stabilize collection for optimal change detection. A systematic acquisition plan builds a global spatiotemporal archive; EarthPipeline performs automated geolocation, orthorectification, atmospheric correction, QA, and wide-area compositing. Bands and metadata are designed for CEOS CARD4L-SR alignment and inter-sensor interoperability with Landsat and Sentinel-2. The talk reports early on-orbit performance—geometric accuracy, radiometric stability—and benchmarking of atmospheric correction and cloud/shadow masking against ESA Sentinel-2 processing, with a focus on time-series consistency for analytics and ML. We also outline specialized applications concepts and readiness.

Plane-based estimation of boresight misalignment of a laser scanning system

Marcela Valle Machado¹, Antonio Maria Garcia Tommaselli¹, Thiago Tiedtke Reis², Leticia Ferrari Castanheiro³, Thaisa Aline Correia Garcia¹

¹São Paulo State University, Brazil; ²T2R Technological Solutions; ³Embrapa Digital Agriculture

This paper presents a static calibration approach for lightweight laser scanning systems, utilising planes as control entities, with a focus on estimating boresight misalignment angles. The calibration with the system static, aims to minimise errors originating from several sources, such as position and attitude systems, time synchronisation, and control features measurement. The mathematical model is based on the plane equation, combined with the equations of laser scanning. The estimation is performed with the combined model of least squares. Experiments in a terrestrial calibration field were performed. The results show that the approach successfully estimates the boresight misalignment angles, reducing the errors of the point cloud with respect to the control planes.

Assessment of SWOT observations based on in-situ measurements for water surface elevation

Rodolfo Alvarado Montero, Alain Pietroniro

University of Calgary, Canada

Monitoring water resources is essential for supporting human activities and enabling informed decision-making. Since its launch in 2022, the Surface Water and Ocean Topography satellite mission has provided global observations of surface water elevation for rivers, lakes and oceans. Several studies have evaluated SWOT performance for ocean applications (Hay et al., 2025, Lichtman et al., 2025) and continental water bodies (Patidar and Indu, 2025). However, no comprehensive assessment has yet focused on Canadian inland waters. This research presents an initial evaluation of SWOT water surface elevation observations using hydrometric stations operated by Water Survey of Canada (WSC). This evaluation covers the period between operational orbit reached in July 2023 and December 2025.

Advancing High-Resolution Earth Observation: GNSS-SAR Imaging with Spaceborne GNSS-Reflectometry Satellites

Yang Yang, Mengyu Ding, Songbo Wu, Xiaolong Mi, Wu Chen

Hong Kong Polytechnic University, Hong Kong S.A.R. (China)

This presentation introduces a novel approach for high-resolution Earth observation using GNSS-SAR imaging with spaceborne GNSS-Reflectometry satellites. By leveraging low-level intermediate frequency (IF) signals from the CYGNSS satellite constellation, our work demonstrates the feasibility of forming GNSS-SAR images from spaceborne GNSS-R data. The integration of advanced weak signal tracking algorithms and tailored SAR image formation techniques enables the retrieval of Earth observation data with unprecedented spatial and temporal resolution. This addresses longstanding challenges in space-based GNSS-R remote sensing, such as limited spatial resolution and weak signal reception. The LEO satellite-based GNSS-SAR approach offers significant advantages, including global coverage, rapid revisit times, and the potential for onboard processing. These features collectively support scalable, near real-time monitoring of dynamic Earth processes, making this technique highly relevant for extreme weather surveillance, disaster preparedness, and environmental monitoring.

A low-cost universal multi-sensor framework for seamless indoor–outdoor 3D mapping in urban environments

Eslam Muhammed, Ahmed Shaker

Toronto Metropolitan University, Canada

This study presents a low-cost LiDAR–IMU–GNSS mapping framework for continuous and globally consistent three-dimensional reconstruction across indoor–outdoor environments. The work addresses a key limitation in current SLAM and GNSS-integrated systems, where LiDAR-based approaches provide strong local geometric accuracy but lack reliable global referencing, while GNSS-based solutions often rely on high-precision corrections such as RTK or PPP, limiting scalability and deployment in urban environments.

Building upon the Dense Multi-Scan Adjustment SLAM (DMSA-SLAM) framework, the proposed system introduces a structured integration of standalone Single Point Positioning (SPP) GNSS through an external alignment strategy, ensuring that global referencing is achieved without compromising locally consistent LiDAR–Inertial geometry. The framework further incorporates explicit multi-level structural constraints to support consistent cross-floor reconstruction, along with a bounded optimization and loop closure strategy that maintains stability and prevents global trajectory deformation without requiring full pose graph optimization.

The system is validated in a multi-storey urban building under challenging GNSS conditions, including complete signal outages and urban canyon effects. Results demonstrate sub-decimeter indoor geometric accuracy and meter-level global georeferencing using low-cost sensors. Comparison with a high-accuracy terrestrial laser scanning (TLS) reference confirms reliable reconstruction quality, while the proposed system achieves rapid mapping in a single continuous trajectory using a significantly lower-cost sensor suite.

Overall, the framework provides a practical and scalable solution for infrastructure-free indoor–outdoor mapping, supporting applications in BIM, digital twins, and urban asset management.

From Imaging Modeling to Field Validation: A Calibration Framework for a Hybrid Solid-state LiDAR System for Small Body Mapping and Navigation

Qingfan Zhang, Huan Xie, Xiongfeng Yan, Yanmin Jin, Yuanting Xi, Xiaohua Tong

College of Surveying and Geo-Informatics, and Shanghai Key Laboratory of Space Mapping and Remote Sensing for Planetary Exploration, Tongji University, Shanghai 200092, China

This contribution presents a comprehensive calibration framework for a hybrid solid-state LiDAR system designed for small body exploration. Integrating imaging modeling, photon-count–based parameter estimation, and multi-scale ground experiments, the method effectively corrects pixel-dependent range and angular errors. Rigorous validation demonstrates centimeter-level accuracy in both mapping and navigation modes, confirming the framework's robustness and its critical role in enhancing deep-space mission capabilities.

Utilization of Thermal and Optical Dataset for Deep Learning based Damage Detection in Heritage Structures of Hauz Khas, Delhi

Om Singh Lilhare, Hina Pande, Poonam S. Tiwari

Indian Institute of Remote Sensing, Dehradun, India

This research introduces a deep learning-based, multi-sensor framework for automated damage detection in cultural heritage structures using fused thermal and optical imagery. Conducted across five historic sites in South Delhi, India, the study targeted common degradation forms—cracks, spalling, and biological growth—through high-resolution image acquisition using a FLIR T1030sc thermal camera and RGB sensors. Fused datasets (MXS and thermal-optical blends) significantly outperformed optical-only inputs, with the YOLOv11-Tuned model achieving a peak mAP of 91.8%. The fusion allowed reliable detection of subsurface anomalies and fine-scale damage often missed by traditional visual inspections. Oriented Bounding Box (OBB) variants improved localization of non-linear features, while genetic algorithm-based hyperparameter tuning enhanced model precision. The framework offers a scalable, non-invasive, and accurate alternative to manual inspection, supporting early diagnostics and long-term conservation planning. This approach demonstrates the transformative potential of AI and remote sensing in preserving architectural heritage against both environmental and anthropogenic threats.

Integration of multi-sensor core scanning data in mineral mapping

Chuiqing Zeng¹, John Manchuk²

¹Technology Development Group, GeologicAI, Toronto, ON M5T 1V7, Canada; ²Management Team, GeologicAI, Calgary, AB T2C 5S9, Canada

Hyperspectral data alone in mineral exploration often suffers from limitations including signal noise, coarse spatial resolution, and spectral variability, which can hinder mineral discrimination. To address these challenges, we integrate Short-Wave Infrared (SWIR) and Visible Near-Infrared (VNIR) hyperspectral data cubes with complementary sensor modalities, including RGB imagery and LiDAR acquired from indoor scans of drilled core. This multi-sensor fusion enhances the reliability and accuracy of mineral maps by leveraging the strengths of each modality.

At GeologicAI, our indoor scanning platform captures multi-modal data from a box of core using a variety of different sensors. A critical preprocessing step involves isolating the drilled core from the background. We further applied a continuous wavelet transform (CWT) for a scalogram analysis enables the differentiation of unclassified spectra based on their frequency-scale characteristics. Following spatial masking and unclassified spectral filtering, we apply a local end-member selection regime utilizing RGB, VNIR and SWIR for all valid pixels. Afterwards, non-negative least squares (NNLS) linear unmixing. While SWIR remains the primary source for mineral identification and abundance calculations, VNIR and RGB data provide critical support in resolving ambiguities either confirming the presence of minerals difficult to detect with SWIR alone or excluding candidates based on VNIR disagreement or RGB colour disagreement. Mineral maps derived from SWIR data exhibit a reconstruction residual error of 12.4%. While the integration of VNIR data does not necessarily reduce this residual, it enhances confidence in abundance estimations, particularly in regions where SWIR alone cannot separate end members.

ForestLayers: an R package to Quantify Forest Vertical Structure from 1D or 3D Vegetation Density Data

Amélie Juckler^1,2,3, Richard A. Fournier¹, Camille Rouet¹, Osvaldo Valeria^2,4, Philippe Lejeune³, Félix Chabot^1,5, Johannie Lemelin¹

¹Department of Applied Geomatics, Centre d’Applications et de Recherches en Télédétection (CARTEL), Université de Sherbrooke, Canada; ²Chaire en aménagement forestier durable UQAT-UQAM, Canada; ³TERRA Teaching and Research Center – Forest Is Life - Gembloux Agro-Bio Tech, Université de Liège, Belgium; ⁴Institut de recherche sur les forêts, Université du Québec en Abitibi-Témiscamingue, Canada; ⁵Department of Computer Science, Université de Sherbrooke, Canada

Quantitatively Evaluate and Optimize the Target Network of the Calibration Field for the Self-Calibration of Terrestrial Laser Scanners

Xu ZHANG, Qingzhou MAO

武汉大学, China, People's Republic of

Calibration of terrestrial laser scanners (TLS) is paramount for ensuring high-precision measurements. The costs and efficiency of calibration pose significant challenges for both instrument manufacturers and end-users conducting self-calibration of TLS systems. To date, there has been a lack of theoretical methods for quantitatively analyzing and optimizing the geometric network of targets within calibration fields. This study proposes the TNet-GDOP (Target Networks Geometric Dilution of Precision) theory and its mathematical model to quantitatively evaluate the impact of target distribution on parameter solution precision. We propose the optimized the target network strategy based on the precision contribution factor of TNet-GDOP (OptimizeTNet-PCF), a target distribution optimization algorithm with a well-defined scoring function. OptimizeTNet-PCF can reduce the number of targets with minimal effect on parameter precision while suppressing anomalous observations. The number of targets was reduced to one-eighth (from 140 to 16), with ranging parameter variations less than 0.1 mm and angular parameter variations less than 0.2″.

The impact of calibration method on point cloud accuracy in shallow water photogrammetry

Paulina Kujawa, Jaroslaw Wajs, Damian Kasza

Department of Geodesy and Geoinformatics, Wrocław University of Science and Technology, Poland

This paper examines the feasibility of calibrating a consumer camera with a calibration panel to accurately reconstruct seabeds in shallow water. Specifically, it assesses whether calibration parameters determined based on the panel can be applied to an independent set of images captured under different conditions. The study also examined the effect of the analyzed approach on the final accuracy of the point cloud. The analysis covered three calibration variants: (1) external calibration based on an underwater panel, (2) preliminary calibration in which the panel parameters were used as initial values for further optimization, and (3) fully automatic autocalibration. The results showed that calibration using the panel does not improve reconstruction quality and can lead to model distortion. The highest accuracy was achieved with in situ autocalibration, supported by underwater control points.

L-band SAR continuity in Japan and it’s applications

Shinichi Sobue

JAXA, Japan

The Advanced Land Observing Satellite-4 (ALOS-4), launched on July 1, 2024, observes the Earth's surface using its onboard Phased Array type L-band Synthetic Aperture Radar (PALSAR-3). Japan has continuously advanced L-band radar technology, and ALOS-4 offers significantly improved observation performance compared to its predecessor, PALSAR-2, aboard ALOS-2, which was launched on May 24, 2014. ALOS-4 is designed to achieve both high spatial resolution and a wider observation swath—expanding the 3 m strip map mode coverage from ALOS-2’s 50 km to 200 km.

By employing this wide-swath observation capability, ALOS-4 can acquire 3 m dual-polarization data over Japan approximately once every two weeks. These frequent observations support disaster management by providing timely information on events such as volcanic activity, land subsidence, and landslides. Moreover, the high-temporal-resolution 3 m dual-polarization data are valuable for a wide range of applications, including agriculture, ocean monitoring, and environmental studies.

To effectively utilize ALOS-4 data, it is essential to integrate it with the long-term archive of ALOS-2 observations, enabling time-series change detection. Maintaining consistent geometric and radiometric quality between ALOS-2 and ALOS-4 data through cross-calibration and validation is therefore critical. This paper presents the results of these efforts and outlines the current use of ALOS-2 and ALOS-4 data under the ALOS-2 Public–Private Partnership (PPP) Phase B activities.

Evaluating the impact of UAV-LiDAR point cloud density on the accuracy of canopy radiative transfer simulations

Takumi Fujiwara¹, Hiroki Naito², Naoko Miura², Fumiki Hosoi²

¹Dept. of Computer Science, National Defense Academy of Japan, Japan; ²Graduate School of Agricultural and Life Sciences,The University of Tokyo, Tokyo

This study investigates how differences in UAV-LiDAR sensor performance affect the accuracy of canopy radiative transfer simulations. Conducted in a Japanese larch forest in Yamanashi, Japan, the research compares two UAV-mounted LiDAR systems—YellowScan Explorer and Voyager—flown over the same plot. The simulation approach uses a voxel-based model to estimate solar irradiance attenuation and reflection, optimizing parameters to match Sentinel-2 NIR reflectance.

Results show that Voyager, which produced over twice the point density of Explorer, achieved a higher correlation with Sentinel-2 data (r = 0.74 vs. r = 0.67). This suggests that higher point density improves upper-canopy representation and enhances simulation accuracy. However, the study also emphasizes the continued importance of complementary ground-based LiDAR (e.g., handheld or TLS) for capturing understory structure.

The findings highlight that UAV-LiDAR is essential for accurate canopy modeling, but sensor specifications—particularly point density—significantly influence radiative transfer outcomes. Future work should explore integrating multiple LiDAR sources and testing scalability across diverse forest types and phenological stages.

Machine learning applications for modeling and mapping soil erosion in tropical regions

Francisco Hélter Fernandes do Amaral¹, Andrés Velastegui-Montoya^2,3, Eder MS de Paula⁴

¹Postgraduate Program in Geography, Federal University of Pará, Belém, Brazil; ²Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University, Guayaquil, Ecuador; ³Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University, Guayaquil, Ecuador; ⁴Faculty of Geography, Federal University of Pará, Belém, Brazil

Soil erosion is a significant threat to ecosystem quality, and the development of accurate models to map erosion susceptibility is essential for enhancing public mitigation policies. This study investigates the applicability of the algorithms Weighted Subspace Random Forest (WSRF), Random Rotation Forest (RRF), and Naive Bayes (NB) to map soil erosion susceptibility in the Rio Pardo watershed, located between the states of São Paulo and Minas Gerais. A total of 120 sample points of erosion and non-erosion sites were used, identified through high-resolution images from Google Earth Pro and field visits. Fifteen conditioning factors were initially considered, but after analyzing multicollinearity and factor relevance, only thirteen were selected for the final modeling. The dataset was randomly divided into 70% for training and 30% for testing to assess the robustness of the models. The performance of the algorithms was evaluated using metrics such as accuracy and AUC-ROC. The accuracies obtained were 0.87 for NB, 0.89 for RRF, and 0.88 for WSRF, while the AUC-ROC values were 0.93, 0.96, and 0.95, respectively. RRF showed the best performance, confirming the usefulness of these models in sustainable management and conservation of areas susceptible to erosion.

Heat Wave and Heat Stress Space-Time Patterns Assessment Using Climate Reanalysis Data and In Situ Measurements

Alberto Vavassori, Matej Žgela, Filippo Bissi, Maria Antonia Brovelli

Dept. of Civil and Environmental Engineering, Politecnico di Milano, Milano, Italy

This work combines in situ measurements of near-surface air temperature with the CMCC VHR-REA_IT climate reanalysis dataset to assess the spatial and temporal dynamics of heat wave (HW) events and evaluate heat stress (HS) conditions across Italy for the period 1981-2024. HWs are characterised in terms of their frequency, duration, and intensity, while HS is evaluated through ad-hoc indices, including Humidex. A trend analysis is performed to investigate the temporal trends of HWs and of hazardous HS conditions. Results indicate a significant increase in the number of HW events alongside a growing frequency of severe thermal discomfort conditions (up to 6 days more per decade). Overall, this work underscores the intensification of heat-related hazards in the study area, emphasising the need for mitigation and adaptation strategies. The ultimate goal is to develop a scalable, open-source methodology that enables continental-scale assessments of heat extremes and their impacts.

Spatio-temporal semantic alignment and standardization of multimodal data in cultural landscape heritage

Shao Tong¹, He Jie²

¹School of Architecture，Tianjin University, China; ²School of Architecture, Harbin Institute of Technology(Shenzhen)

Current Historical Geographic Information (HGI) research faces significant challenges in integrating multi-source heterogeneous data (China Historical GIS Project, 2025). The lack of unified semantic standards, effective interoperability mechanisms, and systematic organization of historical sources has led to severe "data silos." Consequently, a core problem remains: the semantic fragmentation, temporal inconsistency, and disconnected evidence chains of complex cultural landscape data (Southall, 2014). While existing approaches successfully utilize traditional GIS for spatial management or foundational ontologies (e.g., CIDOC CRM) (Bekiari et al., 2024) for static artifact cataloging, they struggle to formalize and compute the dynamic evolution of heritage sites over long historical trajectories.

To overcome these bottlenecks and advance the multidimensional application of cultural landscape heritage data, this study proposes a data organization framework centered on semantic normalization and standardization. Driven by a novel hybrid semantic architecture, we construct an extensible semantic foundation and a multi-source fusion mechanism. This approach seamlessly couples macroscopic cultural landscape heritage event-centric modeling with microscopic temporal annotations, strictly regulated by a "policy–ontology–rules" constraint mechanism.The framework is designed to support computable, searchable, and inferable unified knowledge representations, thereby enabling deep integration of spatio-historical big data, semantic reasoning, and evidence- based decision-making for cultural landscape heritage management.

Rapid identification of components of categorical changes during a time series of maps

Robert Gilmore Pontius Jr¹, Antonio Fonseca¹, Aiyin Zhang¹, Sergio Fagherazzi²

¹Clark University, USA; ²Boston University, USA

This presentation addresses our profession’s need for new methods to identify rapidly the prominent patterns concerning the locations, time intervals, classes, and transitions that account for gross changes during sequential time intervals in a series of maps, as opposed to popular methods that compute merely the sizes of classes at time points. Trajectory Analysis is a method that computes various components of change during a time series for exactly one land cover class. Our method of Change Components Analysis extends the concepts of Trajectory Analysis to present new concepts to address multiple classes using our new free software. Our novel methods are especially effective at identifying where, when, which classes, and which transitions demonstrate suspicious changes that warrant attention to data quality. Our new methods identify also change components that can give insights to landscape processes.

Local pathways of association

Rui Qu¹, Jiao Hu², Yongze Song³, Bing Wang⁴, Yanfang Sun⁵, Yufei Wang⁶, Xueyuan Zhang⁷, Kai Ren⁸

¹School of computer science and technology, Aba Teachers College, Aba Zhou 623002, China; ²Key Laboratory of Poyang Lake Wetland and Watershed Research Ministry of Education, Jiangxi Normal University, Nanchang, 330022, China; ³School of Design and the Built Environment, Curtin University, Perth 6845, Australia; ⁴China National Offshore Oil Research Institute Co., Ltd., Beijing, China; ⁵College of Civil Engineering, Taiyuan University of Technology, Taiyuan, China; ⁶Department of Primary Industries and Regional Development, 1 William St, Perth WA 6000, Australia; ⁷School of Environment and Resources, Taiyuan University of Science and Technology, Taiyuan 030024, China; ⁸State Key Laboratory of Climate System Prediction and Risk Management, Nanjing Normal University, Nanjing 210023, China

Spatial association reveals the interconnected nature of geographical phenomena, describing the interactions and influences of environmental variables across geographic space. Path analysis can explore complex causal relationships between variables by analyzing path coefficients. However, in large-scale studies, path analysis methods are often affected by local effects, which can influence the accuracy and reliability of the results. This study develops a local pathway association (LPA) model to analyze local effects of pathways among variables that integrate path analysis and local pathway coefficient estimations. The LPA model was employed to investigate the spatial heterogeneity of spatial associations between factors such as climate, soil, and vegetation on the Tibetan Plateau. Results indicate that the LPA model effectively reveals the spatial variation characteristics of local path coefficients between geographic variables, avoiding the underestimation or overestimation of global path coefficients in traditional path coefficient studies. The developed LPA model provides an effective technical tool for revealing spatial differences in path associations of large-scale spatial studies. The strong data compatibility of the LPA model allows for broad applicability across various disciplines and a deeper understanding of localized interactions and variations in complex geospatial and Earth systems.

High-Resolution Sub-daily Wildfire Progression Monitoring with MODIS, VIIRS and Sentinel-3 Using Flow-Matching Generative Models

Eric Brune, Yifang Ban

KTH Royal Institute of Technology, Sweden

This Contribution presents a generative Flow Matching Framework for sub-daily Wildfire Progression Monitoring from combined MODIS, VIIRS and Sentinel-3 Observations. The Approach treats all available Multi-Sensor Looks as irregular Samples along an underlying Spatio-temporal Fire Trajectory and learns continuous Vector Fields that map coarse Reflectance Observations to Sentinel-2-like Reflectance and Burned Area Masks. The Input Constellation uses MODIS Bands 1, 2 and 7, VIIRS I1-I5 and Sentinel-3 OLCI Bands Oa08 and Oa17 together with SLSTR Band S6, providing complementary Information in the visible, NIR, SWIR and Thermal Domains as well as staggered Overpass Times. Labels are derived from Sentinel-2 Surface Reflectance and Burned Area Polygons from the National Burned Area Composite as well as additional manually interpreted Fire Perimeters. We expect the learned Model to reconstruct Fire Progression at 3-6 Hour Resolution for many large Events, to improve Burned Area Delineation over single Sensor Baselines, and to provide Ensemble-based Uncertainty Estimates that highlight ambiguous Regions under Smoke or Cloud. The resulting Multi-Sensor Dataset and trained Model are intended as reusable Resources for future Research on Wildfire Monitoring and Data Assimilation.

A new way of interoperability - Implementing a JSON-LD for OGC SensorThings API Standard

Colm Walsh¹, Alexandra Kokkinaki¹, James Clare¹, Shreyash Musale¹, Christopher Thompson¹, Rob Atkinson², Alejandro Villar Fernandez², Markus Wilhelm Jahn²

¹British Oceanographic Data Centre, United Kingdom; ²Open Geospatial Consortium, Germany

This text outlines an approach to achieving practical geospatial data interoperability through incremental, data-driven standardization rather than relying on a single, universal standard. It frames interoperability as an evolving process in which data models, syntactic formats, semantic vocabularies, and protocol bindings are progressively aligned, generating network effects that lower implementation costs. The AMPLIFY-EDS project applies these principles to the end-to-end lifecycle of Near Real Time (NRT) environmental sensor data across the UK Environmental Data Service (EDS). Led by the British Oceanographic Data Centre (BODC), the project establishes a federated API ecosystem using the OGC SensorThings API (STA), integrating multiple MQTT data streams from research vessels and partner data centres. A Python relay application performs ingestion, validation, and quality control before posting data to a FROST server, while a React frontend provides visualisation. Metadata harmonisation required community agreement on minimal entity requirements, vocabularies, and JSON schemas, drawing on schema.org and SOSA. The team then enriched STA outputs by mapping JSON to JSON-LD and creating context files validated through OGC Building Blocks.

Spatiotemporal Prediction of Hourly NO2 concentrations using dynamic DTG data

Yoohyung Joo, Sehoon Oh, Jiwan Hong, Joon Heo

Yonsei University, Korea, Republic of (South Korea)

This study presents a spatiotemporal modeling framework for predicting hourly NO2 concentrations in Seoul by incorporating dynamic vehicle activity data recorded from Digital Tachographs (DTG). Conventional Land Use Regression (LUR) models rely on static spatial predictors and therefore struggle to represent short-term emission dynamics driven by rapidly changing traffic conditions. To overcome this limitation, this research integrates high-frequency DTG variables—vehicle speed, acceleration, braking events, and truck activity—into a dynamic LUR model and evaluates hourly NO2 variability across the urban environment.

Model performance was assessed using panel regression with random effects and hourly time indicators to capture temporal fluctuations at fixed monitoring locations. The DTG-integrated model exhibited substantially improved explanatory power, raising the within R2 from 0.17 in the static baseline to 0.25. The consistent significance of DTG-derived predictors highlights the dominant influence of real-time traffic behavior on short-term pollution levels and confirms the value of incorporating high-resolution mobility data.

Hourly prediction maps revealed strong diurnal patterns, with concentrations lowest at 4 a.m. and highest at 8 p.m., when evening congestion produced values nearly double those of early morning. A LISA cluster analysis further showed that high–high spatial clusters expanded from 17% to 28% of the study area during peak hours, demonstrating increased spatial concentration of pollution. The transition of grid cells between cluster categories also indicated dynamic shifts in spatial patterns throughout the day.

Overall, this study demonstrates that integrating DTG data substantially improves the characterization of hourly pollution dynamics and provides a foundation for time-sensitive, location-specific air-quality management strategies.

Extending CityGML with a Multi-LoD4 ADE for Urban Digital Twins: Geometry Visualization and Semantic Integration of BIM/GIS

Peyman Azari, Songnian Li, Ahmed Shaker

Department of Civil Engineering, Toronto Metropolitan University, Toronto, Ontario, Canada

This research presents a new Multi-LoD4 Application Domain Extension (ADE) for CityGML to improve the integration of Building Information Modelling (BIM) and Geospatial Information Systems (GIS) in Urban Digital Twins (UDTs). The proposed approach extends CityGML’s Level of Detail concept to better represent both exterior and interior elements of buildings while keeping their semantic information complete. It links the geometric model to a graph-based database that stores and connects all building components, allowing users to visualize and query the data interactively in a web environment. The Multi-LoD4 ADE enhances interoperability, semantic richness, and data accessibility, providing a more comprehensive and practical foundation for future UDT applications in areas such as building management and urban analysis.

Adaptive Photovoltaic Panel Detection Pipeline with Deep Learning Adaptive Photovoltaic Panel Detection Pipeline with Deep Learning

Hewerton Alves da Silva, Nicolas Rafael Silva Lucena, Rubens Oliveira da Cunha Junior, Ana Paula de Barros Araújo, Christian Willibald Biehler

Senai Innovation Institute for Information and Communication Technologies (ISI-ICT), Brazil

This work presents an automated and adaptive pipeline for detecting photovoltaic (PV) systems in high-resolution satellite imagery. The solution was developed to support large-scale energy monitoring efforts in the state of Minas Gerais, Brazil, where geographic diversity and visual variability pose significant challenges to accurate PV identification. The proposed pipeline operates from a single pair of geographic coordinates, automatically defining the area of interest, acquiring a basemap image, classifying the spatial context through HSV histograms, UMAP dimensionality reduction, and K-Means clustering, and dynamically selecting the most suitable deep learning segmentation model. Multiple U-Net architectures with different ImageNet-pretrained encoders were evaluated to segment PV panels, and building footprints from public datasets were used to refine detections through geospatial segmentation (SamGeo). Experimental results indicate that model performance varies across environmental contexts, highlighting the importance of context-aware model specialization. Preliminary evaluations show that dynamically assigning models such as ResNet50 and VGG16 to their optimal clusters improves segmentation accuracy. Overall, the proposed methodology demonstrates a modular, scalable, and context-adaptive approach for PV system detection, suitable for integration into urban and energy monitoring platforms.

Spatial and non-spatial clustering of Advanced Producer Services in the United Kingdom

Zhiya Xu¹, Mingshu Wang¹, Ziqi Li²

¹University of Glasgow, UK; ²Florida State University, USA

Clustering methods are widely used in regionalisation research to identify spatial and functional structures within complex economic systems. Yet different clustering specifications can lead to contrasting interpretations of regional patterns. Advanced Producer Services (APS), i.e., specialised, knowledge-intensive business services, provide a useful setting to examine these methodological choices.

This paper develops a framework comparing spatially constrained and unconstrained clustering for delineating APS employment regions in the UK. Spatial methods group neighbouring units to preserve geographic contiguity, while non-spatial methods group areas with similar employment profiles regardless of location. We ask to what extent APS regionalisation follows spatial contiguity versus functional--economic linkages that transcend geography.

Our contribution is twofold. Substantively, we show that APS in the UK form functionally coherent but spatially fragmented regions, challenging planning approaches that assume contiguous blocks of territory. Methodologically, we quantify the trade-off between cluster quality and spatial interpretability, providing a simple diagnostic to guide method choice in regionalisation studies.

Efficient Allocation and Routing of Disaster Responders: Formulation and Validation of a Regional Travel Problem

Toshihiro Osaragi

Institute of Science Tokyo, Japan

Effective disaster response requires rapid allocation of limited human and material resources to dispersed and dynamically changing demands. This study formulates a regional travel problem, an extension of the Multiple Traveling Salesman Problem (mTSP), to optimize the assignment and routing of responders—such as firefighters and volunteers—to affected individuals and facilities. To address the NP-hard nature of the problem, a computationally efficient heuristic is proposed that integrates fuzzy c-means clustering and a genetic algorithm (GA). Responders are first stochastically assigned to demanders based on a composite score combining distance, compatibility, and urgency. Remaining demanders are then optimally allocated using a GA to minimize total travel completion time while balancing workload. The model incorporates three key factors—workload differences, responder–demander compatibility, and urgency—and is implemented as a web-based travel assistance application capable of real-time recalculation when new responders or demanders appear.

Simulation experiments conducted in Setagaya Ward, Tokyo, demonstrated that accounting for workload differences and enabling dynamic recalculation significantly reduced completion time and improved cooperative task efficiency. Field experiments with actual responders verified these findings: the proposed system halved total completion time compared to conventional SNS-based coordination and eliminated route overlaps and missed visits. The results confirm that the proposed model and system enhance operational efficiency and reliability in dynamic disaster environments. This research provides a practical, data-driven foundation for real-time disaster management, with future work focusing on scalability, responder performance calibration, and robustness under disrupted network conditions.

Hybrid Quantum Genetic Algorithm for Hyperparameter Optimization in a Burnscar Segmentation Model

Mahkame Salimi-Moghadam1¹, Sohrab Ganjian¹, Mozhdeh Shahbazi¹, Mikhail Sokolov¹, Barry Sanders²

¹Canada Centre for Mapping and Earth Observation, Natural Resources Canada, Canada; ²Institute of Quantum Science and Technology, University of Calgary, Canada

Hyperparameter tuning is a critical step in training artificial intelligence (AI) models for Earth observation (EO) tasks, as it directly impacts model accuracy, convergence speed, and generalization capacity. Traditional optimization methods such as grid search, random search, and Bayesian optimization often suffer from high computational costs and limited scalability, particularly when applied to complex model architectures and large datasets. Grid and random search scale poorly with dimensionality of the search space and often waste evaluations on unpromising regions of the search space, especially for deep neural networks. Random search improves over grid search but still requires a large number of trials to reliably find good configurations in high-dimensional search spaces. Bayesian optimization methods, while more sample-efficient, typically involve non-trivial surrogate modelling and acquisition optimization steps that add overhead and can struggle with very large, mixed (discrete–continuous) search spaces. These challenges are further amplified in EO applications, where segmentation models are trained on large datasets, making each hyperparameter evaluation computationally expensive and limiting the practicality of purely classical search strategies.

Recent advances in quantum computing have introduced novel paradigms for solving combinatorial optimization problems. Quantum-inspired and hybrid quantum-classical algorithms leverage principles such as superposition and probabilistic amplitude encoding to enhance search efficiency in high-dimensional spaces while benefiting from the strengths of classical algorithms. Building on these concepts, we investigate a Hybrid Quantum Genetic Algorithm (HQGA) for hyperparameter tuning. To evaluate this approach, we apply it to the optimization of a semantic segmentation model specialized for wildfire burnscar detection.

Joint Optimization of Location and Capacity for Spatial Equity of EV Charging Infrastructure : A Case Study in Jeju Island

Geonu Park, Sehoon Oh, Jiwan Hong, Yoohyung Joo, Joon Heo

Department of Civil and Environmental Engineering, Yonsei University, Seoul, Republic of Korea

This study presents a two-stage framework for planning Electric Vehicle (EV) Charging Infrastructure that explicitly targets Spatial Equity on Jeju Island. First, projected 2035 Origin–Destination demand is downscaled to a 500 m grid and evaluated on a routable road graph using a network-based Gaussian Two-Step Floating Catchment Area (G2SFCA) model to produce a high-resolution Accessibility surface. Second, a Quadratic Programming (QP) model jointly optimizes station Location and charger Capacity under a fixed budget by minimizing the demand-weighted variance of Accessibility, thereby reducing disparities across demand cells. Candidate stations are derived from publicly accessible Points of Interest and selected with a coverage-oriented clustering scheme; a greedy loop adds sites that yield the largest marginal reduction in the equity objective, with capacities re-optimized by QP at each step.

The evaluation compares four scenarios—status quo, Location-only, Capacity-only, and the proposed joint optimization—using established equity metrics including the demand-weighted Standard Deviation, Mean Absolute Deviation, Coefficient of Variation, and the Gini coefficient. Although full numerical results are in progress, preliminary simulations indicate that the joint strategy delivers more balanced Accessibility across urban and rural areas than single-focus baselines while maintaining overall service levels. The framework is reproducible, policy-oriented, and transferable to other regions, offering planners a rigorous, data-driven tool to allocate limited public fast-charging resources fairly under future EV uptake.

The "Last Meter" Dilemma: Global Disparities in Accessible Information Labelling of Urban Parks for Wheelchair Users

Luoan Yang, Wei Huang

College of Surveying and Geo-informatics, Tongji University, Shanghai, China

The “last meter” dilemma in urban accessibility refers to the lack of accessible information at the terminal points of public service facilities, which hinders wheelchair users' mobility, even physical infrastructure may be present. This study investigates this dilemma on a global scale, analyzing over 210,000 parks across 100 of the world's most populous cities to quantify how information gaps create real-world barriers.

To quantify these gaps, the study introduces two metrics: Absolute Accessibility Loss (AAL) and Accessibility Gap Ratio (AGR), which measure the additional travel time burden on wheelchair users resulted from the lack of accessible information. The findings show that only 34.9% of parks are labelled as accessible. This disparity has tangible consequences: Wheelchair users must travel farther and spend more time reaching parks labelled as accessible than the general population does to reach any park. The study also reveals a clear global divide, where high-income cities show higher labelling rates and shorter travel times for wheelchair users, while cities in Africa, India, and Southeast Asia exhibit higher disparities

This study furnishes a framework for policymakers, presenting a novel perspective for the assessment of urban equity and a scalable instrument for tracking advancements towards the United Nations Sustainable Development Goals, specifically SDG 11 (Sustainable Cities and Communities) and SDG 10 (Reduced Inequalities).

Advancing Image Geo-localization by Embedding Geospatial Intelligence into Vision-Language Models

Meiliu Wu

University of Glasgow, United Kingdom

Image geo-localization aims to infer where a photograph was taken purely from its visual content. This task underpins applications in navigation, urban analytics, disaster response, and environmental monitoring, but current vision-language models (VLM) are mostly trained on generic web data with little explicit geospatial information. This work develops GeospatialCLIP, a geospatially enhanced VLM that embeds geospatial intelligence directly into CLIP via spatially explicit contrastive learning.

GeospatialCLIP is trained on 180k geotagged image-text pairs spanning street-view imagery, multi-temporal satellite images (2014 and 2023), and OpenStreetMap tiles. Rich captions and spatial context are curated by GPT-4 and experts, describing spatial patterns of objects, land use, urban form, and features that support geo-localization. A spatially explicit text encoder integrates structured tokens with geo-image type and geo-location across scales, enabling a shared geospatial representation space.

Zero-shot global geo-localization experiments evaluate GeospatialCLIP on unseen datasets across geo-locations, scales, and years, and compare it with vanilla CLIP and ResNet backbones. Across city, country and continent levels, GeospatialCLIP consistently improves top-1 accuracy for all imagery types, and its zero-shot performance on street-view images matches few-shot CLIP. The results highlight how embedding geospatial knowledge into VLMs can yield more robust, data-efficient GeoAI models and point towards future geospatial foundation models that better support scientific discovery and real-world decision-making.

Classifying Tourism and geographic Texts using fine-tuned LLMs with Chain-of-Thought Data

Tian Lan, Donglin Cheng, Zhiwei Wu, Zhilin Li

Faculty of Geosciences and Engineering, Southwest Jiaotong University

Tourism and geographic text data is one of the most common data types in spatial analysis, and the classification of such data is an essential preprocessing step to facilitate more in-depth mining of spatial-temporal information. In the past decade, a variety of classification methods for tourism and geographical text data have been developed. These methods established important foundations for automated text analysis, yet their effectiveness has often been constrained by the availability of labelled data and the need for carefully designed feature representations. Recently, large language models (LLMs) demonstrate clear advantages in long-sequence modeling, offering new directions for text classification, particularly for long-form texts. However, employing commercial LLMs poses a significant cost challenge due to the high expense per token, and processing long texts consumes a considerable volume of tokens. In fact, it is feasible to adopt a strategy of locally deploying and fine-tuning open-source large language models that have reduced parameter counts. In this study, we have trained some open-source LLMs with chain-of-thought text. Experimental results show that the highest-performing model (e.g. fine-tuned Qwen3-1.7B) achieves an average accuracy of 95.83%, improving by 4.17% over the baseline RoBerta. Classification results can support tasks such as intelligent tourism recommendations, geographic knowledge construction, and toponym recognition. It may be concluded that the proposed chain-of-thought-guided LLM method can be effectively employed to classify tourism and geographic text data, and LLMs with reduced number of parameters have the potential to solve specific tasks with limited computation resources.

High Spatio-Temporal Resolution Estimation of XCO2 Observations using Spatial Feature Fusion

Yang Hui¹, Cui Liu¹, Feng Gefei²

¹China University of Mining and Technology, China, People's Republic of; ²Jiangsu Normal University, China, People's Republic of

High spatio-temporal resolution estimation of XCO₂ is crucial for accurately quantifying regional carbon sources and sinks. Because XCO₂ variability is influenced not only by local geographic conditions but also by surrounding environmental and meteorological factors, this study proposes an advanced estimation approach that fuses multi-scale spatial features. We develop SpatialFusionNet, a convolution-based module that leverages local spatial association and receptive-field characteristics to integrate meteorological and surface environmental information within a 2.3° × 2.3° grid. This module extracts and fuses spatial feature patterns and subsequently estimates XCO₂ concentrations.

By combining SpatialFusionNet with machine learning methods—Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), and Deep Neural Networks (DNN)—we construct a deep spatial-feature fusion model based on OCO-2 XCO₂ observations over China, CAMS reanalysis data, meteorological variables, and vegetation indicators. Significant performance improvements are achieved: RMSE decreases by 1.297 ppm (SVM), 0.480 ppm (DNN), and 0.200 ppm (XGBoost) in ten-fold cross-validation against OCO-2 trajectory samples. Validation using the TCCON Hefei station yields a correlation of 0.85, demonstrating strong reliability.

Using the DNN combined with SpatialFusionNet, we further generate a seamless annual XCO₂ distribution for China in 2015 and analyze its temporal–spatial characteristics. The proposed framework provides an effective pathway for producing high-resolution XCO₂ datasets and supports fine-scale assessment of regional carbon cycling.

Walking Speed and Climate Resilience: a dynamic Approach to Accessibility for vulnerable urban Populations

Andrea Ajmar, Alexandra Stankulova, Luca Staricco

Interuniversity Department of Regional and Urban Studies and Planning, Politecnico and Università di Torino, Torino, Italy

Urban strategies establishing climate shelters typically delineate service areas using 15-minute walking isochrones, aligning with "chrono-urbanism". However, this practice often relies on the standard walking speed of a healthy, middle-aged male, a simplification that risks significantly overestimating the real accessibility for vulnerable groups, such as the elderly.

This paper presents a dynamic methodology to analyse how accessibility changes when accounting for two crucial factors: age/gender and thermal comfort (heat exposure along the route). The approach uses the Physiologically Equivalent Temperature (PET) index to dynamically adjust walking speed based on environmental conditions and the heightened vulnerability of subjects (represented by a 65-year-old female).

Applied to a case study in Turin, Italy, the results demonstrate a profound accessibility error caused by standard methods. Neglecting the combined effects of age and heat may lead to a 100% overestimation of the actual number of elderly women served. When these factors were integrated, the municipal area covered by shelters plummeted from 35.2% (standard scenario) to only 8.6% (highest stress scenario). Furthermore, the proportion of elderly women considered served dropped drastically from approximately 65% to just over 18%.

These findings confirm that dynamic accessibility calculations are essential for identifying optimal locations for new climate shelters and ensuring effective, equitable adaptation strategies.

Game Engine-Based Urban Tree Digital Twin for visualizing and simulating Carbon Flux

Laura Ikonen, Heikki Kauhanen, Petri Rönnholm, Matti Tapio Vaaja

Department of Built Environment, Aalto University, Finland

This study aimed to develop an easily accessible, interactive digital twin model in Unreal Engine that visualizes urban trees and their carbon flux based on the Metsäkanta tree database, and

simulated carbon sequestration and emissions dataset. The model provides a flexible and automated framework for incorporating additional carbon and tree data for any area. Additionally, it showcases the potential of data-driven game engine visualizations in creating engaging scientific communication for a broader demographic.

Spatio-Temporal Lag Detection for Virtual–Physical Trajectories

Guangsheng Dong¹, Hui Wang³, Shuhan Zhang², Tao Cheng²

¹State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China; ²SpaceTimeLab for Big Data Analytics, Dept.of Civil, Environmental and Geomatic Engineering, University College London, London, UK; ³School of Electronic Information, Wuhan University, Wuhan, China

This contribution presents an exploratory study on the relationship between virtual and physical trajectories in London, with a particular focus on how their spatio-temporal lags evolve under different urban conditions. Virtual trajectories are derived from map tile access logs of OpenStreetMap, while physical trajectories are constructed from anonymised mobile phone data. Both datasets are aggregated to Middle Layer Super Output Areas (MSOAs) for the period from 1 February to 30 April 2020.

We apply a simple rolling-window cross-correlation to each MSOA to monitor, over time, whether virtual activity leads, coincides with or lags behind physical activity. Two case studies illustrate the insights provided by this spatio-temporal lag detection. Around major concerts at the O2 Arena, virtual trajectories consistently lead physical trajectories by approximately 1–3 days, reflecting anticipatory route planning and information searches. Around the first Covid-19 lockdown, the lag landscape reorganises: positive lags become more dominant and their spatial configuration shifts, indicating that virtual activity remains a robust leading signal for constrained but persistent urban mobility.

A Study on Building a Virtual Tribe for Indigenous Peoples Living Away from Their Home Tribe

Sendo Wang

National Taiwan Normal University, Taiwan

A Wikipedia-style collaborative mapping website is proposed in this paper to document, to archive, and to share these TEK. All knowledge articles are contributed by volunteers based on the volunteered geographic information (VGI) concept. The article can be written in the corresponding indigenous language to precisely describe their cultural knowledge. Compare to the Wikipedia, this website is actually a WebGIS. A knowledge article refers to a point, a polyline, or a polygon, which means the knowledge article is georeferenced. The website is composed of open software, such as MySQL, OpenLayers, GeoServer, Drupal and Apache. These indigenous knowledge articles are the source of contents of the Virtual Tribe, the virtual reality of their home tribe. We deployed UAV to take aerial photographs and produced ortho-rectified images and 3D mesh models of the tribe. We also applied 360°panorama camera to take 360°panorama images or videos at important locations when we walk with the elder people around the tribe. Finally, these images, 3D model, and TEK are integrated in the virtual tribe. It’s like a digital twin of the home tribe. Users can explore the tribe and learn TEK from elder people who speaks indigenous language in the 360° panorama video embedded in the virtual tribe.We have cooperated with two high schools in the indigenous countries to build up an immersive virtual reality (iVR) using the TEK articles on the proposed website. The feedback from students is positive and encouraging.

A review of spatiotemporal locust modeling methods under remote sensing–eco-statistical coupling: from Markov approaches to hierarchical Bayesian frameworks

Shuhong Peng^1,2, Jing Qian¹, Jing Wang³

¹Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China, People's Republic of; ²School of Computer Science and Engineering, Huizhou University, China, People's Republic of; ³School of Arts and Design, Huizhou University, China, People's Republic of

Locust outbreaks pose persistent challenges for agriculture and food security due to their pronounced spatiotemporal complexity. Existing monitoring and modelling approaches often struggle with sparse and biased field observations, cloud-affected and discontinuous satellite time series, and the difficulty of fusing heterogeneous data across scales. This paper reviews spatiotemporal locust modelling methods under a unified remote sensing–eco-statistical coupling framework. We first summarize multi-source observational inputs, including optical and microwave remote sensing, reanalysis meteorological data, and ground surveys, and outline common workflows for spatial alignment, temporal aggregation, lag handling, and uncertainty-aware quality control. We then examine three major model families along a coherent pathway from behavioural processes to probabilistic inference: Markov and semi-Markov models for explicit state transitions and duration; hidden Markov and state-space models for representing latent ecological states while correcting observation error; and hierarchical Bayesian spatiotemporal models, including INLA-based implementations, for cross-scale integration and formal uncertainty quantification. Building on this synthesis, we propose practice-oriented principles for model selection that account for state observability, temporal structure, spatial dependence, uncertainty representation, data and computational costs, and interpretability. Finally, we discuss a data–model–decision loop that links probabilistic risk products to operational thresholds, surveillance strategies, and control actions. The review aims to support more robust, transparent, and operationally useful early warning and resource allocation for locust management.

Development of a hash interaction algorithm via urban object information generation based on a variable 3D geohash framework

Hongki Sung, Woochul Choi, Youngjae Jeon, Ckyusoo Chong

Korea Institute of Civil Engineering and Building Technology

Recent increases in extreme climate events and urban accidents highlight the need for urban digital twin technologies capable of real-time monitoring and predictive simulation. However, existing digital twin systems primarily focus on visually realistic three-dimensional representations, which makes large-scale safety simulations computationally expensive due to massive 3D datasets and complex physical models.

To address this limitation, this study proposes a Hash interaction algorithm based on a variable 3D GeoHash framework for generating urban object information and enabling lightweight spatial interaction simulations. The framework extends conventional two-dimensional GeoHash by incorporating elevation to construct hierarchical 3D GeoHash cells that support efficient geocoding of urban objects.

The proposed method consists of four key processes: (1) classification of urban objects into fixed spatial information (e.g., buildings, roads, and terrain) and dynamic spatial information (e.g., weather conditions and moving entities); (2) generation of object-specific attribute information and physical properties; (3) establishment of movement rules between neighboring GeoHash cells; and (4) development of a rule-based inter-Hash interaction algorithm that updates physical state variables through interactions with adjacent cells.

By restricting interaction calculations to neighboring Hash cells, the proposed approach significantly reduces computational complexity while maintaining real-time update capability. The adjustable GeoHash resolution also enables simulations ranging from city-scale environments to centimeter-level spatial detail, supporting lightweight digital twin applications for urban safety management and construction-site monitoring.

Fly with GIS: A GIS-Based Electronic Flight Bag Decision-Support Concept for In-Flight Weather Deviation in the Cockpit

Rockie U Kei Kuok, Zheng Lei

Department of Aviation, School of Engineering, Swinburne University of Technology, Australia

This project proposes a GIS-based decision-support concept integrated within the electronic flight bag (EFB) to assist pilots in tactical in-flight weather deviation under convective conditions. The project addresses a critical gap between experience-driven cockpit decision-making, primarily relying on onboard weather radar imagery, and optimisation-based trajectory planning methods that are typically designed for strategic or air traffic management contexts rather than real-time pilot use.

The proposed framework utilises weather radar-aligned data combined with geospatial layers such as terrain, airways, and traffic to construct a unified operational environment. Within this GIS-based architecture, optimisation techniques (e.g., rapidly-exploring random trees, deep reinforcement learning) are applied to generate feasible and hazard-aware deviation trajectories. These trajectories are presented to pilots as advisory “ghost” flypaths on the EFB, supported by quantitative metrics such as weather clearance, additional track distance, and estimated fuel or time penalties, while maintaining the pilot fully in the decision loop.

Expected outcomes include improved flight efficiency through reductions in track mileage, deviation time, fuel consumption, and enhanced safety margins. Furthermore, the system aims to reduce pilot cognitive workload and stress by externalising complex decision-making processes and providing clear, optimised guidance during time-critical situations.

Overall, the project offers a practical cockpit-deployable solution that bridges weather radar-based situational awareness and advanced optimisation methods, enabling more consistent, data-driven, and operationally robust pilot decision-making.

Spatiotemporal graph network-based method for predicting urban emergency events

Shishuo Xu, Xiaoqian Hu, Fan Xie, Yihua Wang, Jiahui Guo

School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, China

This study proposes a Spatio-Temporal and Semantic Correlation Graph Convolutional Network (STS-GCN) to enhance the prediction of urban emergency events. Addressing the limitation of existing models that fail to fully integrate multi-dimensional correlations, the STS-GCN framework jointly models spatial, temporal, and semantic (categorical) dependencies. The model constructs distinct graphs to represent these relationships, using Graph Convolutional Networks (GCNs) to extract and fuse spatial and semantic features. A Gated Recurrent Unit (GRU) is then employed to capture temporal dynamics.

Trained and validated on a 2015 dataset from the Toronto Police Service—categorizing events into traffic collisions, shootings, robberies, and assaults—the model was evaluated against several baselines. Experimental results demonstrated that the STS-GCN model achieved superior performance, obtaining the lowest RMSE (0.1829) and MAE (0.0023), and the highest Accuracy (0.8705). The study concludes that through effectively learning the complex internal patterns of events through multi-dimensional feature modeling, the proposed framework offers a robust and generalizable tool for accurate urban emergency prediction, with significant potential to support public safety governance and resource allocation.

Research on Collaborative Visual Analysis Method of Mixed Reality Across Geographic Scenarios

Yaqin Sun

China University of Mining and Technology, China, People's Republic of

With the deep integration of geographic information science and human-computer interaction technology, how to support multiple users to cross different physical spaces and collaboratively perceive, analyze, and make decisions on complex geographic phenomena in a unified virtual and real fusion environment has become a cutting-edge challenge in this field. This article proposes a systematic mixed reality collaborative visual analysis method for the collaborative geographic cognition needs across geographic scenarios. The paper first analyzes the core scientific issues of cross geographical scenario collaborative analysis, namely the coupling representation of geographical scenarios and the collaborative aggregation of multi-user cognition. In response to this, we have constructed a four in one theoretical framework of "data model view interaction". The results show that this method can effectively break geographical isolation, build an immersive "co environment" collaborative space, and significantly improve the situational perception ability, communication efficiency, and collaborative decision-making quality of multi domain experts in complex geographical problems. This study not only provides cutting-edge collaborative analysis tools for geographic information science, but also provides important methodological support for interdisciplinary directions such as spatial human-computer interaction and group geographic cognition.

Spatial Distribution Pattern of Elderly Care Facilities in Urban Areas of Beijing from the Perspective of Spatial Accessibility

Dayong Zhang^1,2, Siqiao Chen^1,2, Yanhui Wang^1,2

¹College of Resources Environment and Tourism, Capital Normal University, Beijing 100048, China; ²Key Laboratory of 3Dimensional Information Acquisition and Application, Ministry of Education, Capital Normal University, Beijing 100048, China

This study analyzes the spatial accessibility and distribution patterns of elderly care facilities in six central urban districts of Beijing (Dongcheng, Xicheng, Haidian, Chaoyang, Fengtai, Shijingshan) against the backdrop of rapid population aging. Using methods such as the Two-Step Floating Catchment Area (2SFCA) and kernel density analysis, the research integrates multi-source spatial and socio-economic data. Results reveal an unbalanced spatial distribution of facilities and varying service capacities, with insufficient coverage within a 5-minute travel scope and improved but transport-dependent accessibility within 10 minutes. The study highlights challenges in achieving “nearby elderly care”, particularly in areas like Fengtai District. It recommends optimizing facility layout by repurposing existing spaces in core areas and constructing new facilities in underserved peripheral zones, in line with “community-based” and “home-based” elderly care principles, to better meet the needs of the aging population.

Research on Strengthen the Supervision and Administration of Geographic information Security and Data Governance

Heng Li^1,2

¹National Geomatics Center of China, China, People's Republic of; ²Technology Innovation Center for Geographic Information Public Service, Ministry of Natual Resources, China

As technologies such as intelligent connectivity and artificial intelligence become increasingly mature, and the platform economy evolves at a rapid pace, new products, business formats, and models—including autonomous driving, unmanned driving, and the low-altitude economy—are transitioning from pilot demonstrations to application trials, and beginning to enter widespread practical use on a large scale. The advancement of these new technologies, business formats, and models is driving the "ubiquitization" of surveying and mapping. It has become feasible to illegally obtain large-scale, precise location information of ground features quickly in a short period, which poses severe challenges to the supervision and administration of surveying, mapping, and geographic information security. This paper first introduces geographic information data security technologies, including the classification and grading of geographic information data and the confidentiality processing of geographic information data, among others. Secondly, it designs a geographic information security supervision and data governance model, covering geographic information data application scenarios, the geographic information data circulation control model, the geographic information data circulation security architecture and its applications, etc. Finally, it summarizes the challenges and opportunities faced by geographic information security supervision and data governance.

High-resolution land cover mapping with GeoAI: instance segmentation for land cover analysis

Federica Gerla^1,2, Lindo Nepi³, Mihnea Cățeanu^4,5, Francesca Ferroni^6,7, Khew Ee Hung⁸, Krystian Mocny⁹, Shahla Yadollahi¹⁰, Caterina Balletti¹, Roberto Pierdicca¹¹, Mattia Balestra^11,12

¹CIRCE Laboratory of Cartography and GIS, Department of Architecture and Arts, Università Iuav di Venezia, Dorsoduro 1827, 30123 Venezia, Italy; ²Department of Civil and Environmental Engineering (DICEA), Sapienza Università di Roma, Roma, 00185, Italy; ³Department of Information Engineering (DII), Università Politecnica delle Marche, Ancona, 60131, Italy; ⁴Department of Forest Engineering, Forest Management Planning and Terrestrial Measurements, Faculty of Silviculture and Forest Engineering, Transilvania University of Brașov, Șirul Ludwig van Beethoven 1, 500123 Brașov, Romania; ⁵Faculty of Geodesy, Technical University of Civil Engineering, 020396 Bucharest, Romania; ⁶Department of Environmental Biology, Sapienza Università di Roma, Roma, 00185, Italy; ⁷Mountain Partnership Secretariat, Food and Agriculture Organization of the United Nations (FAO), Rome, Italy; ⁸School of Agriculture, Hokkaido University, 060-0809, Japan; ⁹Department of Geomatics, Institute of Soil Science and Plant Cultivation, Czartoryskich 8 Str. 24-100 Pulawy, Poland; ¹⁰Environment Campus, Liege University, Arlon, 6700, Belgium; ¹¹Department of Civil, Building and Architectural Engineering (DICEA), Università Politecnica delle Marche, Ancona, 60131, Italy; ¹²Department of Agricultural, Food and Environmental Sciences (D3A), Università Politecnica delle Marche, Ancona, 60131, Italy

This study investigates the potential of GeoAI and instance-based segmentation for high-resolution land cover classification in San Vito di Cadore (Veneto), a UNESCO mountain region in the Dolomites characterized by high ecological heterogeneity. The dataset comprises 650 manually annotated orthophotos at a spatial resolution of 0.1–0.5 m, labelled across seven main land cover classes (Forest, Shrubland, Grassland, Cropland, Water Bodies, Artificial/Urban Areas, and Rocky/Bare Areas) and harmonized with Corine Land Cover (CLC) aggregated categories for inter-comparison. Snow and cloud were treated as auxiliary classes given their frequent occurrence in alpine imagery. The YOLOv11 instance-segmentation model was trained on 1000×1000 px tiles, with a SAHI (Slicing Aided Hyper Inference) framework adopted during inference to process large-scale orthophotos without loss of spatial quality. Results show an overall precision of 0.847 and recall of 0.575, with mAP@0.5 exceeding 0.65. Quantitative comparison with the Regione Veneto land cover product (2023) reveals good agreement for the dominant forest class (−1.4%), while the largest discrepancies concern artificial surfaces (+20.8%) and agricultural areas (−36.3%), attributed to differences in spatial scale and training-sample imbalance. The work highlights the advantages of instance-aware deep learning for generating accurate, spatially coherent land cover maps and underlines the growing relevance of GeoAI workflows for environmental monitoring and spatial planning in complex mountain environments.

Wind field risk aware Global Path Planning and Trajectory Optimization in Urban Low-altitude Environments

Sichang Wang, Leyang Zhao, Xiaoming Li, Hang Mei, Lei Han, Yang Zhao, Ding Ma

Research Institute for Smart Cities, School of Architecture and Urban Planning, Shenzhen University, Shenzhen, 518060, PR China

Urban building clusters can significantly alter the low-altitude urban wind field, creating local regions with high wind speed, intense turbulence, and strong non-uniform disturbances. These conditions may cause lateral deviation, attitude instability, increased trajectory error, and even exceed the wind-resistance limits of low-altitude UAVs. To address the lack of explicit modeling of local high-risk wind regions in traditional path planning methods, this paper proposes a wind-risk-aware path planning method, termed RP- A* (Wind Risk-Aware Probability A*). First, a steady-state RANS urban airflow model with the standard k-ε turbulence model is used to obtain the mean wind field and turbulence statistics in the low-altitude flight region. Second, the CFD results are reconstructed on a grid to build a wind risk model consisting of gust risk, crosswind risk, and overall wind-limit-exceedance risk. Third, a direction-dependent integrated risk cost is introduced into the A* search framework, and the RP-A* algorithm is developed to achieve path planning that balances route efficiency and flight safety. Finally, Monte Carlo simulations driven by turbulence-based wind perturbation samples are conducted to estimate the empirical failure rate of planned paths. Results show that RP-A* significantly reduces path failure risk compared with baseline shortest-path methods while requiring only a limited increase in path length. The proposed framework provides an effective approach for safe UAV path planning in complex urban low-altitude environments.

Analysis of Crowd Behaviour Intensity in Historic Urban Areas from the Perspective of Transportation Spatial Pattern: Case Study of Kunming, China

Yi Zhang, Jie Jiang

Beijing University of Civil Engineering and Architecture, China, People's Republic of

The sustainable development of historic districts, aligned with UN SDG 11.4, requires integrated approaches that balance heritage preservation with contemporary urban functionality. This study proposes a novel analytical framework combining Space Syntax theory and Point of Interest (POI) data to address this challenge. Departing from traditional non-hierarchical methods, approach of the study innovatively processes vectorized road networks with a focus on community and block-scale hierarchy, more accurately reflecting human-scale spatial perception and connectivity. This refined Space Syntax model quantitatively analyzes street accessibility and spatial configuration, which is then integrated with categorized POI data to reveal the inherent socio-economic logic and functional distribution within historic urban areas. The framework is empirically validated through a case study of a historic district in Kunming, China. Results demonstrate that this combined methodology provides a comprehensive understanding of the spatial organization, offering data-driven insights to support precise and sustainable conservation and renewal strategies for historic districts.

Recognition and Extraction of Spatial Coordinates in Natural Language Texts Using BERTimbau for Land Document Analysis

Taís de Sousa Pereira, Mauro José Alixandrini Junior, Jorge Ubirajara Pedreira Junior, Vivian de Oliveira Fernandes

UFBA, Brazil

This study addresses the challenge of automatically recognizing and extracting spatial coordinates from unstructured natural language texts, particularly within the domain of land registry documents (fiduciary documents). It proposes a deep learning-based method utilizing the BERTimbau language model, fine-tuned for the Named Entity Recognition (NER) task. This research expands the scope of geoparsing beyond the traditional focus on toponyms, specifically targeting the direct extraction of coordinate data for reliable automation in engineering, land cadastre, and land regularization.

From orthophotos to building footprints over a decade: model inference-based approach for urban densification analysis in Iași, Romania

Loredana-Mariana Crenganis¹, Ana-Maria Loghin¹, Constantin Stoian², Ana-Maria Olteanu-Raimond², Arnaud Le Bris², Juste Raimbault², Julien Perret², Benedicte Bucher², Nicolas Gonthier²

¹quot;Gheorghe Asachi" Technical University of Iasi, Romania; ²Univ. Gustave Eiffel, IGN-ENSG, LaSTIG – Saint-Mande, France

Urban densification in post-socialist cities involves fine-scale spatial transformations that are difficult to quantify in data-scarce environments.

This study proposes FLAIR-HUB2BF, a model inference-based workflow for automated building footprint extraction

and multi-temporal change analysis, applied to the city of Iasi, Romania. The methodology extends the SUBDENSE conceptual

framework by integrating the FLAIR-HUB deep learning model for semantic segmentation of very high resolution aerial orthophotos

from 2011 and 2024, followed by binary mask extraction, instance segmentation, and Douglas–Peucker polygon generalization,

yielding 34,454 and 17,141 georeferenced building footprints, respectively. The approach demonstrates that coherent building

footprint datasets and their temporal evolution can be derived directly from aerial imagery without relying on complete cadastral

databases. To support rigorous evaluation, the first open benchmark building footprint datasets for Romania were produced through

manual photo-interpretation correction across four morphologically distinct urban neighborhoods of Iasi, and assessed against ISO standards

19157 spatial data quality standards, achieving commission and omission rates of 1.95% and 2.39%, respectively. Quantitative

evaluation using complementary GMA (Geometric Matching of areas) and MCA (Multi-Criteria Algorithm) data matching algorithms

confirms moderate-to-high spatial accuracy, with MCA surface agreement rates reaching 91%. The results demonstrate

the capability of the method to capture fine-scale urban transformations, including infill development, brownfield redevelopment,

and peri-urban expansion, while revealing the critical influence of input data quality on segmentation performance. The proposed

workflow establishes a transferable, reproducible, and open methodology for building-level urban monitoring applicable to other

Romanian and European cities facing similar data constraints.

Design and Implementation of an AR System for Real-Time Urban Model Editing and Visualization

Céphas MWIMANGIRE^1,2, Mohammad Hosseingholizadeh², Hamidreza Ostadabbas^1,2, Zhihang Yao¹

¹Centre for Geodesy and Geoinformatics, Stuttgart University of Applied Sciences (HFT Stuttgart), Stuttgart, Germany; ²Geoinformatics Department, die STEG Stadtentwicklung GmbH, Germany

Augmented Reality (AR) offers an immersive medium for visualizing 3D city models directly within physical environments, but current systems lack real-time synchronization with authoritative geospatial databases. This paper suggests an open-source architecture that bridges this gap by enabling bidirectional, standards-compliant communication between AR Microsoft HoloLens 2 frontend and a CityGML-based backend. The system integrates PostgreSQL/PostGIS with 3DCityDB, exposed through a Django API, and connects to AR front-ends such as Microsoft HoloLens 2 via Cesium for Unreal.

Integrating Road Surface Condition Data into OpenDRIVE Models for Autonomous Vehicle Simulations

Vivien Poto, Janos Mate Logo, Viktor Gyozo Horvath, Arpad Barsi

BME Department of Photogrammetry and Geoinformatics, Hungary

This work proposes an extension to the OpenDRIVE standard to represent pavement surface defects, improving the realism of autonomous vehicle simulations and enabling the integration of road condition data from modern mapping and AI-based detection methods.

Spatiotemporal uncertainty of movement data in unstructured geographic areas: Approaches to generate possibility spaces from ship movements

Monika Sester¹, Stefan Fuest¹, Andreas Gollenstede², Claus Brenner¹

¹Institute of Cartography and Geoinformatics, Germany; ²Institute for Applied Photogrammetry and Geoinformatics, Jade University of Applied Sciences, Oldenburg, Germany

In cultural heritage research, one task is to reconstruct historical ship routes; however often exact trajectories are lacking, and thus an accurate itinerary is difficult to reconstruct due to inaccurate or incomplete documentation. The aim of this work is therefore to create spaces of uncertainty as so-called possibility spaces based on the calculation of a geographical extent that attempts to encompass all valid path options.

In order to obtain meaningful possibility spaces, it is important to first define the navigable space and also take into account additional factors such as water depths, currents, wind direction and known historical shipping lanes that may influence a possible route. This information can be used to define the cost for calculating possible routes.

To calculate possibility spaces, different approaches of path planning are explored, such as transferring the navigable space into a routing graph, converting the space into a regular grid, or using an irregular grid in terms of a mesh.

Subsequently, options for deriving the final possibility spaces are described, such as using the explored nodes during the search process (e.g. using A*), or to generate a possibility space by creating a variation of paths by calculating k-shortest paths. Of particular interest is the calculation of paths that have a cost value similar to a predefined acceptable maximum. These paths form the outer boundaries of the possibility space to be created.

High-definition road map generation from mobile mapping data: a case study on the Tangenziale di Napoli

Enrico Breggion¹, Maria Alicandro², Caterina Balletti¹, Nicole Pascucci², Francesco Guerra¹, Andrea Martino¹, Giovanni Pugliano³, Sara Zollini², Donatella Dominici²

¹Università Iuav di Venezia, Italy; ²Università degli studi dell'Aquila, Italy; ³Università degli studi di Napoli Federico II, Italy

High-definition (HD) maps are a key digital infrastructure for connected and autonomous vehicles, especially in highway environments where detailed and reliable road representations are required. This contribution presents an end-to-end workflow for HD road map generation from mobile mapping data, developed within the HD SMART MAP project (PNRR Spoke 7) and applied to a 10 km stretch of the Tangenziale di Napoli.

The survey was carried out with the GAIA M1 Mobile Mapping System, integrating LiDAR, panoramic imagery and GNSS/INS. This configuration enabled the acquisition of dense point clouds and synchronized images even in GNSS-challenging areas such as tunnels. All data were georeferenced in the national reference system RDN2008, with heights referred to the ITALGEO2005 geoid.

The processing pipeline includes point cloud filtering, ground segmentation and DTM generation, as well as the production of an orthophoto of the corridor from panoramic imagery. These products support the semi-automatic extraction of lane markings and road features, which are then encoded according to the ASAM OpenDRIVE standard.

The resulting HD map provides a geometrically and semantically rich, machine-readable description of the highway, suitable for vehicle localization, path planning and simulation. The case study demonstrates that semi-automated procedures significantly speed up HD map production compared to traditional manual workflows.

Investigating Array Programming for Spatial Operations with Vector Geometries

Balthasar Teuscher, Xuanshu Luo, Paul Walther, Martin Werner

Technical University of Munich, Germany; TUM School of Engineering and Design, Department of Aerospace and Geodesy, Professorship of Big Geospatial Data Management

Vector geometries are traditionally represented as entities of sequences of coordinate structures. With advancements in hardware and software for data analytics, a preference for columnar data layouts arose. This paper examines array programming for spatial operations to evaluate the potential performance benefits of modern computing architectures and emerging spatial data encodings. Evaluating selected operations, such as distance calculation, extent extraction, and affine transformations, indicates similar or improved performance for geometries with columnar coordinate layouts. By leveraging modern compiler infrastructure, we further demonstrate that advanced hardware features in commodity hardware, such as vector instructions, are becoming accessible without specialized code. The performance comparison with established, widely used geospatial and computational libraries reveals significant untapped potential for increasing the efficiency of spatial computing.

Automatic surface extraction and web visualization workflow for large laser scanner point clouds with open-source solutions

Marcello La Guardia¹, Mauro Lo Brutto², Mila Koeva³, Giuseppe Mussumeci¹

¹Department of Engineering, University of Messina, 98158 Messina, Italy; ²Department of Engineering, University of Palermo, Viale delle Scienze, 90128, Palermo, Italy; ³Faculty of Geo-Information Science and Earth Observation, University of Twente, Enschede, The Netherlands

Recent advances in geomatics and 3D surveying have enabled the acquisition of increasingly dense point clouds through both static and mobile laser scanning systems, supporting the rapid digital documentation of built and natural environments. At the same time, the growing diffusion of WebGL technologies has opened new possibilities for the online visualization and dissemination of complex three-dimensional datasets. Within this context, the present study proposes an open-source workflow for the automatic extraction of significant geometric surfaces from laser scanner point clouds and their integration into a web-based visualization framework. The method was developed within a Python-based processing environment and tested on three datasets characterized by different levels of geometric complexity: a regular built environment, an under-construction building environment, and a historical context. The workflow includes point cloud preprocessing, automated segmentation strategies adapted to the geometric complexity of each case, extraction of planar and non-planar elements, polygonal surface generation, mesh construction, and conversion of outputs into lightweight formats suitable for web publication. The final visualization environment combines segmented polygonal models and subsampled point cloud data through open-source WebGL technologies. The results demonstrate that the proposed strategy provides a scalable and flexible solution for the rapid online representation of large laser scanner datasets, supporting surface recognition, low-cost accessibility, and future semantic enrichment within web-based geospatial and Digital Twin applications.

HBIM of the Galleria Grande in the Reggia di Venaria Reale: A Scan–to–BIM Workflow towards Digital Twin Integration

Xiang Li, Filiberto Chiabrando

Politecnico di Torino, Italy

This paper reports progress in the Venaria Reale pilot of the EU Horizon Europe project HERITALISE (2025–2028), using the Galleria Grande as a test case for a preventive-conservation workflow toward digital twin integration. It presents a reproducible Scan-to-BIM workflow for HBIM that delivers a 3D backbone combining geometric reliability, semantic queryability, and source traceability. Multi-sensor datasets, including terrestrial laser scanning, SLAM-based mobile mapping, and UAV photogrammetry, are georeferenced within a unified coordinate framework. A georeferenced UAV–TLS fused point cloud serves as the main modelling baseline, while SLAM data are used only as local infill for verified missing areas. Data management follows a raw-working-deliverable structure with logged parameters, transformation matrices, and quality-control notes. Registration residuals are controlled within 0.01–0.05 m and checked through section-based and distance-based validation in critical junction areas. Geometric modelling adopts a Revit-Rhino workflow guided by structural, semantic, evidential, and feasibility criteria. Semantic enrichment follows the HERITALISE Common Data Environment and BIM Execution Plan, with ObjectID linking HBIM elements to an external SQL database and supporting continuity between legacy and current room naming systems.

Dynamic Landslide Susceptibility Assessment Using Machine Learning Models

Yuya Tanaka¹, Shintaro Goto²

¹Doctoral student, Graduate School of Geo-Environmental Science, Rissho University, Kumagaya, Saitama, Japan; ²Professor, Department of Environmental Systems, Faculty of Geo-Environmental Science, Rissho University, Kumagaya, Saitama, Japan

Landslide susceptibility assessments have traditionally used static rainfall statistics that do not reflect the actual meteorological conditions when slopes fail. This study develops a machine learning framework that aligns high-resolution radar rainfall (XRAIN, 250 m / 10 min) and modeled soil moisture (XSWI) with documented landslide occurrence times as dynamic triggering factors. Applied to the Heavy Rain Event of July 2018 in Hiroshima Prefecture, the framework combines watershed-based spatial cross-validation, systematic comparison of four class imbalance strategies (no treatment, sample weighting, random under-sampling, and SMOTE) across eight algorithms (XGBoost, LightGBM, CatBoost, HGBoost, Random Forest, Balanced Random Forest, Easy Ensemble Classifier, and Logistic Regression), and spatially explicit SHAP interpretation.

Two key findings emerged. First, soil moisture state — not instantaneous rainfall intensity — was the dominant triggering predictor: XSWI variables ranked 2nd and 3rd in importance after slope angle, operating as independent axes (r = 0.074). The no-treatment condition consistently outperformed all resampling strategies across algorithms. Second, spatial SHAP mapping revealed that predisposing factors produce time-invariant contribution patterns governed by terrain, while dynamic triggers produce event-specific patterns reflecting rainfall distribution; their spatial overlap identifies the highest-risk locations. Time-series susceptibility maps confirmed that the framework captures within-event risk evolution as rainfall progresses — a capability unattainable with static approaches. These results indicate that incorporating occurrence-time-aligned soil moisture dynamics substantially improves both the predictive and explanatory capacity of landslide susceptibility assessment.

Improving Sentinel-5P Imagery Usability Through Machine Learning Gap-Filling

Zhanbin Wu, Vasil Yordanov, Maria Antonia Brovelli

Politecnico di Milano, Department of Civil and Environmental Engineering, Piazza Leonardo da Vinci, 32, Milan, Italy

Accurate air quality monitoring depends on continuous satellite observations of key pollutants such as nitrogen dioxide (NO₂) and sulfur dioxide (SO₂) from the Sentinel-5P/TROPOMI mission. However, these datasets often suffer from severe spatio-temporal discontinuities due to cloud cover, surface reflectance, viewing geometry, and strict quality filtering, which limit their reliability for environmental and health-related applications.

This study addresses the challenge of missing data reconstruction over the Po Valley, Northern Italy (2019-2023), an area characterized by complex terrain and frequent winter inversions that amplify data gaps. A comprehensive statistical analysis revealed substantial data loss, averaging 45% for NO₂ and 77% for SO₂, with pronounced seasonality and strong correlations with elevation.

To fill these gaps, an integrated machine learning framework was developed, combining a LightGBM baseline model and a 3D Convolutional Neural Network (3D CNN). The models exploit multi-source predictors, including meteorological variables (ERA5), atmospheric priors (CAMS), topography, land cover, and population density, together with cyclic temporal encoding.

Preliminary results demonstrate that the 3D CNN significantly improves gap reconstruction performance (R² = 0.95 for NO₂, 0.74 for SO₂) compared to the LightGBM baseline, though at higher computational cost. The proposed framework enhances the spatio-temporal continuity and usability of Sentinel-5P data, supporting more reliable environmental monitoring and policy-making in data-sparse conditions. Future work will extend the approach to other pollutants, regions, and deep learning architectures.

Ecuadorian Amazon Deforestation Hotspots Due to Oil Infrastructure Development Over the Last Century

José Agustín García-Romero¹, Isabel Adriana Chuizaca-Espinoza^2,3, Andrés Velastegui-Montoya^1,3, Yasmim Carvalho Guimarães⁴, Luciana das Dores de Jesus da Silva⁵, Fabiola D. Yépez-Rincón⁶, Kevin Rodriguez González⁶, Aylet Vega Aguilar⁶

¹Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ²Earth Observation and Geoinformatics Division, National Institute for Space Research (INPE); ³Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ⁴Graduate Program in Natural Disasters (Unesp /CEMADEN); ⁵Departament of Aquatic Systems, Concepción University, Concepción, Chile; ⁶Universidad Autónoma de Nuevo León, Faculty of Civil Engineering, Department of Geomatics

The Ecuadorian Amazon hosts remarkable biodiversity and cultural richness but faces increasing pressures from the expansion of oil-related activities. This study evaluated the distribution and concentration of deforestation hotspots between 2000 and 2023, analyzing their relationship with existing oil infrastructure and environmentally significant areasLand Use and Land Cover data from MapBiomas Ecuador were combined with Kernel Density Estimation (KDE) analyses based on the spatial distribution of oil blocks, pipelines, wells, and the limits of environmentally sensitive areas. The results indicate a net loss of 391,303 ha (4%) of forest cover, with 80% of the hotspots located within a 10 km radius of hydrocarbon infrastructure. However, intangible zones, protected areas, and water protection zones showed minimal impacts. The findings of this study provide technical evidence to support land-use management and conservation efforts in ecologically vulnerable Amazonian regions.

Leveraging SDGSAT-1 Data for Exploring the Interactions of Nighttime Lights and Human Settlement Structure at High Spatial Resolution

Johannes H. Uhl¹, Martino Pesaresi¹, Michele Melchiorri¹, Panagiotis Politis³, Linlin Lu², Thomas Kemper¹

¹European Commission, Joint Research Centre (JRC), Ispra, Italy; ²Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China; ³European Dynamics Belgium S.A, Brussels, Belgium

Nighttime light (NTL) Earth observation data represent an invaluable resource for measuring population distribution, disaster impact, economic activity, and socio-economic inequalities from space. While traditional NTL data sources provide consistent long-term measurements, they are spatially coarse, impeding spatially detailed analyses of nighttime lights. Novel, high-resolution NTL data from the SDGSAT-1 satellite capture NTL variations across space and time at fine spatial detail of 10 to 40 m and open new research avenues but also require innovative analytical approaches. Herein, we demonstrate the potential of jointly assessing annual SDGSAT-1 composites and human settlement data from the Global Human Settlement Layer (GHSL) and other data, characterizing the built environment, human population distribution, and the rural-urban continuum. We illustrate that such data integration generates new insights on the interactions of nighttime lights and settlement-related characteristics at unprecedented detail. For example, we find that NTL emissions tend to be highest in old parts of settlements (<1975) and lowest in very recently developed land. The brightness of major roads and non-residential areas at night approximately doubles, on average, compared to residential built-up areas. ~80% of urban population resides in areas characterized by luminous, stationary NTL, while this population share drops to ~15% in very rural areas. Looking at infrastructure-related land use, we find that airports emit the highest levels of stationary and non-stationary NTL in our study area. These results illustrate the potential of high-resolution data from SDGSAT-1 and pave the way forward to include such data in settlement and population modelling at scale.

Deep Learning-based underwater mapping of Posidonia Oceanica from satellite data for coastal habitat monitoring

Payam Yari¹, Filiberto Chiabrando², Francesca Gallitto¹, Valeria Longhi¹, Stefania Manca¹, Paolo Maschio¹, Francesca Matrone¹, Andrea Maria Lingua¹

¹Geomatics Lab, Department of Environment, Land and Infrastructure Engineering (DIATI), Politecnico di Torino, Corso Duca degli Abruzzi, 24, Torino (TO), Italy; ²Laboratory of Geomatics for Cultural Heritage (LabG4CH), Department of Architecture and Design (DAD), Politecnico di Torino, Viale Pier Andrea Mattioli, 39, Torino (TO), Italy

The POSEIDON project aims to develop scalable and repeatable approaches for monitoring Posidonia Oceanica (PO) meadows, a key Mediterranean habitat that supports coastal ecosystem services and long-term blue-carbon storage, yet they are increasingly threatened by warming waters and cumulative human pressures. This work presents a satellite-based workflow for benthic habitat mapping that combines Sentinel-2 multispectral imagery, ancillary bathymetry, and deep-learning semantic segmentation. Sentinel-2 Level-2A data and bathymetry were integrated into multi-band inputs on a common 10 m grid, with analysis restricted to water pixels. A wall-to-wall reference map was generated by harmonising existing habitat products into six benthic classes for supervised model training and evaluation. U-Net and DeepLabv3 architectures with a ResNet backbone were tested for a representative September 2015 scene. The workflow was first assessed in the Culuccia peninsula, where it achieved an overall accuracy of 0.830 and a Kappa coefficient of 0.786. It was then successfully transferred to the Capo Testa - Punta Falcone Marine Protected Area (MPA), where the best-performing configuration reached an overall accuracy of 0.882 and a Kappa coefficient of 0.843. These results show that open-access satellite data combined with robust semantic segmentation models can provide a reliable and non-destructive framework for seagrass mapping in complex coastal environments.

A Generative Adversarial Network Framework for Vertiport Location: A Case Study in Toronto

Naghmeh Jafarpournaser, Songnian Li

Toronto Metropolitan University, Canada

Nowadays, with technological advancements and the increasing volume of urban traffic, low-altitude urban air mobility, particularly for time-sensitive trips such as airport travel, has emerged as a promising solution. Vertiports are one of the key components of this novel transportation system, serving as the ground connection points for urban air mobility operations. The optimal placement of vertiports, considering various influencing factors, is critical to the successful implementation of this emerging mode of transportation.

In this study, a data-driven framework is proposed to identify the most suitable areas for vertiport placement in order to facilitate and accelerate access to airports in the City of Toronto. By integrating environmental constraints, population density, ground transportation connectivity, noise impact zones, and regulatory considerations, the framework evaluates land suitability using GIS-based analysis and a deep-learning approach called Generative Adversarial Network (GAN). The proposed methodology can generate a vertiport network by learning nonlinear spatial relationships between multiple spatial layers, without the need for subjective rules, and finally identifying potential vertiport locations with maximum coverage. The results demonstrate two strategically located vertiports for accessing each of Billy Bishop and Pearson airports, situated in commercial, mixed-use, and industrial zones, high-demand areas, and locations near major public transit stations.

Using textureless, low-detailed 3D city models for visual localization

Yasmin Loeper¹, Markus Gerke¹, Phillipp Fanta-Jende²

¹Institute of Geodesy and Photogrammetry, Technische Universität Braunschweig; ²Unit Assistive and Autonomous Systems, Center for Vision, Automation and Control, AIT Austrian Institute of Technology

In this paper, we investigate the use of CityGML data for visual localization. Therefore, we present a visual localization approach that uses CityGML data. We compare different matching approaches for real images and renderings of CityGML data and evaluate our results using query images with accurate ground truth poses. We show that pose estimation is possible with object features of city models. We propose an evaluation of the estimated pose with independent ground truth poses from the reference data.

Indoor Positioning, Wi-Fi, BLE, BIM, Digital Twin, Hybrid Localization

Hossein Barakati¹, Majid Kiavarz¹, Mohammadreza Jelokhani-Niaraki¹, Hamid Kiavarz²

¹Department of Remote Sensing and GIS, Faculty of Geography, University of Tehran, Iran; ²Department of Earth & Space Science & Engineering, York University, Toronto, Canada

Achieving a genuine digital twin for smart buildings requires accurate and real-time knowledge of the spatial positions of users and objects within indoor environments. Despite major advances in Indoor Positioning Systems (IPS), most existing frameworks lack a structured data connection with Building Information Modeling (BIM) which provides the semantic and geometric representation of building elements and spaces. This disconnects limits both real-time synchronization and three-dimensional spatial visualization. This study presents a novel BIM-driven hybrid framework that integrates Wi-Fi and Bluetooth Low Energy (BLE) signal data with BIM to establish a data-centric foundation for digital-twin development. The experimental setup was deployed on the fourth floor of the Faculty of Geography at the University of Tehran, modeled as a BIM-based indoor test environment. Received Signal Strength (RSS) data were collected from 35 reference points (RPs) and seven transmitters (four Wi-Fi access points and three BLE beacons), normalized, and processed using both Fingerprinting and Trilateration models. By incorporating the vertical component (Z) and linking spatial records to BIM entities such as IfcStorey and IfcSpace, user locations were visualized within a 3D building model. The BLE-enhanced Fingerprinting model achieved a substantially lower error (RMSE ≈ 0.40 m) than Trilateration (RMSE = 2.38 m), while the final hybrid model, employing adaptive weighting between sub-models, achieved more than 95 % accuracy within one meter of the ground truth.

These results demonstrate that integrating IPS data with BIM provides a robust foundation for digital-twin creation in smart buildings.

Scenario-based Energy Simulation of Tree Planting Strategies to Reduce the Heating and Cooling Demand of Buildings under 2050 Climate Conditions

Adhisye Rahmawati¹, Weixiao Gao^2,3, Camilo León Sánchez³, Giorgio Agugiaro³

¹Master in Geomatics, Delft University of Technology, The Netherlands; ²Department of Geo-information Science and Remote Sensing, Wageningen University & Research, The Netherlands; ³3D Geoinformation Group, Department of Urbanism, Faculty of Architecture and Built Environment, Delft University of Technology, The Netherlands

Bottom-up, physics-based Urbain Building Energy Modelling (UBEM) approaches enable systematic assessment of building typologies and operational behaviours even when empirical data are limited, providing robust results for district-scale heating and cooling simulations. However, most physics-based UBEM applications have focused chiefly on building-related parameters and have given limited attention to environmental factors such as vegetation, although this element affects building energy demand under changing climate conditions.

To overcome this limitation, the paper will present a simulation-based workflow that evaluates how urban tree-planting strategies influence building heating and cooling demand under current and projected 2050 climate conditions. Specifically, the workflow builds upon the simulation-based UBEM platform SimStadt by embedding vegetation effects directly within a single modelling environment, removing the need for external microclimate coupling or additional simulation tools. Our method is based on standardised CityGML building models, simplified yet seasonally dynamic vegetation representations, and a unified modelling environment that allows consistent comparison of cooling and heating demand under both current and projected climate conditions. This integration allows for the quantitative evaluation of tree-planting strategies for both heating and cooling demand at the district scale.

The paper will present the results of the study carried out in some neighbours of the Dutch city of Rotterdam.

Thermography-based Energy Classification: Integration of Point Cloud Segmentation and Energy Performance Certificates for Urban Energy Modelling

Sebastiano Anselmo¹, Vittorio Scolamiero^1,2, Maria Ferrara³, Piero Boccardo¹

¹Interuniversity Department of Regional and Urban Studies and Planning (DIST), Polytechnic of Torino; ²Department of Civil, Building and Environmental Engineering (DICEA), Sapienza Università di Roma, Via Eudossiana, 18, 00184 Roma; ³Department of Energy, Polytechnic of Torino

Cities are at the core of the current debate on climate change mitigation, and multiple policies on the global and continental scale have acknowledged this condition, pushing towards an increase in the renovation rate and the installation of renewable energy technologies. The revised Energy Performance of Buildings Directive (European Parliament and Council, 2024) targets the renovation of 35 million buildings across Europe, starting from worst-performing buildings. The authoritative tool for the identification of such buildings is the Energy Performance Certificate, the European reference scheme for energy performance in buildings, which covers only a fraction of the European building stock, approximately 30-50%. This study aims to refine a published methodology which takes advantage of aerial thermography and Energy Performance Certificates to attribute an energy class to the whole building stock. The research question is how to classify the building stock, thus making it possible to compute the final energy consumption, by adopting a geospatial approach which considers simultaneously remotely-sensed and metered data. This research considers three main inputs: a thermal point cloud, the building layer of the technical map of the City of Turin, and Energy Performance Certificates.The method is based on the assumption that all buildings have the same indoor temperature. For this reason, the external surface temperature is a proxy of the quality of the envelope, with low-performing buildings having higher thermal losses and therefore an higher external temperature. Then, the class distribution is observed in the Energy Performance Certificates database and replicated in the whole building stock.

Organizing temporally vague Raster Data in Cloud Environments for machine-learning Applications

Tobias Werner, Thomas Brinkhoff

Jade University of Applied Sciences, Germany

Time series are an important source of information about changes in land cover. However, historical raster datasets are often characterized by vague and imprecise temporal properties. We have developed a novel raster data management system designed specifically for machine-learning applications, which organizes temporally vague raster data in cloud environments. The system addresses the challenges of processing historical maps with uncertain temporal attributes. It combines object storage, PostGIS Raster and the Spatio-Temporal Asset Catalogue (STAC) API, enabling the efficient, interoperable management of spatio-temporal raster data. It allows users to define and evaluate vague instants and fuzzy intervals, enabling them to perform precise queries on temporally relevant datasets. This solution is particularly useful for managing databases in a flexible and customizable way, and is ideal for sovereign data management and self-managed infrastructures.

Analysing the Evolution of Kenya’s Road Network since the 1950s using Historical and Contemporary Road Datasets

Tanja Kramm¹, Nicodemus Nyamari², Vincent Moseti³, Annika Klee¹, Leon Vehlken¹, Jing Jing¹, David M. Anderson⁴, Chrisina Bogner^2,5, Georg Bareth¹

¹GIS and Remote Sensing Group, Institute of Geography, University of Cologne, Germany; ²Ecosystem Research Group, Institute of Geography, University of Cologne, Germany; ³Center for Development Research (ZEF), University of Bonn, Germany; ⁴Department of History, University of Warwick, United Kingdom; ⁵Global South Studies Center, University of Cologne, Germany

This study investigates the long-term evolution of Kenya’s road network from 1950 to 2020, highlighting how colonial legacies, post-independence modernization, and contemporary planning have shaped infrastructure development. Using deep learning techniques, roads were extracted from historical topographic maps (1950–1980) and transformed into GIS-compatible data, resulting in a nationwide road dataset of approximately 56,000 km from the mid-20th century. These data were compared with a 2020 dataset from the Kenya Roads Board, which documents over 161,000 km of roads. The analysis reveals that Kenya’s total road length has nearly tripled, and the average distance to the nearest road has decreased from 8.6 km to 5.1 km. However, the road development is uneven across the country: southern and central regions show significant growth, while northern and arid areas remain underserved, reflecting persistent spatial disparities rooted in colonial planning. A regional comparison in southwestern Kenya shows a 56% increase in road length between the 1950s/60s and 1970s/80s, with notable upgrades in road quality. The proportion of paved roads rose from 1.5% to 12%, and tertiary dry-weather roads declined from 64% to 26%. Despite these improvements, only 15–30% of Kenya’s roads are paved today, which is below the continental average of 47%. This study demonstrates the value of integrating historical and contemporary geospatial data to assess infrastructure development, identify gaps, and support planning aligned with Kenya Vision 2030 and the Sustainable Development Goals. The findings underscore the importance of spatial analysis in evaluating development outcomes and guiding future investment strategies.

Spatiotemporal Data Management for subnational Census Data on global Scale

Thomas Brinkhoff

Jade University of Applied Sciences Oldenburg, Germany

Knowledge of the regional distribution of the world’s population is essential for political and social decisions not only but especially for achieving the 17 Sustainable Development Goals (SDGs). Census and other population data at the subnational level are important for this purpose. However, current population data management platforms largely ignore the spatiotemporal nature of census data. Here, we outline the requirements for a spatiotemporal population data management system and present its general architecture, data model and state of implementation. The developed system currently stores population data from approximately 200 countries, nearly 11 million spatial units and around 770 million individual population figures.

A geographic knowledge integrated computation framework based on grid graph modelling

Bing Han^1,2, Tengteng Qu³

¹School of Mathematical Sciences, Peking University, Beijing 100871, China; ²National Engineering Laboratory for Big Data Analysis and Applications, Peking University, Beijing 100871, China; ³School of Mechanics and Engineering Science, Peking University, Beijing 100871, China

Managing dynamic geographic knowledge effectively is hindered by fragmented tools lacking holistic integration, particularly when handling the heterogeneous and evolving nature of real world spatio-temporal data. Traditional knowledge graphs and databases struggle with efficient representation, storage, and reasoning over such complex information. This paper propose an integrated computation framework built upon Grid Graph Modelling to make geography computable. This framework provides an end-to-end solution encompassing knowledge representation, storage, querying, and spatio-temporal reasoning. It synergistically integrates three core components: the Grid Augmented Geographic Knowledge Graph (AugGKG) for unified grid based representation with computable spatial relations; the Grid Graph Database (GGD) for spatially aware storage and efficient grid algebra based computation; and the Grid Neighborhood-based Graph Convolutional Network (GN-GCN) for advanced reasoning by learning from semantic, spatial grid, and temporal dimensions. This cohesive architecture transforms diverse geographic data into actionable knowledge, enabling efficient querying and complex reasoning, paving the way for next generation intelligent geospatial systems, including empowering foundation models, enhancing smart cities, creating digital twins, and reasoning geographic event evolution.

Evaluating the spatial resolution of raster data products

Nicholas Hamm

University of Nottingham, China, People's Republic of

This paper introduces a method to analyze the effect of aggregation on continuous (interval or ratio scale) raster data. Previous research used the entropy based local indicator of spatial association (ELSA) to study the change in the local spatial association this, new paper extends that idea by evaluating both the within and between pixel variability. The standard deviation was used to evaluate the between pixel variability with a decrease in the SD indicating a decrease in the image information content. Ec (diversity) is one part of the ELSA statistic and gives a measure of the within-pixel heterogeneity. We should balance the this within and between-pixel variability when choosing the pixel size for a raster dataset. The variogram was used to explore the change in spatial structure. Current research is refining this method and developing a tool that will support the user to choose the pixel size for mapping.

Current research is following two further avenues. The first is to adapt this method for categorical data with an application in land cover mapping. Second is to build in the effect of predictive uncertainty in the pixel values.

Improving GNSS performance in Location-Based Services through synthetic carrier-phase measurements

Paolo Dabove, Milad Bagheri, Neil Gogoi

Politecnico di Torino, Italy

Carrier phase observations enable millimeter-level GNSS positioning, but their continuity is frequently disrupted by signal blockages and cycle slips. This limitation is particularly critical for low-cost and smartphone receivers, where weak antennas, urban multipath, and duty cycling cause frequent phase gaps that prevent reliable ambiguity resolution. Before addressing the full complexity of mass-market observations, the prediction methodology must be validated under controlled conditions. In this work we investigate whether machine learning, supported by precise satellite orbits and clocks, can predict carrier phase observations during signal gaps with millimeter-level accuracy. Twenty-four hours of Galileo data from the TORI permanent station (SPIN3 network, Torino, Italy) are processed at 30~s sampling using GFZ final SP3 and CLK products. After forming the ionosphere-free combination, an iterative carrier-phase based estimator removes the receiver clock, tropospheric delay, and ambiguity, reducing the residuals to a median standard deviation of 60~mm. Synthetic gaps from 60~s to 1800~s are introduced (1045 gaps total) and four prediction strategies are compared: polynomial fitting (degrees~3 and~5), Fourier-augmented polynomial, Gradient Boosting Regression with satellite geometry features, and Gaussian Process Regression. The Gradient Boosting model achieves the best overall performance, reaching 4.4~mm RMS for 60~s gaps, 9.4~mm for 5~min gaps, and 21~mm for 30~min gaps, well below the half-wavelength threshold required for cycle slip repair. These results demonstrate that geometry-aware gap prediction is feasible at the sub-wavelength level, providing a validated foundation for extending the approach to low-cost and smartphone GNSS receivers.

A Semantic-Spatial Cognition Driven Approach for Indoor Element-Level Layout Rationality Mapping

Mengtao Wang¹, Jianwu Jiang^1,2, Jingwen Li^1,2, Zhizhong Kang³, Bo Song^1,2, Hong Huang¹

¹College of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541004, China; ²Ecological Spatiotemporal Big Data Perception Service Laboratory, Guilin 541004, China; ³School of Land Science and Technology, China University of Geosciences, Beijing 100083, China

Indoor maps are essential for robot services, but the high dynamics of indoor environments caused by human activities lead to frequent layout changes, making it challenging to maintain map accuracy and timeliness. Existing map update methods, such as periodic full reconstruction or event-triggered incremental updates (Prieto-Fernández et al., 2024; Xia et al., 2024), lack a quantitative mechanism to evaluate whether element layouts are sensible. This makes it difficult to predict systematic changes and creates a paradox between "update frequency and element granularity." To overcome these limitations, this study proposes a spatial cognition-driven approach to identify the rationality of indoor element layouts, providing a predictive metric for efficient, layered map updates and enabling advanced robot navigation and safety warnings.

Text-Guided Semantic Segmentation Method for Indoor 3D Point Clouds

Jinyu Tan¹, Juntao Yang¹, Yutao Zhang¹, Sa Li¹, Dandan Liu¹, Zhengwen Wang¹, Xue Zhang²

¹College of Geodesy and Geomatics, Shandong University of Science and Technology, Qingdao 266590, China; ²College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, China

To address limited high-level semantic modeling in indoor 3D point cloud segmentation, this study proposes a text-knowledge-guided framework built upon RandLA-Net. Category-level textual semantic prototypes are constructed through multi-template prompting and encoded by a pre-trained text encoder to provide stable semantic priors. These textual cues are progressively integrated into point cloud feature learning through shallow semantic modulation and high-level cross-modal fusion, enhancing the interaction between geometric representations and semantic knowledge. The network is jointly optimized by segmentation supervision, prototype alignment, and boundary refinement, enabling it to learn discriminative features that preserve local geometric details while encoding richer semantics.

A Coarse-to-Fine Indoor Point Cloud Registration Method Guided by Prior Correspondences

Mengchong Sun¹, Juntao Yang¹, Yutao Zhang¹, Sa Li¹, Dandan Liu¹, Xue Zhang²

Superpoint matching is a critical step in coarse-to-fine point cloud registration, and its performance directly affects the accuracy of sub-sequent point matching and pose estimation. However, most existing methods establish correspondences mainly relying on feature sim-ilarity, without explicit modeling of spatial structure, which easily leads to unstable matching in complex scenarios such as noise, occlu-sion, and low overlap. To address these issues, this paper proposes a coarse-to-fine point cloud registration method guided by prior correspondences. First, prior superpoint correspondences are constructed using rigid transformations estimated by existing SOTA methods, and are serially encoded via a prior encoding module to provide explicit constraints for feature learning. Furthermore, multiple geometric information including pairwise distances, angles, and normals is introduced and uniformly encoded to enhance spatial struc-ture representation. On this basis, a prior-guided sparse mixture-of-experts attention mechanism is designed to differentially model fea-tures in overlapping and non-overlapping regions, thereby improving feature discriminability and structural consistency. Using the learned features, the model gradually establishes correspondences through superpoint matching and point matching, and estimates the final rigid transformation with RANSAC. Experiments on the 3DMatch dataset show that when sampling 1000 point correspondences, the proposed method achieves an inlier ratio of 80.7% and a registration recall of 92.9%, which are 5.5% and 1.1% higher than the base-line method respectively, verifying the effectiveness of the proposed method in terms of accuracy and robustness.

GRACE-Based Long-Term Terrestrial Water Storage Decline in the Susurluk Basin, Türkiye

Ilgin Ozemir¹, Dilek Eren Akyuz², Fusun Balik Sanli¹

¹Yildiz Technical University, Turkiye; ²Istanbul University-Cerrahpasa, Turkiye

Climate change is reshaping the global water cycle, causing substantial alterations in precipitation, evaporation, and runoff patterns. These shifts are driving rapid changes in terrestrial water storage (TWS), which includes groundwater, soil moisture, surface water, snow, and ice. Declining TWS threatens freshwater security, increases the vulnerability of ecosystems and communities, and directly impacts sustainable water management—key concerns addressed under SDG 6 . In parallel, intensifying climate-driven water losses align with the global challenges highlighted in SDG 13, particularly regarding adaptation and resilience.

This study examines long-term TWS variations in the Susurluk Basin of Türkiye’s Marmara Region using NASA’s GRACE and GRACE-FO satellite missions. By measuring gravity anomalies caused by mass changes, GRACE enables the detection of large-scale water storage shifts. Monthly data from 41 GRACE grid points (2002–2022) were processed using the Mann-Kendall trend test at a 5% significance level. Consistent acceptance of the H1 hypothesis and universally negative Z values confirm a statistically significant and persistent decline in TWS across the basin.

Results show that water storage loss accelerated dramatically between 2012 and 2022 compared to 2002–2012. The basin exhibits an overall decreasing coefficient of –0.0561, while sub-basin analyses indicate 20-year average losses ranging from –1.3 cm to –0.1 cm. These findings demonstrate a clear, worsening depletion of water resources, emphasizing the urgent need for climate-adaptive water management. The documented TWS decline underscores the relevance of this work to SDG 6 by highlighting risks to water availability and to SDG 13 through evidence of climate-induced hydrological change.

Digital Detectives of Environment Tackling Cigarette Butt Pollution

İbrahim Topcu, Berk Anbaroglu

Hacettepe University, Turkiye

The aim of this paper is to design and develop an openly accessible, web-based Crowdsourced Geographic Information (CGI) framework, referred to as the Digital Detectives of Environment (DiDE), to facilitate the collection of geo-located events. The framework incorporates three user roles: (i) citizens, (ii) experts, and (iii) supervisors. Citizens can browse relevant events without requiring authentication, while experts are responsible for collecting geographic data, including the optional attachment of photographs or videos. Supervisors, on the other hand, define and manage event types. Each event type is classified as either useful or harmful, which determines its visibility to citizens. The pilot implementation was conducted at the Beytepe Campus of Hacettepe University, focusing on four event types aligned with Green Deal actions: rubbish bins and recycling bins (useful), and cigarette butts and full rubbish/recycling bins (harmful). During a one-week data collection period, a total of 490 events were recorded by 37 students. The results reveal clear clustering patterns in both space and time. Temporally, a large proportion of the data was collected on the final day, indicating a tendency toward procrastination among participants. Spatially, events are concentrated in the southern part of the campus, where most facilities are located. This pattern is further supported by analyses using the F and G functions. In particular, cigarette butt events exhibit strong spatial clustering, with a mean nearest-neighbour distance of approximately 25 metres. This finding provides empirical support for the broken windows theory.

Multi-Sensor Spatial Data Fusion for Road Condition Monitoring Digital Twins

Deepak Satheesan, Songnian Li, Michael Chapman

Toronto Metropolitan University, Canada

Pavement Management Systems (PMS) are essential for evaluating and maintaining transportation infrastructure; however, conventional monitoring methods are often labour-intensive, costly, and inaccurate. The growing need for reliable. timely pavement condition data has driven the development of automated, data-driven approaches. This study presents a low-cost and scalable framework for pavement condition monitoring that integrates multimodal sensing with a digital twin (DT) environment. Smartphones equipped with inertial measurement unit (IMU) sensors, GPS, and cameras are used to collect synchronized vibration and visual data during normal driving conditions. Vibration signals are analysed to detect anomalies associated with pavement surface irregularities, while video data are processed using a deep learning-based object detection model to identify surface distress. A late fusion approach combines the outputs from both modalities to improve detection reliability and provide comprehensive condition assessment. The system enables spatial mapping of detected distresses and supports real-time visualization through a web-based DT dashboard. Results demonstrate that multimodal sensing compensates for the limitations of individual sensors, enhancing both detection accuracy and robustness. The proposed framework offers a practical solution for efficient pavement monitoring. It supports data-driven decision-making for proactive infrastructure management, with potential for future expansion through crowdsourced data and additional sensing technologies.

A Lightweight Mobile Monitoring System For Detection Of Small-Scale Road Debris

Shaozhuang Xu, Jianqin Zhang, Xinhao Geng, Zikai Wang

School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, China, People's Republic of

To address the challenges of low efficiency and omission in manual inspection of small road debris, this study develops a lightweight mobile monitoring system for fine road debris. The system integrates a high-resolution industrial camera and a GNSS positioning unit to achieve real-time image acquisition and spatial synchronization. Built on the Python platform, the software includes data acquisition and communication modules that enable automatic uploading of images and system status information. To tackle the issues of small-object detection and limited edge-device computing power, an improved Dynamic-YOLOv8n model is proposed by introducing dynamic convolution and attention mechanisms to enhance recognition accuracy for small debris. Field experiments show that the system operates stably at vehicle speeds of 40–70 km/h, achieving an average detection accuracy of 93.2%. The results demonstrate that the proposed system achieves lightweight, real-time, and high-precision detection performance, providing an efficient and practical solution for road safety monitoring and digital maintenance.

A Framework for Integrating and Managing Heterogeneous 3D Geospatial Data in Urban Digital Twins

Jeson Lonappan, Olga Shkedova, Udo Feuerhake, Monika Sester

Leibniz Universität Hannover, Germany

Urban Digital Twins (UDT) require systematic integration of heterogeneous 3D geospatial data sources, but existing integration

methods struggle with semantic information loss during fusion, geometric precision degradation through format conversions, and

limited storage scalability. This paper presents a modular, database-centric framework achieving bidirectional semantic enrichment

through semantically enriched voxelization. The framework integrates CityGML building models, Mobile Mapping System (MMS)

point clouds, and Digital Terrain Models (DTM) using PostgreSQL/PostGIS database system with pgPointCloud, the point cloud

extension of Postgres for patch-based storage. A two-stage refinement pipeline is applied to align MMS point clouds to CityGML

wall surfaces using Random Sample Consensus (RANSAC) and Iterative Closest Point (ICP) algorithms. To integrate the terrain,

Constrained Delaunay Triangulation (CDT) algorithm is applied with building footprints as constraints. All datasets are independ-

ently voxelized at a common configurable resolution, with voxels enriched via custom pgPointCloud schemas storing multi-source

attributes. A unified voxel table merges layers using priority-based conflict resolution. The framework is evaluated in terms of com-

putational performance, registration precision, and storage efficiency, demonstrating feasibility and correctness of the integration

pipeline on a representative urban test case. This paper presents a proof-of-concept evaluated on a small urban area in Hannover,

Germany, demonstrating the framework’s potential for further development.

Towards a Digital Twin infrastructure for landslides: users and data requirements

Federica Gaspari, Marco Gentile, Daniela Carrion, Riccardo Barzaghi, Lorenzo Panzeri, Laura Longoni

Dept. of Civil and Environmental Engineering (DICA), Politecnico di Milano, Milan, Italy

The increasing frequency and magnitude of landslides necessitates a fundamental shift from reactive mitigation to proactive, predictive risk governance. To define the necessary tools for this transition, this study conducts a systematic literature review and operational analysis of current Digital Twin (DT) implementations in the geosciences. Through this review, we identify four primary target user groups (emergency responders, technical experts, public administrators, and citizens) and map their specific 4D data requirements and interaction logics. Our findings highlight that most existing systems function as "Digital Shadows" characterised by unidirectional data flows and a topography gap, where dynamic sensor data is superimposed onto static, outdated 3D meshes. Based on these requirements, we propose a theoretical layered architectural framework for a Data Hub designed to bridge these gaps. The conceptual architecture is structured into three interconnected tiers: an Acquisition Layer for multi-scale data ingestion; a Modelling and Processing Layer for AI and physics-based stability assessment; and an Application and Service Layer for translating complex data into actionable intelligence. Finally, this work investigates a possible implementation path for landslides DT projects by outlining technical recommendations. This includes the adoption of cloud-native formats (e.g., Cloud Optimized GeoTIFF, Zarr) and unified interoperability standards (e.g., OGC SensorThings API) to evaluate the feasibility of transitioning towards a true bi-directional cyber-physical system for landslide risk management.

Spatio-temporal modelling of H5N1 avian influenza outbreaks in Europe (2021–2024)

Mehak Jindal^1,2, Samsung Lim^1,2, Raina McIntyre^2,3

¹School of Civil and Environmental Engineering, University of New South Wales Sydney, New South Wales 2052, Australia; ²Biosecurity Program, Kirby Institute, University of New South Wales Sydney, New South Wales 2052, Australia; ³College of Health Solutions & College of Public Service & Community Solutions, Arizona State University, Tempe, United States.

Highly Pathogenic Avian Influenza (HPAI), particularly the H5N1 strain, poses a significant ongoing threat to animal health, biodiversity and food security across Europe. Understanding where and when avian influenza risks intensify is essential for targeted surveillance and rapid response. This study develops a data-driven spatio-temporal framework that integrates geospatial, ecological and climatic datasets to explain and forecast the dynamics of H5N1 outbreaks between 2021 and 2024. Weekly country-level outbreak counts (208 weeks, 37 countries) were analysed using a hierarchical endemic-epidemic model with an assumption of Negative Binomial distribution. Environmental covariates, bird-species densities, and human population metrics were incorporated into endemic and autoregressive components. Model performance was evaluated using rolling one-step-ahead forecasts assessed by proper scoring rules (logarithmic score and ranked probability score) and calibration diagnostics. The proposed framework substantially outperformed a regression-only Negative Binomial baseline, reducing mean logS by approximately 29% and RPS by 49%, while exhibiting improved probabilistic calibration. Results indicate that H5N1 transmission is structured by ecological drivers and local persistence mechanisms rather than purely seasonal effects. Anseriformes, Charadriiformes and Pelecaniformes densities were identified as the key migratory bird families contributing to the viral spread. The endemic-epidemic model achieved high forecast accuracy, with majority of the of observed weekly outbreak counts falling within central predictive intervals (RPS = 0.76, logS = 0.61). Overall, the proposed framework provides a scalable approach for integrating ecological and spatial information into early-warning systems for HPAI surveillance.

Optimization of Satellite Antenna Placement at a Ground Control Station using UAV LiDAR Data

Patryk Wróblewski, Anna Fryśkowska-Skibniewska, Paulina Jaczewska

Military University of Technology, Poland

Reliable communication between satellites and ground control stations (GCS) is fundamental to modern space missions, with its effectiveness being directly dependent on an unobstructed Line-of-Sight (LoS). Traditional site planning methods, relying on low-resolution terrain models, often overlook crucial obstacles like buildings or dense vegetation. This paper presents a comprehensive methodology using high-resolution Light Detection and Ranging (LiDAR) data, acquired from an Unmanned Aerial Vehicle (UAV), to precisely model horizon obstruction and optimize the placement of transceiver antennas. The methodology was verified on a real-world case study in Zielona Góra, Poland. The workflow included data acquisition, PPK-based trajectory processing, and point cloud subsampling using an Octree-based algorithm. The core of this work was the implementation of an algorithm to generate detailed elevation masks by calculating the maximum obstruction angle for defined azimuthal intervals. The analysis clearly identified the superior of two potential locations, proving the method's effectiveness as a decision-support tool in the space sector.

Integrating Microsoft Building Footprints and OpenStreetMap to Improve Building Representation

Cidália Fonte^1,2, Joaquim Patriarca^1,2,3, Diogo Duarte^2,4

¹University of Coimbra, Department of Mathematics; ²INESC Coimbra; ³University of Coimbra, Department of Informatics Engineering; ⁴University of Coimbra, Department of Electrotecnic Engineering

This paper investigates whether integrating the Microsoft Building Footprints (MBF) dataset with building footprints contributed by the OpenStreetMap (OSM) community can improve the spatial quality of building data. Specifically, the authors assess whether the resulting hybrid dataset enhances completeness and positional accuracy relative to the original MBF and OSM datasets. The evaluation was conducted in a study area encompassing both urban and rural environments, using 1:5,000 topographic cartography as the reference dataset. The merged MBF+OSM dataset successfully captured 87% of the buildings represented in the reference cartography, outperforming the standalone MBF and OSM datasets, which captured 81% and 70%, respectively. These results demonstrate that combining MBF and OSM footprints provides a more comprehensive representation of buildings and can offer a valuable alternative for applications requiring detailed, up-to-date building information.

An Integrated Geomatic and HBIM Workflow for Reviving Lost Architectural Heritage: The “TURIN 1911-project” Case Study

Mohammadreza Mehdizadeh, Filiberto Chiabrando, Beatrice Tanduo

Department of Architecture and Design - Politecnico di Torino, Italy

The International Exposition of Turin held in 1911 in the Val-entino Park to celebrate the fiftieth anniversary of Italian unifi-cation; Hosted pavilions dedicated to science, industry, art, and architecture, symbolizing the modern spirit of post-unification Italy (Italy World’s Fairs, 2024). Today, only some traces of the project survive within the park. This disappearance has turned the exposition into a lost heritage landscape, known primarily through archival maps, photographs, and historical records. In late 2014 the project Turin 1911 started according to the cooperation between the Politecnico di Torino- Depart-ment of Architecture and Design in cooperation with the Uni-versity of California San Diego - School of arts and Humanities (https://italyworldsfairs.org/)

This project focuses on the digital revival of this exposition by integrating these materials with digital surveying and immer-sive visualization, aims to reproduce this vanished site and make it perceptible again to the public through virtual reality technologies.Within this framework, the research presented in this paper concentrates on the optimization of these mainly Revit and ArchiCAD modeled pavilions using the tools provid-ed by Unreal Engine for deployment on standalone VR sys-tems. The goal is to make heavy weight architectural scenes accessible in VR without connecting to PC and extending the concept to portable devices.

Discussion on Quality Model and Evaluation Methodology for WMS

Han Wenli, Ge Juan, Zhang Libo, Shen Jing, Tan Yang

National Quality Inspection and Testing Center for Surveying and Mapping Products, China, People's Republic of

The integration of internet technologies and geographic information systems (GIS) has positioned Web Map Services(WMS) as indispensable tools for daily life, with their service quality attracting significant attention. This study proposes a quality evaluation model and method tailored for WMS, encompassing four critical dimensions: query and retrieval, map display, thematic services, and productization services. Empirical validation was conducted through functional, performance, and productization evaluations of three leading domestic platforms, utilizing technical benchmarks and user-centric metrics. Results demonstrate the model’s efficacy in quantifying service quality, aligning closely with real-world user experiences. The framework provides actionable guidelines for regulatory bodies to monitor service compliance and for providers to optimize architectural designs, thereby addressing gaps in personalization and cross-border functionalities observed in current systems. Furthermore, this work highlights the necessity of integrating emerging technologies—such as real-time traffic data and AI-driven personalization—to meet evolving demands for energy efficiency, global connectivity, and hyper-localized services. By bridging technical assessments with practical governance needs, the study offers a strategic roadmap for advancing service quality, supporting the development of China’s digital economy, and enhancing societal well-being through reliable geospatial solutions.

Driver training in immersive virtual reality (VR) and transfer to the real world: A feasibility study on learning to reverse a truck in VR

Andrea Frick^1,2, Arzu Çöltekin³

¹Institute for Research and Development of Collaborative Processes, School of Applied Psychology, University of Applied Sciences Northwestern Switzerland (FHNW), Switzerland; ²Institute of Mental Health, School of Applied Psychology, Zurich University of Applied Sciences (ZHAW), Switzerland; ³Insitute of Interactive Technologies, School of Computer Science, University of Applied Sciences and Arts Northwestern Switzerland (FHNW), Switzerland

Virtual reality (VR) offers important advantages in training complex spatial skills, as required for example in driving, because it enables immersion and experience-based learning, and offers financial, ecological, and safety benefits. In the context of driving, as larger vehicles can be especially challenging to master for beginners, we investigated whether truck driving instruction and practice in a VR-simulator enhances performance, and whether the acquired skills transfer to maneuvering a real vehicle. In an empirical feasibility study, we first measured learners’ performance while an experienced instructor trained them on a conventional simulator vs. in VR, following analogous training protocols. The task was to reverse a truck with a trailer, a particularly difficult task that requires extensive practice. After training, participants completed a test on a real vehicle to validate the effectiveness of the training. Participants were asked to report previous experience, attitudes towards the system, motion sickness, and fatigue levels. Four male participants, who had a car driving license but no experience reversing a truck with trailer, completed the training. Results demonstrate that basic maneuvering skills can be trained in VR and transfer to the real vehicle. Even with a low-budget VR solution, participants learned easily, and learning curves were comparable to the simulator condition. Participants reported positive attitudes towards the training in both conditions. Future research could investigate whether using a customized VR environment that takes full advantage of all the benefits of VR, could lead to even greater training gains.

3D Geodata Based Optimization of UAV Docking Stations in Mountainous Areas for Emergency Response

YiLang Lin, Zhiyong Wang, Yongjie Lin, Yonghan Liao, Junjie Lu, Runze Huang, Lang Hu, Yuanren Jie

South China University of Technology, China

In recent years, the increasing frequency of natural disasters in remote and rugged areas has underscored the importance of unmanned aerial vehicles (UAVs) for rapid emergency response. This paper presents a novel approach for optimizing the placement of UAV docking stations in mountainous terrain for emergency operations. We develop a comprehensive, 3D Geodata framework that integrates 3D Digital Elevation Models (3D DEM), building infrastructure, and road network data to create a realistic three-dimensional optimization environment. The proposed system employs an Enhanced Adaptive Particle Swarm Optimization (EAPSO) algorithm with adaptive parameters, diversity maintenance mechanisms, and intelligent convergence detection to effectively handle the complex constraints of mountainous environments. Experimental results demonstrate that our 3D-aware EAPSO approach achieves superior performance in balancing coverage efficiency, energy consumption, and network connectivity compared to conventional optimization methods. The proposed system provides a scientific foundation for improving emergency response capabilities in challenging geographical environments.

A Framework for Enabling Data Sharing and Accessibility in a Transdisciplinary Federated Marine Spatial Infrastructure

Bolelang Sibolla^1,2, Raymond Molapo^1,3, Lufuno Vhengani¹, Diego Madanga¹, Gasethata Maduenyane¹

¹Next Generation Enterprises and Institutions, Council for Scientific and Industrial Research, South Africa; ²Department of Geography Geoinformatics and Meteorology, University of Pretoria, South Africa; ³Multilingual Speech Technologies, North-West University, Potchefstroom 2520, South Africa

The Sustainable development goals and the United Nations Ocean Decade require preservation of the oceans and the efficient management of marine resources, contributing to a sustainable oceans and blue economy. Oceans span a wide area with exclusive economic zones of different countries adjacent to each other. This therefore necessitates the collaborative management of these resources across several countries in a region. Data is essential to providing trusted information, which in turn drives knowledge generation from science to policy implementation, towards informed decision making regarding the ocean resources. Harmonising data into decision support tools becomes a challenge due to two main reasons. Firstly, due to the transdisciplinary nature of the ocean, where data is governed by a variety of standards. Secondly, regional collaboration requires the data and knowledge to be shared in a federated environment in order to preserve data sovereignty, while cognisant of the network challenges in developing countries. This paper presents a standard compliant framework for enabling data sharing and access in these environments based on lessons learnt in the Marine and Coastal Operations for Southern Africa and Western Indian Ocean region, a project supported by the African Union Commission‘s GMES and Africa program.

A Spatiotemporal Knowledge Graph Construction and Management System

Xiuli Zhu^1,2, Wei Zhang¹, Jie Yang³, Ye Zhang^1,2, Ping Lu^1,2, Xiaoyu Liu⁴, Yunlu Peng^1,2, Ran Li^1,2, Tingting Zhao^1,2, Xinpeng Wang^1,2, Xinli Di^1,2

¹National Geomatics Center of China, 28 Lianhuachi West Road, Haidian District, Beijing, 100830, China; ²Key Laboratory of Spatio-temporal Information and Intelligent Services (LSIIS), MNR, 28 Lianhuachi West Road, Haidian District, Beijing, 100830, China; ³State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing, 100101, China; ⁴Chongqing Institute of Surveying and Monitoring for Planning and Natural Resources, Chongqing, 401120, China

With the deep integration of big data and artificial intelligence technologies, the knowledge graph has emerged as an important method for organizing and understanding complex spatiotemporal information. Traditional knowledge graph management systems often face three significant challenges when dealing with spatiotemporal information in domains such as natural resource, urban studies, and emergency management. Firstly, the limited visualization capability makes it hard to intuitively represent the spatial distribution and temporal evolution of spatiotemporal knowledge. Secondly, the lack of systematic and deep machine-interpretable representation methods leads to inadequate diagnostic, predictive, and decision-making knowledge services. Thirdly, the knowledge construction process heavily relies on expert involvement resulting in high barriers to entry and low efficiency. To address these issues systematically, this paper designs and implements a comprehensive spatiotemporal knowledge graph construction and management system that integrates full lifecycle knowledge management, multi-form visualization methods, general and thematic knowledge graph construction.

Unsupervised Mapping of Flood-prone Areas in Ghana Using Sentinel-1 Time-Series

Felix Enyimah Toffah¹, Francesco Pirotti^2,3

¹Dept. of Civil, Building and Environmental Engineering, Sapienza University of Rome, Via Eudossiana 18, 00184 Rome, Italy; ²Dept. of Land and Agroforestry Systems (TESAF), University of Padua, Viale dell'Università 16, 35020 Legnaro, Italy; ³Interdepartmental Research Centre in Geomatics (CIRGEO), University of Padua, Viale dell'Università 16, 35020 Legnaro, Italy

Flooding is one of the most persistent natural hazards in Ghana, causing recurrent damage to infrastructure, livelihoods, and local economies. Despite its widespread impacts, most flood-related research has been concentrated on Accra, leaving many vulnerable regions understudied. This paper integrates Earth Observation (EO) datasets to identify and characterise flood-prone areas across Ghana at a national scale. Precipitation patterns between 2015 and 2025 were derived from the IMERG dataset, while Sentinel-1 Synthetic Aperture Radar (SAR) imagery was used for flood mapping through a change detection (ratio) approach. Results show a clear seasonal cycle, with major rainfall peaks from April to October, directly corresponding to observed flood events. Flooding is concentrated in the southern half of the country, particularly in Western, Western North and Eastern Regions, and recurrent hotspots around Kumasi in Ashanti and the Weija dam in Greater-Accra regions. Spatial patterns align closely with national topography, confirming the vulnerability of low-lying settlements and riverine communities. Technically, the study demonstrates the effectiveness of SAR-based change detection for flood mapping in data-sparse environments, while highlighting limitations related to in-situ validation and urban misclassification. From a policy perspective, the findings provide evidence to support flood risk management strategies, including targeted infrastructure investment and improved drainage planning. The results underline the necessity ofadopting engineering solutions to reduce flood vulnerability in communities in Ghana.

3D Modelling of Easement Rights Using BIM : A Feasibility Study

Hamta Amini¹, Mahmoud Reza Delavar², Saeid Emamgholian³

¹School of Geomatics and Geospatial engineering, University of Tehran, Iran, Islamic Republic of; ²Centre of Excellence in Geomatic Eng. in Disaster Management and Land Administration in Smart City Lab., School of Surveying and Geospatial Eng., College of Engineering, University of Tehran, Tehran, Iran; ³Faculty of Forestry, Geography, and Geomatics, Dept. of Geomatics, Université Laval

This contribution presents a feasibility study on representing access easement rights in multi-owned buildings using BIM and the IFC standard. A 3D BIM model was generated from 2D cadastral plans, and access easements between parking and storage units were modeled as explicit IFC entities with legal attributes such as beneficiary, servient unit, and restriction semantics. The study demonstrates how embedding easements as identifiable objects in IFC can enhance the clarity of Rights, Restrictions, and Responsibilities (RRRs) and improve the communication of legal constraints for future 3D digital cadaster applications.

Digital Tools for Interpretation of Reconstructed Mining Features. Project Digital Geopark Muskau Arch.

Jan Blachowski¹, Anna Buczynska¹, Monika Hajnrych¹, Aleksandra Kaczmarek¹, Damian Kasza¹, Paulina Kujawa¹, Jaroslaw Wajs¹, Jörg Benndorf², Kersten Löwen³

¹Politechnika Wrocławska, Wrocław, Poland; ²Technical University Freiberg, Germany; ³European Group of Territorial Cooperation Geopak Muskau Arch, Klein Kolzig, Germany

The aim of the presented study is to develop and implement the strategy for digitally reconstructing and presenting the forgotten heritage associated with underground and open-pit mining conducted in the nowadays bilateral UNESCO Geopark Muskau Arch located on the border of Germany and Poland.

The research is led by scientific partners from Poland (Wroclaw University of Science and Technology) and Germany (Freiberg Technical University), with cooperation from the European Grouping of Territorial Cooperation (EGTC) Geopark Muskau Arch within the project “Digital Journey through Geopark Muskau Arch” co-financed from the European Regional Development Fund as part of the Poland-Saxony 2021-2027 INTERREG Cooperation Program.

Integrating Point of Interest and BERT to identify potentially contaminating Enterprises in Datong City

Xin Du¹, Bin Yang¹, Jing Tong², Xiaomin Du², Liya Liu¹, Jinzhong Yang², Weiling Yao²

¹Hebei Remote Sensing Center; ²China Aero Geophysical Survey and Remote Sensing Center for Natural Resources

Effective management of environmental safety risks in brownfield redevelopment relies on accurate identification of contaminated enterprises. A key challenge is the rapid acquisition of data on these enterprises. This study proposes a method leveraging Point of Interest (POI) data and a BERT-based prediction model to identify potentially contaminated enterprises. The method was applied to Datong, a major industrial and mining city in China. The method successfully identified 329 potentially contaminated enterprises across 23 types of polluting industries. Notably, enterprises in the mining and washing sectors of bituminous and anthracite coal represented 26.2% of the total identified, reflecting Datong’s coal-centric industrial nature. The proposed method efficiently identifies potentially contaminated enterprises, supporting targeted environmental risk management and brownfield redevelopment. Integrating it with regulatory frameworks can enhance compliance monitoring and inform decision-making for sustainable urban development.

GeoAI: A Pipeline for Environmental Monitoring and Feature Discovery

Hana Gamracy¹, Paramdeep Sodhi¹, George Sphicas¹, Divya D'Souza¹, Puranjai Garg¹, Noga Gottlieb¹, Sahil Sanjay Gupta¹, Rucha Maslekar¹, Atanas Patterson¹, Noah Steaderman¹, William Stout¹, Fernanda Lopez Ornelas², David Saah², David Guy Brizan¹, Christopher Brooks¹

¹Department of Computer Science, University of San Francisco, United States of America; ²Department of Environmental Science, University of San Francisco, United States of America

The development of successful geospatial artificial intelligence (GeoAI) systems is hampered by two major obstacles: a scarcity of high-quality, annotated satellite imagery and a lack of unified platforms for modeling and testing. We introduce a scalable GeoAI framework that allows users to query, retrieve, and analyze high-resolution imagery using natural language interaction and direct processing of images. The system incorporates IBM-NASA's Prithvi Foundation Model for supervised detection of environmental features and the Clay Foundation Model for unsupervised similarity search when detectors are unavailable. An interactive interface allows users to search for features (such as swimming pools, vegetation changes, and burn scars), apply detectors to TIFF images, and explore new regions for model training

Evaluating the Relationship between Atmospheric Pollutants and Land Surface Indices Using Multi-Sensor Satellite Data

Aman Srivastava, Aditya Kumar Thakur, Rahul Dev Garg, Pradeep Kumar Garg

Indian Institute of Technology Roorkee, India

India, as one of the fastest-developing nations, faces severe air quality challenges due to rapid urbanization, industrialization, vehicular emissions, and agricultural activities. With major cities frequently exceeding WHO pollution limits, understanding the spatial and temporal behavior of atmospheric pollutants has become crucial. The integration of satellite-based geospatial technologies provides a powerful framework for assessing land–atmosphere interactions and their environmental implications. This study investigates the relationship between atmospheric pollutants and land surface characteristics across India using Sentinel-5P and Sentinel-2 datasets. The objective is to examine how pollutants influence vegetation health and urbanization through indices such as the Normalized Difference Vegetation Index (NDVI) and the Normalized Difference Built-up Index (NDBI). Google Earth Engine (GEE) and MATLAB were employed for data processing, statistical analysis, and visualization. NDVI and NDBI were derived from Sentinel-2 bands, while pollutant data (NO₂, SO₂, CO, O₃, HCHO, and CH₄) were extracted from Sentinel-5P. Correlation analysis, univariate regression, and temporal trend models were used to evaluate pollutant behavior and its linkages with land cover dynamics from 2019–2024. Results revealed strong positive correlations among NO₂, CO, SO₂, and HCHO (r = 0.59–0.76), indicating common anthropogenic sources, while NDVI showed significant negative correlations with O₃ (r = –0.46) and HCHO (r = –0.64). Formaldehyde and methane displayed the strongest increasing trends, highlighting growing emissions and vegetation response contrasts. The findings emphasize the interconnectedness of pollution, vegetation degradation, and urban expansion. Future research should integrate meteorological parameters and predictive modeling to strengthen sustainable environmental management and urban planning frameworks in India.

Analysis of the Current Situation and Research on Countermeasures of National Fundamental Surveying and Mapping Achievements Services

Ping Lu, Xiuli Zhu, Wanzeng Liu, Xinli Di, Ran Li, Shangwei Lin

National Geomatics Center of China, 28 Lianhuachi West Road, Haidian District, Beijing, 100830, China

Based on the current situation of the application and service of national fundamental surveying and mapping achievements from 2020 to 2024, this paper adopts a combined method of quantitative and qualitative analysis to identify the existing problems and challenges, including constraints imposed by confidentiality management policies, the need to improve the timeliness and category diversity of data, and the insufficient service awareness and informatization service level. Corresponding countermeasures and suggestions for promoting the efficient provision and extensive utilization of fundamental surveying and mapping achievements are put forward, mainly including improving the policy and institutional system for the confidentiality management of surveying and mapping achievements, perfecting the achievement update mechanism, enriching the variety of achievements, advancing the processing and compilation of public-version surveying and mapping achievements, and constructing a public geographic information data innovation and application laboratory.

5:30pm - 7:30pm

Exhibition Opening & Reception
Location: Exhibition Hall "F"

Date: Tuesday, 07-July-2026

8:30am - 10:00am

WG II/2B: Point Cloud Generation and Processing
Location: 713A

8:30am - 8:45am

Multi-Source Fusion of Roof Skeletons, LiDAR and Street-View Imagery for Semi-Automated LoD-2 Building Modelling

Vaibhav Rajan¹, Sander Münster¹, Jonas Bruschke¹, Ferdinand Maiwald²

¹Digital Humanities, Friedrich-Schiller-Universität Jena, Germany; ²Chair of Optical 3D-Metrology, TUD Dresden University of Technology, Germany

LoD-2 building models are more informative and practically more useful than LoD-1 representations because they capture the roof structure that defines the essential three-dimensional form of a building. They are important for applications such as urban planning, environmental simulation, and digital heritage. Although recent roof shape extraction methods can derive vectorised 2D roof structures from very-high-resolution imagery, transforming these image-based representations into fully textured 3D buildings remains challenging. In this paper, we present a semi-automated LoD-2 reconstruction pipeline that integrates HEAT-derived roof geometry with airborne LiDAR, satellite and Google Street View imagery. The 2D outputs are reprojected into map coordinates, fused with LiDAR through a two-stage roof reconstruction strategy to derive roof shapes and combined with an adaptive, LiDAR-based ground base initialisation to create a complete 3D wireframe. Roofs are textured using VHR orthophotos while the walls are textured via a process of Street View panorama selection, geometric filtering, Mask2Former segmentation, and homography rectification. Across a large-scale evaluation on 1000 buildings, the proposed two-stage reconstruction strategy improves geometric agreement with the LiDAR reference data achieving a roof-surface RMSE of 0.445~m. The wall texturing process produces convincing facades when suitable panoramas are available. While minor challenges such as sensitivities to LiDAR outliers, incomplete roof geometry, and facade occlusions persist, this pipeline effectively bridges 2D roof parsing and textured LoD-2 model generation, providing a robust and scalable foundation for advancing toward fully automated workflows.

8:45am - 9:00am

BIM-to-Labelled Point Cloud : Automated Point Cloud Annotation from BIM Models using Bounding Boxes and Solid Geometry

Saad Boudarbala^1,2, Tania Landes², Hélène Macher², Thibault Bavoux¹

¹Futurmap Lyon, France; ²INSA-Strasbourg, France

This paper presents an automated framework for generating semantically labelled building point clouds from their corresponding BIM models. The proposed methodology aims to facilitate the creation of training datasets for deep learning–based indoor semantic segmentation. Two complementary labelling strategies are introduced. The first relies on bounding boxes (BBX) extracted from BIMelements to efficiently assign labels to points based on volumetric inclusion. The second approach uses solid geometry and a nearest-neighbour principle (SG-NN) to compute distances between BIM object meshes and the point cloud, enabling a more precise spatial correspondence. In addition, a room-based geometric grouping strategy is proposed to structure the annotated point clouds into spatial units compatible with common indoor segmentation datasets. The methods are evaluated through a qualitative analysis on several real building datasets of different typologies and acquisition conditions, as well as through a quantitative evaluation based on a manually segmented reference point cloud. Results show that the SG-NN approach achieves higher performance, with an average Recall of 92% and IoU of 88%, compared to 87% of Recall and %78 of IoU for the BBX approach. While the BBX approach provides faster processing, the SG-NN strategy achieves higher labelling accuracy, particularly for geometrically complex elements. The proposed workflow enables scalable dataset generation from Scan-to-BIM projects while significantly reducing manual annotation effort.

9:00am - 9:15am

Enhanced SegNet-based Building Extraction Framework via Image Segmentation and Point Cloud Fusion

Chi Tien Nguyen, Dinh Minh Bui, Somin Han, Changjae Kim

Department of Civil Engineering and Environment, College of Engineering, Myongji University

This paper presents an enhanced building extraction framework that combines deep learning-based image segmentation with photogrammetric point cloud refinement for urban roof detection. The method first applies a modified SegNet model to orthophotos from the ISPRS Vaihingen dataset to generate initial building masks. These results are then refined using geometric information from point clouds through ground filtering, clustering, and normal-guided region growing. By integrating spectral information from imagery with structural cues from 3D data, the proposed framework improves roof boundary delineation and reduces spurious detections. Experimental results on Areas 35 and 37 show that the method achieves strong overall performance, with a precision of 0.96, recall of 0.81, IoU of 0.78, and F1-score of 0.88. The findings indicate that point cloud refinement helps produce cleaner and more reliable building objects than image-based segmentation alone, especially in complex urban scenes. However, the approach remains sensitive to the density and quality of the point cloud. Overall, the study demonstrates that fusing orthophoto segmentation with point cloud processing is an effective strategy for more accurate and geometrically consistent building extraction.

9:15am - 9:30am

Application Of Multi-Source Photogrammetric Data For Fast Building Inventory

Anna Fryskowska-Skibniewska, Patryk Wróblewski, Klaudia Pasternak, Julia Gotowiec

Military University of Technology, Poland

The rapid expansion of urban areas and the continuous demand for their monitoring make remote sensing data a highly valuable tool for collecting large volumes of geospatial information in a relatively short time and with high repeatability. The main objective of this paper is to examine the potential offered by different types of geospatial data, as well as the relationships based on their scope, in comparison with measured reference data.

Architectural inventory tasks are useful not only for engineering projects but also for broader applications, such as environmental impact assessments, spatial planning, and related fields. This article introduces a rapid and cost-effective mixed-mode data collection framework for building inventory development, integrating terrestrial laser scanning, UAV imagery, and traditional ground measurements.

The paper will discuss the latest measurement technologies and their practical applications in building surveying, illustrated with a selected case study. The criteria for selecting appropriate measurement methods will also be analyzed, depending on the investor’s requirements and the intended use of the documentation.

This paper presents a set of techniques for updating the geometric information of buildings using laser scanning and imagery. It begins with an introduction to the fundamental concepts, terminology, and principles of 3D information. Subsequently, various measurement techniques are described, along with a discussion of potential sources of error and data incompleteness. The extracted geometric values are validated against independent survey data.

9:30am - 9:45am

Conjugate Feature-Guided Dense Stereo Matching for High-Precision Attribute-Enriched Urban Point Clouds

Yung-Ching Yang, Jen-Jer Jaw

National Taiwan University, Taiwan

Accurate 3D reconstruction of urban scenes from multi-view images is essential for city planning, digital twins, and autonomous navigation. Traditional dense image matching relies on low-level cues such as intensity or gradients, which often produce noisy or incomplete point clouds in complex urban environments. This study introduces an attribute-enriched dense matching framework that embeds both geometric features and semantic attributes from multi-view images to guide dense image matching.

The framework first extracts semantic labels and geometric feature correspondences to generate intermediate products: conjugate features, feature seeds, an attribute map, and an initialized disparity map. These elements provide reliable priors that constrain dense matching, reduce search ranges, and prevent mismatches across structural boundaries. Dense image matching then propagates these constraints, producing an attribute-enriched disparity map and point cloud in which each 3D point carries both geometric and semantic information.

Evaluated on urban datasets, the proposed approach improves corner and edge localization, enhances edge continuity, reduces outliers in low-texture areas, and preserves semantic and structural attributes throughout 3D scene reconstruction. By integrating feature-based initialization with attribute-enriched dense image matching, the method delivers more accurate, interpretable, and robust 3D urban reconstructions, supporting downstream tasks such as precise measurement, object recognition, and scene analysis.

9:45am - 10:00am

Efficient Extraction and Specification-Compliant Optimization of Railway Alignment Parameters from UAV LiDAR Point Clouds

Zhaochen Han, Xuming Ge, Min Chen, Han Hu

Faculty of Geosciences and Engineering, Southwest Jiaotong University

The rapid acquisition of high-precision parametric railway alignment is a fundamental prerequisite for intelligent railway construction and maintenance. Traditional measurement techniques and alignment fitting methods heavily rely on manual operations, often resulting in inefficiency, high costs, and insufficient accuracy control. To address these challenges, this study proposes an automated method for extracting and optimizing railway alignment from UAV-based LiDAR point clouds. Initially, track centerlines are extracted by leveraging the geometric smoothness of the railway and the structural characteristics of the track. A multi-constraint energy model integrating distance, orientation, and curvature is constructed to fit the geometric parameters of alignment elements, thereby providing high-quality initial values for subsequent alignment engineering parameter optimization. Finally, a global optimization strategy based on the simulated annealing algorithm is applied to jointly refine the engineering parameters of the standardized alignment composition, ensuring strict compliance with railway design specification. Experimental results demonstrate that the proposed method can efficiently and robustly extract high-precision alignment parameters with well-defined engineering semantics from complex railway point clouds, thereby providing reliable technical support for intelligent construction and full lifecycle management of railway systems.

8:30am - 10:00am

WG III/1I: Remote Sensing Data Processing and Understanding
Location: 713B

8:30am - 8:45am

OG-TPTV: A texture-preserving regularizer for hyperspectral image denoising

Zhangping Wu, Mi Wang

Wuhan University, China

Hyperspectral images (HSIs) are often severely degraded by mixed noise, such as Gaussian, stripe, and impulse noise during acquisition and transmission, which seriously impedes their subsequent applications. Therefore, HSI denoising is both crucial and challenging. In this work, we present a gradient-domain outlier-guided texture-preserved total variation (OG-TPTV) regularizer designed to remove mixed noise in HSIs. First, we utilize the mode-3 low-rank property of HSI gradient maps along the spectral dimension and apply a low-rank decomposition model to extract their spatial representation coefficients (SRCs). To improve the sparsity characterization of SRCs in the gradient subspace, an outlier-guided strategy is introduced. Specifically, we perform outlier detection on gradient maps to distinguish noise from texture structures and remove outliers to generate precise texture weighting maps. The resulting texture weight maps offer adaptive guidance for adjusting the strength of the sparsity constraints. Finally, a denoising method for HSIs is developed based on OG-TPTV. Extensive experiments on both synthetic and real HSIs demonstrate the superior denoising performance of our method.

8:45am - 9:00am

SpectralNet-X: Transformer-based Lossy Compression for Hyperspectral Satellite Data

Jannik Sheikh¹, Wolfgang Groß¹, Jannick Kuester¹, Andreas Michel¹, Martin Weinmann²

¹Fraunhofer IOSB, Germany; ²Karlsruhe Institute of Technology (KIT)

Hyperspectral satellite missions generate massive data volumes that are difficult to transmit and store under tight onboard resource constraints, making effective lossy compression a key enabling technology. We propose SpectralNet-X, a transformer-based autoencoder for spectral-only compression of spaceborne hyperspectral imagery at a fixed compression ratio of 16. The encoder maps each spectrum to a low-dimensional latent code using a 1D convolutional projection followed by stacked self-attention layers with rotary position embeddings, and aggregates information via cross-attention pooling. The decoder reconstructs full-band spectra through an upsampling stack and per-band affine calibration. To improve reconstruction fidelity and generalization, SpectralNet-X is first pretrained in a masked-signal reconstruction task inspired by SimMIM and then fine-tuned with a mixed objective that combines mean-squared error and spectral angle mapper (SAM) terms using a scheduled weighting scheme. We evaluate SpectralNet-X on the large-scale HySpecNet–11k benchmark and in a mission-realistic cross-sensor setting, where models trained on HySpecNet–11k are tested on PRISMA hyperspectral scenes. Across PSNR, SSIM, and SAM, and when compared to three different compression autoencoders, SpectralNet-X achieves the lowest angular reconstruction errors while maintaining competitive distortion metrics and substantially reducing the fraction of spectra with large SAM outliers. These results indicate that transformer-based spectral compression is a promising candidate for robust, mission-realistic onboard hyperspectral data reduction.

9:00am - 9:15am

Sensitivity of Deep Learning Validation to Spatial Scale–Sample Size Interactions in Hyperspectral Imaging

Yanfang Sun^1,2, Yuan Jiang^1,2, Yongze Song³, Rui Qu⁴

¹College of Civil Engineering, Taiyuan University of Technology, Taiyuan, China; ²Shanxi Key Laboratory of Civil Engineering Disaster Prevention and Control, Taiyuan,China; ³School of Design and the Built Environment, Curtin University, Perth, Australia; ⁴School of Computer Science and Technology, Aba Teachers College, Aba Zhou

Validating the performance of deep learning models in satellite imagery is essential for ensuring model generalizability, decision reliability, and spatial transferability—particularly in the context of hyperspectral images, which contain high-dimensional, spatially complex data. While it is well recognized that multiple spatial characteristics influence deep learning model performance, few studies have systematically examined how the interactions among these characteristics affect model validation sensitivity in hyperspectral contexts. This study aims to investigate how the interaction between spatial scale (e.g., surrounding 3, 5, 7 grids) and training sample size (e.g., 10%, 30%, 50% of all data) influences the validation accuracy and sensitivity of deep learning models. An innovative validation sensitivity index is developed to quantify the change in accuracy per unit of spatial scale and sample size, enabling a more refined assessment of model robustness. The index is applied to three representative hyperspectral datasets, covering diverse environmental and spectral conditions. Results show that spatial scale accounts for 0~21.0% accuracy variation, training sample size contributes 5.6~36.5% variation, but their interaction leads to 5.4~70.3% variation, indicating a nonlinear amplification enhanced effect. These findings may be explained by the compounded influence of data contextuality, spatial redundancy, and model overfitting dynamics. This study demonstrates the critical need to consider spatial interactions in validation design, offering new insights for enhancing the reliability of geospatial artificial intelligence (GeoAI) applications in remote sensing and spatial data science.

9:15am - 9:30am

Assessment of RTM-induced Surface Reflectance Differences between 6SV and VLIDORT under a Single Atmospheric-correction Framework

Seungwon Kim¹, Suyoung Sim¹, Jongho Woo¹, Sungwoo Park¹, Seungkyoo Lee¹, Chaeyun Kim¹, Huiji Yu¹, Kyung-Soo Han²

¹Division of Earth Environmental Science (Major of Spatial Information Engineering), Pukyong National University, Republic of Korea; ²Professor, Division of Earth Environmental Science (Major of Spatial Information Engineering), Pukyong National University, Republic of Korea

Surface reflectance is a foundational variable in optical remote sensing, as inaccuracies introduced during atmospheric correction can propagate and amplify across subsequent satellite-derived products. Nonetheless, the extent to which the choice of Radiative Transfer Model (RTM) affects reflectance retrieval has not been sufficiently examined. This study investigates how two widely used RTMs—6SV and VLIDORT—produce different surface reflectance outcomes when applied under consistent atmospheric and geometric conditions for the GEO-KOMPSAT-2B/GEMS instrument. To ensure comparability, both models were driven by identical GEMS aerosol properties and an equivalent LUT configuration.

The comparison shows that while the two RTMs reproduce broadly similar spatial patterns, systematic quantitative differences remain in the retrieved reflectance. These differences vary depending on atmospheric and viewing conditions, particularly under higher aerosol loading. A sensitivity analysis further indicates that aerosol amount and scattering characteristics, alongside viewing geometry, are key factors influencing the magnitude of RTM divergence.

Overall, this study provides a structured assessment of RTM-dependent variability in atmospheric correction and highlights the importance of model choice when interpreting or harmonizing surface reflectance products. The findings offer a basis for improving consistency in future GEMS-based retrievals and for advancing reliable surface reflectance generation in geostationary remote sensing.

9:30am - 9:45am

Attention-driven Cross-modal Self-supervised Learning for Label-efficient Hyperspectral-LiDAR DSM Classification

Jonathan Gonzalez Santiago¹, Wolfgang Gross¹, Karsten Schulz¹, Wolfgang Middelmann¹, Uwe Soergel²

¹Fraunhofer IOSB, Germany; ²Institute for Photogrammetry and Geoinformatics (ifp), University of Stuttgart, Germany

Remote sensing acquisition systems rely on a range of platforms, from drones to satellite missions, to record multimodal Earth surface data. This fact encourages the preparation of datasets with complementary properties, thereby increasing their discriminative potential. A common complementary combination is between Hyperspectral and LiDAR-generated digital surface model data. While engaging, this fusion poses challenges for specific applications. Multiple works fuse these modalities at the feature level using vector concatenation, maximization, or averaging. Although functional, these methods omit target interactions between the modalities. Another challenge in remote sensing is the quantity and quality of labels required by deep learning methods, which are expensive, error-prone, and difficult to scale. We address the challenges above by proposing a self-supervised processing framework based on cross-modal attention that effectively fuses features at multiple levels, thereby exploiting complementary information across data streams. Specifically, our method is founded on a pseudo-Siamese network that reweights each modality’s features with information from the other via a mirrored cross-modal attention. The network’s objective is to maximize the similarity between the feature representations of both streams. A fusion network builds a latent representation using the learned encoders and attention modules. Then, a k-Nearest Neighbor classifier categorizes each sample within the representation using ten labels per class. Our experiments show that our spatial- and channel-spatial cross-modal attention approaches outperform well-established fusion methods for label-efficient land cover classification across datasets. Our findings lay the groundwork for fusion methods that effectively exploit inter-stream data relationships to encourage complementarity.

9:45am - 10:00am

GAN-based pan-to-rgb Image Translation for remote sensing Data

Xiaowei He¹, Yingzi Xiong¹, Xiao Ling¹, Qinghong Sheng¹, Xiao Xu²

¹Nanjing University of Aeronautics and Astronautics, China, People's Republic of; ²Yangtze Delta Region Institute of Intelligent Sensing (Nantong)

Despite the rapid development of satellite sensors, acquiring high-resolution RGB images remains a challenge. In this paper, a GAN-based multiscale feature-based pan-to-rgb model is proposed to establish a novel framework for high-resolution, high-fidelity RGB images generation from remote sensing panchromatic images. The spatial structure, texture, and color of the results are consistent with the real images, and the colors are naturally realistic and vibrant. Multiscale features and symmetric luminance color decoders are utilized to overcome color desaturation, inaccuracy, and distortion in conventional algorithms. By combining CNNs for local feature modeling and transformers for global feature modeling, this approach learns pan-to-rgb mappings to produce high-resolution, high-fidelity RGB images in CIELAB space. Besides, the luminance distance loss and the color distance loss are utilized to prevent the coupling of luminance and color. We also conducted experimental validation on Gaofen-7 satellite data, and the results demonstrated that the FID, CF, and △CF indicators of the proposed algorithm improved by 2.90%, 11.77%, and 64.51%, respectively, compared to the comparison algorithms.

8:30am - 10:00am

WG I/6B: Orientation, Calibration and Validation of Sensors
Location: 714A

8:30am - 8:45am

Evaluation and performance assessment of a novel UAV-borne laser scanner system

Gottfried Mandlburger¹, Elisabeth Ötsch¹, Philipp Knopf²

¹TU Wien, Department of Geodesy and Geoinformation, Austria; ²Knopfhoch GmbH, Austria

Miniaturized UAV laser scanning systems have advanced rapidly over the past decade, especially in the low-cost sector. DJI entered this field with the Zenmuse L-series, integrating GNSS/INS with compact scanners. While the first-generation L1 showed moderate precision, the L2 improved notably through reduced beam divergence. In November 2025, DJI released the Zenmuse L3. In this contribution, we assess its performance.

The main upgrade from L2 to L3 lies in the LiDAR unit: L3 uses a single 1535 nm laser instead of multiple 905 nm diodes, offers a symmetric 0.25 mrad beam divergence, and supports pulse repetition rates from 350 kHz to 2 MHz. High PRR operation is limited to altitudes ≤50 m due to missing multiple-time-around resolution. Scan modes include linear, non-repetitive, and a new star-shaped pattern.

L2 and L3 were tested at three sites in Lower Austria covering a warehouse, power-lines, and forests. Flights were conducted at 80 m AGL (350 kHz) and, for the warehouse, 50 m AGL (2 MHz). Precision, strip consistency, point density, feature separability, and vegetation penetration were evaluated using the scientific software OPALS.

L3 data showed sharper edges, reduced noise, and higher separability, yielding spline-fit residuals of 0.9 cm versus 2.6 cm for L2 for reconstructing a double-threaded power-line. Ground point coverage in forests increased from 18 % (L2) to 51 % (L3). Strip height differences are around 2 cm for both sensors and L3 achieved sub-centimeter precision on sealed surfaces. Overall, L3 offers substantial gains in spatial resolution, precision, and vegetation penetration.

8:45am - 9:00am

Geometric and radiometric Calibration of a rotating multi-beam Lidar using a rotating tilted Platform

Heikki Hyyti, Matias Mäki-Leppilampi, Harri Kaartinen

Finnish Geospatial Research Institute FGI, Finland

Intrinsic calibration of rotating multi-beam lidars (RMBL) enables more precise measurements. We calibrated our sensor to improve its geometric and radiometric accuracy using a rotating tilted platform. The rotating mechanism widens the field of view of each lidar channel and allows all lasers of the sensor to measure the same areas in a room containing planar wall and floor sections. Therefore, we can collect measurements for geometric and radiometric calibration with minimal amount of calibration targets. Furthermore, we used data based numerical minimization to estimate the calibration parameters for all 128 lidar channels in our RMBL sensor. For the intrinsic geometric calibration of the sensor, we estimated the elevation and azimuth angles of each laser. For the radiometry, we estimated a linear model for each laser to correct the intensity measurement. For a linear model, two different known diffuse reflectance targets are sufficient for the radiometric calibration. We tested our methods in two different environments, in an office room and a longer corridor. We showed that the methods can improve the precision of the RMBL sensor significantly. Regarding geometry, we were able to reduce the error on average from 16.1 mm to 15.1 mm (6.2% improvement). For radiometry, we were able to improve the reflectance measuring accuracy on average from 9.5% errors down to -0.9% errors (91% improvement).

9:00am - 9:15am

Tightly-coupled joint Adjustment of static and kinematic Laser Scanning Data

Florian Pöppl, Philipp Amon, Nikolaus Studnicka, Martin Pfennigbauer, Andreas Ullrich

RIEGL Laser Measurement Systems GmbH, Austria

In recent years, laser scanning has evolved into a core surveying technology for 3D mapping, both statically from stationary scan positions (terrestrial laser scanning, TLS) and kinematically from moving platforms (kinematic laser scanning, KLS). Consequently, there is a growing demand for methods that efficiently and coherently support both static and kinematic data acquisition modes. This contribution presents a tightly-coupled approach for the co-registration of TLS and KLS data, which simultaneously integrates GNSS positions, inertial measurements, planar features extracted from both static and kinematic point clouds, and control information in a joint non-linear least-squares adjustment. This is neither just a transformation of the kinematic onto the static point cloud nor a simple correction of the trajectory in e.g., a strip adjustment, but rather a tightly coupled adjustment of static and kinematic data. This approach avoids the need for additional survey control for kinematic data by leveraging the static scan data as a proxy, enabling accurate georeferencing even in scenarios where the individual datasets cannot be reliably tied to control points. Results show that the co-registration notably improves the relative consistency of kinematic datasets with respect to a static reference. Such co-registration enables new use-cases for multi-modal data acquisition, such as change-detection in repeated kinematic data acquisitions with respect to a static reference dataset, or more flexible ways of integrating ground control in kinematic surveys.

9:15am - 9:30am

Position and Orientation from Asynchronous Lidar in GNSS Denied Environments

Craig Glennie, Francisco Haces-Garcia

University of Houston, United States of America

This study investigates the use of a distributed asynchronous lidar system for augmented position and orientation determination in Global Navigation Satellite Systems (GNSS) denied environments. An asynchronous lidar design is one in which the laser transmitter and detectors/receivers are disconnected and carried on separate platforms. This unique geometry offers observational redundancy that can be used to estimate the trajectory of the receiver platforms. The paper presents the results of simulation experiments, first examining single epoch solutions and then considers estimates of position and orientation along simulated flight trajectories. The results show that as long as the laser transmitter is operated above the GNSS denied environment, the system is able to simultaneously estimate position and orientation for multiple receiver drones, even for extended periods of GNSS outages. The accuracy of position and orientation estimation is dependent on the exact flight path and the number of lidar receivers in the solution, but with favorable geometry the accuracy of position estimation can approach that provided by a high precision GNSS solution.

9:30am - 9:45am

Extraction of Image-to-Lidar Correspondences and their Impact on Optimal Sensor Fusion

Kyriaki Mouzakidou, Aurélien Arnaud Brun, Jan Skaloud

Earth Sensing & Observation Laboratory (ESO), Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland

This work extends our initial proof-of-concept via emulations on the benefits of relative spatial constraints between imagery and lidar point clouds in a factor graph based optimization with satellite positioning (GNSS) and raw inertial readings (Mouzakidou et al., 2025). Here, we demonstrate practically the automatic extraction and integration of 2D-3D correspondences established in the 3D domain within rough natural terrain flown over by an aircraft with sensors of high quality. We show that considering cross-domain (i.e. 2D-3D) constraints enables the calibration of internal camera parameters and its boresight on job, i.e. within mapping flight configurations, where conventional approaches fail. The common optimization of raw IMU data with such constraints improves the respective agreements between the lidar and image dense clouds, achieving consistency at ground resolution level, which is not the case for the conventional (standard) processing of acquired data.

9:45am - 10:00am

GNSS-Constrained Motion Estimation for Robust Visual-Inertial-Odometry Initialization

Chunqi Dai, Sagi Filin

Technion - Israel Institute of Technology, Haifa, Israel

Visual-inertial odometry (VIO) plays a key role in modern navigation and mapping systems.

For their successful integration, an initialization phase, in which IMU-related bias factors are estimated, becomes a fundamental step.

Without one, the subsequent nonlinear estimation of the platform pose may fail to converge or completely diverge.

As reliance on visual and inertial information may exhibit instability due to error accumulation with time, incorporating absolute positioning information through global navigation satellite system (GNSS) measurements, may enhance its robustness and accuracy.

Accordingly, GNSS and visual-inertial initialization frameworks have been receiving growing attention in recent years where current strategies tend to follow a loosely-coupled formulation that first initializes the VIO trajectory, and then aligns it with GNSS measurements.

Such strategies are multi-stage, nonlinear, and computationally expensive, motivating us to introduce an alternative framework in which GNSS position is integrated with the raw visual-inertial measurements to form absolute translation constraints.

Accordingly, we achieve a closed-form, linear and globally consistent drift-free solution which is computationally efficient and requires neither 3D reconstruction nor nonlinear refinement, as common approaches do.

Testing our initialization formulation on benchmark multi-sensor datasets, results show that we outperform current baselines while exhibiting robustness in challenging scenarios.

8:30am - 10:00am

WG IV/9B: Spatially Enabled Urban and Regional Digital Twins
Location: 714B

8:30am - 8:45am

A BIM and LLM Framework for Automated Construction and Demolition Waste Management

Nolan Porther, Amirhossein Nourbakhshrezaei, Mojgan Jadidi

Lassonde School of Engineering, York University, Canada

Artificial Intelligence (AI) integration has become an essential of modern AEC workflows, yet it has failed to gain a position in waste management. This gap is particularly prominent given the urgent environmental and legal imperatives for the sector to mitigate its demolition outputs. Existing approaches to waste classification and diversion cost estimation rely on manual interpretation of project documentation, a process that is both resource-intensive and structurally incompatible with the machine-readable data environments established by Building Information Modelling (BIM). This paper presents a framework that bridges Industry Foundation Class (IFC) compliant BIM data and Large Language Model (LLM) capabilities to automate Construction and Demolition Waste (C&DW) classification and probabilistic cost optimisation. The framework utilizes IfcOpenShell to extract element geometry and material data, channeling this information into a Retrieval-Augmented Generation (RAG) pipeline. To ensure rigorous compliance during classification, a FAISS-indexed knowledge base grounds a locally deployed Llama3 model against the specific mandates of Province of Ontario, Canada regulation 102/94. Diversion cost scenarios are computed through a Bayesian cost module coupled to a multi-objective genetic algorithm (MOGA) optimiser. Th proposed approach is evaluated against a labelled dataset of 104 IFC type-and-material combinations, the RAG classifier. Performance thresholds were established a piori based on multi-class classification benchmarks and Bayesian cost model uncertainty tolerances. The framework achieved a macro-average F1 of 0.84 and overall accuracy of 88%, satisfying the minimum criteria for automated C&DW characterization under Ontario Regulation 102/94.

8:45am - 9:00am

Open Data for large-scale geospecific 3D Simulation for Security Applications - A Case Study

Dirk Frommholz

German Aerospace Center (DLR), Germany

This case study details the integration of official large-scale open 2D and 3D geospatial data of the city of Berlin, Germany, into the Virtual Battlespace 4 (VBS4) simulator for security applications. Realistic scenery with elements specific to the target area is obtained from a digital terrain model, true-ortho mosaic, and high-resolution land use/land cover layer rasterized from OpenStreetMap vector primitives. For the central Mitte borough with its government institutions and foreign embassies, almost 20000 buildings are prepared from textured CityGML data in an automatic multi-stage process. This process involves pre-wrapping the texture images, which are referenced by the semantic 3D models using non-canonical coordinates, and the rapid creation of compact atlases to reduce the bitmap count by three orders of magnitude. To ensure that the building meshes blend seamlessly into the terrain, vertical adjustment methods are discussed, and ground extrusion is implemented to approach the model's base surfaces from below. Data import into VBS4 happens through its Geo interface for the terrain, ortho, and land cover, while the buildings are compiled into an add-on with a custom workflow that involves reprojection, collision component setup, and damage behavior configuration. During interactive convoy training in the virtual environment, a high recognition value compared to the real landscape could be attested visually. Simulation exhibited acceptable frame rates, but required considerable computing resources.

9:00am - 9:15am

An Adaptive Digital Twin Framework Based on Online Learning for Smart Water Management in Campus Buildings

Gopika Rajan, Songnian Li

Toronto Metropolitan University, Canada

Water scarcity and increasing demand have made sustainable water management a global priority, reflected in UN SDG 6, which emphasizes water-use efficiency and reducing water scarcity. Smart Water Management (SWM) has emerged as an advanced, data-driven approach that leverages ICT and IoT systems to monitor, analyze, and optimize water use. Digital Twin (DT) technology enhances SWM by creating dynamic virtual replicas of physical systems to support predictive analytics and operational intelligence. While DTs are widely used in large-scale Water Distribution Networks, these implementations typically do not require detailed 3D modelling.

Campus-scale water systems present unique challenges due to the integration of external and interior water networks, variable building functions, and the need for detailed spatial representation. This study proposes a comprehensive DT framework for Smart Water Management at Toronto Metropolitan University. It integrates BIM, GIS, sensor data, and graph-based modelling to capture 3D interior utilities and enable real-time monitoring, hydraulic simulation, and network analysis. The framework adopts Tao et al.’s five-layer DT architecture and introduces the IFCGraph Model, which combines IFC multipatch geometry with a Neo4j knowledge graph for enhanced interoperability and topological analysis. Overall, the framework supports operational intelligence, proactive management, and scalable campus-level water system optimization.

9:15am - 9:30am

An OGC standards-based Urban Digital Twin platform supporting co-creation of Positive Energy Districts: Case study of the Nordbahnhof district in Stuttgart, Germany

Rushikesh Padsala^1,3, Amando Reber², Christina Simon-Philipp², Volker Coors¹

¹Centre for Geodesy and Geoinformatics, Stuttgart Technical University of Applied Sciences (HFT Stuttgart), Stuttgart, Germany; ²Centre for Sustainable Urban Development, Stuttgart Technical University of Applied Sciences (HFT Stuttgart), Stuttgart, Germany; ³Department of Building, Civil, and Environmental Engineering, Concordia University1515 St. Catherine St. West Montreal, QC, H3G 2W1 Canada

Urban Digital Twins (UDTs) are increasingly recognized as enablers of evidence-based planning and citizen engagement. While the involvement of civil society in planning the built environment is well established, its role and motivation in advancing the clean energy transition remain largely unexplored. This paper presents the development and application of an Open Geospatial Consortium (OGC) standards-based UDT platform for the co-creation of Positive Energy Districts (PEDs), as demonstrated through the Nordbahnhof district case study in Stuttgart. The platform integrates interoperable 3D city and energy data using CityGML 2.0 with its Energy ADE 3.0 extension, both compliant with OGC standards to ensure semantic consistency and cross-domain interoperability. SimStadt energy simulation results are stored in the Energy ADE schema within PostgreSQL/3DCityDB database. These data are published through an OGC Web Feature Service (WFS), while 3D city geometries are served as 3D Tiles. In the CesiumJS web-viewer, both services are linked via GML identifiers, enabling coordinated interaction between geometry and energy data for real-time visualization of the district-scale energy balance. The platform was tested with citizens, who learned about load profiles, photovoltaic (PV) potential, and energy efficiency while acting as “district energy planners.” Their responses/willingness to adopt PV and/or modify energy-use behavior were translated into slider inputs to visualize real-time energy-balance outcomes through the platform. Results demonstrate the potential of interoperable, OGC-compliant UDTs to connect data providers, planners, and citizens in a shared decision-support environment. The architecture’s open, modular design enables wider replication, promoting scalability and long-term municipal adoption for participatory energy-transition planning.

9:30am - 9:45am

Developing BIM-Based Data Analytics Dashboards for Sustainable Construction and Demolition Waste Management and Environmental Evaluation

Sara Abbasian, Mojgan Jadidi

Department of Civil Engineering, Lassonde School of Engineering, York Univeristy, Canada

Building Information Modeling (BIM) is increasingly mandated worldwide as part of the digital transformation of the construction industry. While widely used in design and construction, its potential for managing construction and demolition waste (C&DW) remains underexplored, despite demolition accounting for 70–90% of building-related waste and 30–40% of global solid waste. Revit models provide rich data but are computationally intensive and require specialist expertise, limiting their direct use for waste quantification and sustainability evaluation. This study develops a BIM-enabled data integration and visualization framework that automates waste estimation, material classification, and environmental evaluation by linking BIM data with heterogeneous datasets through Speckle connectors and Power BI dashboards. Supplementary datasets included material densities, expansion coefficients, recycling rates, and environmental factors such as CO₂ emissions and energy intensities. A case study of York University’s Bergeron Centre illustrates the framework’s effectiveness across three demolition stages. The non-invasive dismantling phase highlighted significant opportunities for material recovery, while semi-invasive deconstruction captured recyclable structural components with moderate landfill requirements. The final core demolition stage revealed the greatest potential for recycling, particularly in concrete and steel, though it also underscored the challenges of diverting large volumes of residual waste from disposal. By integrating BIM with environmental datasets and interactive dashboards, the system delivered holistic insights into recovery, landfill diversion, and CO₂ reduction. Findings confirm its scalability, accessibility, and value as a decision-support tool for sustainable demolition and circular economy objectives.

9:45am - 10:00am

Urban Intervention Effects on Land Surface Temperature: A Prototype EO-Based Simulation Framework for Urban Digital Twin Applications

Daniele Oxoli, Alberto Vavassori, Maria Antonia Brovelli

Dept. of Civil and Environmental Engineering, Politecnico di Milano, Milano, Italy

This contribution presents a prototype Earth Observation-based simulation framework to assess how large-scale urban interventions affect Land Surface Temperature (LST). Focusing on the Metropolitan City of Milan (Northern Italy), the framework integrates thermal (Landsat 8/9) and multispectral (Sentinel-2) satellite imagery with Local Climate Zone (LCZ) maps, urban morphology and material fraction layers. Random Forest regression models are trained to predict seasonal LST patterns. A simulation module, based on raster algebra, enables scenario testing by modifying predictor layers to reflect planned urban transformations, generating corresponding LST responses. The framework is conceived for integration into Urban Digital Twin platforms to support “what-if” scenario analyses for climate-resilient urban planning and adaptation.

8:30am - 10:00am

WG III/8H: Remote Sensing for Agricultural and Natural Ecosystems
Location: 715A

8:30am - 8:45am

Integrating multi-source remote sensing and soil attributes through ensemble learning for large-scale soil organic carbon estimation

Jayantrao Diliprao Mohite¹, Suryakant Ashok Sawant¹, Danielle Berard², Sonali Kulkarni¹, Ankur Pandit¹, Dineshkumar Singh¹

¹Tata Consultancy Services, India; ²EMILI, Manitoba, Canada

Accurate estimation of Soil Organic Carbon (SOC) is essential for sustainable land management, agricultural productivity, and climate change mitigation. This study presents a novel framework for SOC estimation using machine learning models and diverse predictors, including spectral bands, vegetation and soil indices, topographical features, soil texture components, and HSV-derived soil color proxies. SOC data from 180 samples collected between 2007 and 2020 across 21 fields in Manitoba, Canada, were used for model training and validation. Landsat 5, 7, and 8 data were utilized to extract spectral and soil indices, while SoilGrids and SRTM DEM provided texture and topographical features. Random Forest (RF), Extreme Gradient Boosting (XGB), and a BC-VW-based ensemble model were evaluated across five feature scenarios. The ensemble model achieved the highest accuracy, with an R² of 0.57, RMSE of 0.25, and RMSPE of 7.87%, outperforming individual models. SHAP-based feature selection identified Clay%, SWIR1, and Value (HSV) as the most critical predictors. Independent validation using data from 2021 and 2023 confirmed the model's robustness, with RMSPE values of 10.93% and 12.83%, respectively. This study demonstrates the importance of integrating soil-specific indices, texture, and color features with ensemble modeling to improve SOC predictions. The framework offers a scalable and reliable approach for large-scale SOC monitoring, contributing to sustainable agriculture and carbon sequestration efforts. The findings underscore the need for robust uncertainty analysis and independent validation, setting a benchmark for future SOC modeling studies.

8:45am - 9:00am

Leveraging Post-Rainfall Spectral Proxies and Multi-Sensor Imagery to Refine Soil Salinity Maps in Dryland Environments

Jamal-Eddine OUZEMOU^1,2, Ahmed LAAMRANI^1,2,3,4, Ali EL BATTAY^2,5,6, Joann Whalen^1,2, Abdelghani Chehbouni^1,2

¹Center for Remote Sensing Applications (CRSA), Mohammed VI Polytechnic University (UM6P), Ben Guerir 43150, Morocco; ²College of Agriculture and Environmental Sciences (CAES), UM6P, Ben Guerir 43150, Morocco; ³Department of Geography, Environment & Geomatics, University of Guelph, Guelph, ON N1G 2W1, Canada; ⁴Institut de Recherche sur les Forêts (IRF), Université du Québec (UQAT), Rouyn-Noranda, Québec, Canada; ⁵Center for Sustainable Soil Sciences (C3S), UM6P, Ben Guerir 43150, Morocco; ⁶Department of Natural Resource Sciences, McGill University, Ste-Anne-de-Bellevue, Québec, Canada

Soil salinization is a major form of land degradation in drylands, where closed hydrological systems, shallow water tables, and strong evaporative demand favor the recurrent buildup of salts at the surface. Accurate and spatially explicit salinity assessment is crucial for guiding agricultural management and land rehabilitation, yet conventional soil sampling remains spatially restrictive and most remote-sensing approaches insufficiently capture the hydrological and pedological processes that drive seasonal salt redistribution. This study evaluates whether post-rainfall spectral information can improve soil salinity mapping in a large endorheic depression in central Morocco (Sehb El Masjoune). A dataset of 121 ECe-measured topsoil samples was combined with multi-sensor optical imagery from Sentinel-2, Landsat-9, and PlanetScope. In addition to standard salinity, soil, vegetation, and moisture indices, two new post-rainfall predictors were developed: a Depression Proxy (DP), delineating moisture-retentive micro-depressions where salts accumulate, and a Soil Cluster Proxy (SCP), capturing soil textural and compositional contrasts from spectral responses. These predictors were integrated into Random Forest and Gradient Boosting Regressor models and evaluated using repeated cross-validation on Box–Cox-transformed ECe. The combination of DP and SCP with Sentinel-2 predictors yielded the highest performance (R² = 0.92; RMSE = 20.53 dS·m⁻¹), outperforming models relying only on spectral indices and topographic covariates. Seasonal salinity maps revealed strong intra-annual dynamics associated with rainfall events and subsequent evaporative concentration. The proposed DP–SCP framework offers transferable, physically interpretable predictors for dryland salinity assessment and provides a scalable step toward process-informed remote-sensing approaches supporting climate-resilient land-use planning.

9:00am - 9:15am

Enhancing Soil Nitrogen Mapping Using Reconstructed Water Vapor Bands in PRISMA Hyperspectral Imagery

Nadir El Bouanani^1,2, Ahmed Laamrani^1,3, Jamal-Eddine Ouzemou¹, Adam Bouhnidira², Hamd Ait Abdelali², Mohamed Bourriz^1,2,4, Francois Bourzeix², Abdelghani Chehbouni¹

¹CRSA, Mohammed VI Polytechnic University (UM6P), Campus Ben Guerir 43150, Morocco; ²Analytic Laboratory (Alab), UM6P, Campus Rabat 11103, Morocco; ³Department of Geography, Environment & Geomatics, University of Guelph, Guelph, ON N1G 2W1, Canada; ⁴Friedrich Schiller University Jena, Department of Geography, Jena 07743, Germany

Soil total nitrogen (TN) is a critical nutrient for sustainable agricultural management, yet large-scale mapping remains constrained by high laboratory analysis costs. Spaceborne hyperspectral remote sensing offers a promising alternative, but its effectiveness is limited by spectral gaps caused by atmospheric water-vapor absorption in nitrogen-sensitive NIR and SWIR regions. This study evaluates the contribution of reconstructing missing spectral domains to improve soil TN estimation from PRISMA hyperspectral imagery. A spectral gap-filling framework combining a conditional generative adversarial network (cGAN) with a self-supervised masked autoencoder pretraining strategy was developed to reconstruct reflectance spectra across water-vapor absorption intervals (950–990 nm, 1320–1500 nm, and 1780–2050 nm), achieving R² = 0.95 on PRISMA test data and R² = 0.91 against ASD FieldSpec III measurements. Applied to 1,037 samples across three Moroccan agricultural regions, incorporating reconstructed bands consistently improved TN prediction: R² increased from 0.83 to 0.89 in Al Haouz, 0.73 to 0.79 in Doukkala, with R² = 0.73 in Khouribga. Feature-selection analyses identified reconstructed water-vapor bands among the most informative predictors (1050–1450 nm, 1800–2100 nm, and 2300–2400 nm). These findings demonstrate that spectral gap filling enhances spaceborne hyperspectral data usability for operational soil TN monitoring and precision agriculture.

9:15am - 9:30am

Evaluation of a High-Resolution L-Band RPAS-Mounted Sensor for Soil Moisture Estimation

Hunter Rusk¹, Aaron Berg¹, Jaison Thomas Ambadan¹, Alexander McLaren¹, Laxhman Ramsahoi¹, Maik Wolleben²

¹University of Guelph, Canada; ²Skaha Labs, Canada

This study investigates the performance of a novel L-band passive microwave radiometer mounted on a Remotely Piloted Aerial System (RPAS) for high-resolution soil moisture retrieval. Soil moisture is a critical variable for predicting crop stress, scheduling field operations, and optimizing irrigation, yet traditional measurement approaches have limitations. Satellite radiometers provide broad spatial coverage but coarse resolution, while in situ sensors offer high accuracy with limited spatial representativeness. RPAS-based sensing offers an intermediate solution, enabling fine-scale mapping with flexible deployment. The sensor evaluated in this research, developed by Skaha Remote Sensing Ltd., measures brightness temperature (Tb) at 1.4 GHz, a frequency where soil emissivity varies strongly with moisture content.

Field campaigns were conducted from May to October 2025 at the Elora Research Station in Ontario, with weekly flights over plots containing different crops and tillage conditions. Concurrent ground measurements of soil moisture, leaf area index (LAI), and vegetation water content (VWC) supported evaluation of vegetation impacts. Statistical analyses, including Pearson correlation and linear regression, revealed the relationships between microwave emissions, soil moisture, and vegetation properties.

Results show a strong inverse relationship between microwave emissions and soil moisture, with vertically polarized signals exhibiting the highest sensitivity. Vegetation effects were crop-dependent due to the unique canopy structures. These findings demonstrate that RPAS-mounted radiometers can provide reliable, high-resolution soil moisture measurements and highlight the importance of crop geometry in interpreting microwave observations.

9:30am - 9:45am

Unmasking drought dynamics: a physically interpretable GMM-MST framework for high-resolution diagnostic monitoring

Jinhui Hu¹, Qiuwen Zhang², Changtao Deng³

¹Huazhong University of Science and Technology - Main Campus; ²Huazhong University of Science and Technology - Main Campus; ³Pearl River Water Resources Research Institute

Drought represents one of the most devastating natural hazards, causing billions in economic losses and threatening global food security. Conventional single-variable drought indices often fail to capture drought's multifaceted nature, while existing composite indices are frequently constrained by linear assumptions or operate as 'black boxes,' obscuring physical drivers. This study introduces the State-Space Gradient Drought Index (SSGDI), developed via a novel Gaussian Mixture Model–Minimum Spanning Tree (GMM–MST) framework that re-conceptualizes drought as a trajectory within a physical system. By modeling a 3D state-space composed of the Standardized Precipitation Index (SPI), Standardized Soil Moisture Index (SSMI), and Standardized Runoff Index (SRI) with a Gaussian Mixture Model (GMM), the framework learns distinct hydro-climatic archetypes; a Minimum Spanning Tree (MST) then imposes physically plausible connections among these archetypes to define the principal wet-to-dry gradient. The final SSGDI is derived from a data point's probabilistic position along this gradient and is complemented by a classification system that diagnoses the drought's physical type. Applied to the Central China Triangle, the framework successfully uncovered the hydro-climatic system's intrinsic, non-linear structure. Validation showed the SSGDI provides a significantly more robust measure, with SSGDI-6 achieving a spatially-averaged Pearson correlation of r = 0.80 against the PDSI benchmark—a marked improvement over any single component. The SSGDI framework bridges robust statistical aggregation with clear physical interpretation, offering a powerful tool that provides not just a severity score but a diagnostic narrative for proactive drought management.

9:45am - 10:00am

Applications of Coherent Fine Resolution Synthetic Aperture Radar Imagery for Mid-Season Corn Yield Prediction

Abbey Papadimitriou¹, John Lindsay¹, Aaron Berg¹, Qiaoping Zhang²

¹University of Guelph, Canada; ²ICEYE Oy, Finland

Synthetic Aperture Radar (SAR) has become a popular form of remotely sensed data for agricultural management due to its ability to acquire cloud-free images at extremely high temporal resolutions. A particularly useful product that can be derived from SAR imagery is coherence, which visualizes structural target changes over time based on phase decorrelation. In a crop management context, coherence is largely unexplored. This is in part due to the fine resolution image requirements that field-scale vegetation monitoring demands. Within agricultural fields, high image coherence should correlate to areas with minimal to no crop growth, whereas low image coherence should correlate to areas where crops are consistently growing. Based upon this hypothesis, our research investigates the applications an ICEYE fine spatial resolution X-band SAR imagery time series has for detecting low yielding regions within corn fields using coherent change detection.

8:30am - 10:00am

WG IV/1C: Spatial Data Representation and Interoperability
Location: 715B

8:30am - 8:45am

Hierarchical Polygon-to-Point Collapsing for Multi-Scale Representation Based on the Straight Skeleton and Dual Half-Edge Data Structure

Amin Gholami¹, Pawel Boguslawski¹, Martijn Meijers²

¹Wroclaw University of Environmental and Life Sciences, Institute of Geodesy and Geoinformatics, Grunwaldzka 53, 50-357 Wroclaw, Poland; ²GIS Technology, Faculty of Architecture and the Built Environment, Delft University of Technology, Julianalaan 134, 2628 BL Delft, The Netherlands

This paper presents a hierarchical method for collapsing a polygon to point within a structured multi-scale representation. The approach is based on the straight skeleton, which drives the shrinking process through event-based transformations such as edge and split events. These events define how the polygon changes during collapse and produce a hierarchy of intermediate geometric states between the initial polygon and the final point.

The resulting hierarchy is integrated into a Dual Half-Edge (DHE) structure, where the primal space represents successive geometric states and the dual space represents the hierarchical relations between them. This produces a connected 2D+1D representation in which the third dimension corresponds to scale rather than physical height. The resulting model is interpreted as a LoD Transition Space (LTS), allowing the full polygon-to-point transition to be represented continuously across scale.

The proposed framework contributes to model-based multi-scale representation by explicitly linking geometric transformation, topological change, and hierarchical structure within a unified representation. In addition to its relevance for vario-scale cartography and generalisation, the method also has potential applicability in domains where gradual geometric transformation is required, such as procedural modeling, animation, and related geometric applications.

8:45am - 9:00am

The Research on Renewal Theory and Method for the CGCS2000 Reference Framework

Zhihao Jiang, Ju Bai, Hao Yu, Yunlu Peng, Linlin Che, He Zhang

National Geomatics Center of China

The CGCS2000 (China Geodetic Coordinate System 2000) reference framework, which has been employed since July 1, 2008 is based on the ITRF97 reference framework and only meets the application requirements of China's regional. With the sustained development of China's economy and society, and the globalization of the applications of BeiDou navigation satellite system (BDS), there is a need to establish global CGCS2000 reference framework. This paper studies mathematical method for construction Global CGCS2000 reference framework, the theory and algorithm of two-step method with the inner constraints theory is analysed. The constraint conditions of coordinate reference are redefined according to the minimum standard of frame transition parameters and rate variation. As a result, the adjusted network enjoys the highest degree of fitting to the shape of the initial network and maintain the inherent purity of the coordinate network using different observation technologies, this research result can improve the basic theory of terrestrial reference framework determination, and provide scientific methods for the globalization of the CGCS2000.

9:00am - 9:15am

Open Source 3D Cadastre Visualisation Pipeline

Pavan Sai Goud Goddu, Sisi Zlatanova, Mohsen Kalantari

University of New South Wales, Australia

Interpreting multi-storey property rights is difficult when information is scattered across 2D plans and text or locked inside desktop projects. We present a web-based pathway that communicates strata lots and common property consistently across levels in a standard browser. Aligned with the 3D Cadastral Survey Data Model and Exchange (3D CSDM) of Australia, we propose an open-source, web-first approach. The method couples a lightweight browser viewer (level/tenure filters, plan overlay, search, readable legend) with an explicit conversion step that standardises common GIS inputs into a fixed core JSON profile, with limited official CSDM-aligned JSON-LD hooks applied only to selected keys that have exact matches in the published vocabularies. Using a New South Wales case study, we evaluated the viewer against ISO 9241-11 criteria (effectiveness, efficiency). Across repeated trials (cache disabled/enabled), mean page-open times were 0.60 s (Chrome) and 1.48 s (Edge); interaction averaged 50–60 FPS; level filters applied in 40–55 ms; all five tasks succeeded. Practically, this delivers fast, consistent 3D communication of lots and common property without installs, lowering access barriers for agencies and owners while aligning with 3D CSDM’s web-first direction. Next, we will finalise parity between Upload-and-View and the Reference Viewer and add a light in-viewer validation panel.

9:15am - 9:30am

Shadow Geometric Analysis Utilising CityGML Models and FME

Pawel Boguslawski¹, Malgorzata Jarzabek-Rychard¹, Stanislaw Biernat²

¹Wroclaw University of Environmental and Life Sciences, Poland; ²infoSolutions Sp. z o.o.

This research presents a methodology for conducting shadow geometric analysis, specifically the shadow boundary in an urban model. Input data include a georeferenced CityGML LoD2 and terrain model. Additional land cover data is used to exclude some parts of the model from analysis. Shadow computation is based on a sunray vector, which is computed based on the sun position on the given day and time. The geometry of original models are divided into parts classified as either exposed to the sun or shaded. It can be used for analytical purposes in other applications, such as urban planning, energy assessment, and photovoltaic potentiality analysis, by accurately identifying sunlit and shaded areas within 3D city models. The analysis is performed in the FME software package, which is a general-purpose ETL tool.

9:30am - 9:45am

Software Development for Producing Texture Images Mapped on a Building Surface of a 3D City Model Using Aerial Images

Ryuji Matsuoka, Masato Ishikawa, Tomoaki Inazawa, Yoshihiko Nakanishi, Futa Kawamata, Masahito Takada, Takuya Danjo

Kokusai Kogyo Co., Ltd., Japan

It is desirable that a 3D city model at level of detail 2 (LOD2) has texture images mapped on building surfaces. Owing to the cost of image collection, it would be the best way to use aerial images for texture mapping at present. Although aerial oblique images provide higher-resolution texture images, using aerial oblique images has a major issue of occlusion. Accordingly, we develop software for texture mapping to a 3D city model using aerial nadir and oblique images, aiming to minimize the impact of occlusion. The software designed to be used in ordinary operation includes the features of automatically detecting occlusions on building surfaces within images by utilizing the geometry of a 3D city model and automatically selecting appropriate oblique and nadir images for texture mapping. The major feature of the developed software is its ability to process grid by grid on a building surface. The validation experiment results confirm the software's satisfactory performance in practice. Moreover, the experiment results indicate that the performance of the software depends on the ability of a 3D city model to represent buildings. Since we have recognized that it would be effective if each pixel of a texture image has its own resolution, we plan to modify the software so that each pixel can have its own resolution.

9:45am - 10:00am

Automatic detection and condition assessment of agricultural plastic greenhouses using deep learning and aerial rgb images

Davoud Omarzadeh¹, Mehran Alizadeh Pirbasti², Hamed Bahrevar³, Hoda Khalaghi⁴, Gavin McArdle², Bahram Salehi⁵

¹Institut d’Estudis Espacials de Catalunya (IEEC), Barcelona, Spain.; ²School of Computer Science, University College Dublin, Dublin, Ireland.; ³University of Tabriz, East Azerbaijan, Iran.; ⁴Universitat Autònoma de Barcelona, Barcelona, Spain.; ⁵State University of New York College of Environmental Science and Forestry (SUNY ESF), Department of Environmental Resources Engineering, Syracuse, USA.

Rapid urbanization in developing countries such as Iran has intensified pressure on agricultural land, highlighting the need for sustainable and efficient food production systems. Agricultural Plastic Greenhouses (APGs) have become a scalable alternative by enabling year-round cultivation and optimized land utilization. However, their rapid expansion necessitates continuous monitoring to evaluate structural integrity and environmental impacts, including soil degradation, plastic waste accumulation, and water consumption. This study presents a deep learning-based framework for the automated detection and condition assessment of APGs using 0.5~m resolution Google Earth imagery across four major agricultural regions in Tehran County: Pakdasht, Qarchak, Pishva, and Varamin. The proposed pipeline integrates YOLOv11 for precise APG segmentation with a U-Net architecture employing a MobileNetV2 backbone for classifying damaged and intact structures. Out of 158,912 analyzed image tiles, 6,835 contained APGs, covering an estimated area of 18.73~km\textsuperscript{2}. Among these, 1,863 damaged structures were identified, predominantly located in Pakdasht and Pishva. Around 20\% of the annotated greenhouses were verified on-site, improving labeling reliability, and the relatively standardized design of APGs in Iran suggests the model could generalize to similar regions, with minor fine-tuning using local samples if necessary. GIS-based spatial analysis further delineated potential plastic waste risk zones, supporting targeted environmental management. Comparison with government statistics and Sentinel-2 imagery from 2021 and 2024 revealed a continued shift toward greenhouse farming in response to declining cropland availability. The proposed framework provides a scalable and replicable tool for periodic APG monitoring, facilitating data-driven policymaking and sustainable agricultural planning.

8:30am - 10:00am

IvS1: Recent Advances in Iceberg Monitoring and Tracking
Location: 716A

8:30am - 8:45am

Ocean Target Discrimination in SAR Imagery through Machine Learning: Towards a Fully Automated Approach

Maria Yulmetova, Murilo Silva, Pradeep Bobby, Kelley Dodge

C-CORE, Canada

Accurate discrimination of ocean targets using satellite images is crucial for marine safety, environmental monitoring, dark vessel detection, and search and rescue operations. Artificial intelligence technologies are rapidly advancing as state-of-the-art solutions for computer vision problems, including satellite imagery target classification. This research assesses the capability of machine learning (ML) for ocean target discrimination using SAR images. Unlike other studies focusing on binary iceberg-ship classification, this paper goes a step further to investigate the opportunity for multi-class discrimination between icebergs, ships, and false alarms, both within and outside sea ice. The proposed approach enables the fully automated elimination of false alarms while accurately classifying icebergs and ships. As part of a research initiative, the first large dataset of ocean targets was compiled and utilized to train an ML model. The targets were detected in RADARSAT Constellation Mission (RCM) images over Canadian waters. During the evaluation phase, the model achieved classification accuracies of 93% for binary classification and 95% for three-class discrimination. The robustness of the fully automated approach was further validated through an additional test, yielding an overall accuracy of 91%. Moreover, the system exhibited high reliability in reducing false alarms, correctly identifying 96% of them. The implementation of the developed algorithms significantly enhances the efficiency of target detection and classification processes, thereby reducing the workload of human analysts. Such advancements are especially significant in light of the rapidly increasing volume of satellite data and the growing demand for automated, scalable solutions in maritime surveillance.

8:45am - 9:00am

Is Pre-Training Enough? Towards Multi-Task Foundation Models for Sea Ice Classification

Javier Noa Turnes¹, Muhammed Patel¹, Fernando J. Pena Cantu¹, Linlin Xu², David A. Clausi¹

¹University of Waterloo, Canada; ²University of Calgary, Canada

Synthetic aperture radar (SAR) is the primary data source for operational sea ice monitoring, providing coverage independent of illumination or weather conditions. However, annotation scarcity and the domain gap between sea ice and land based scenes hinder the direct reuse of existing pretrained models. Recent studies \cite{Allen2023,Wang2025} point toward self-supervised learning (SSL) as a way to leverage abundant unlabeled SAR imagery. In particular, masked autoencoders (MAE) \cite{He_2022_CVPR} have shown promise in remote sensing contexts by reconstructing masked inputs and learning transferable representations. We investigate whether MAE pre-training is sufficient to yield a foundation model transferable across multiple downstream sea ice tasks: concentration (SIC), stage of development (SOD), and floe size (FLOE).

9:00am - 9:15am

Automated Iceberg Detection in RADARSAT Constellation Mission (RCM) Imagery

Abigail Dalton, Lynn Pogson, Mélanie Lacelle, Benjamin Deschamps

Environment and Climate Change Canada (Canadian Ice Service), Canada

Since the 1980s, the Canadian Ice Service (CIS) has provided iceberg information for navigation in the North Atlantic. Following the breakup of the Milne Ice Shelf on Northern Ellesmere Island in 2020 and increasing risk to ships navigating bergy waters in the Canadian Arctic Archipelago and Beaufort Sea, CIS has initiated two projects with the goal of improving their operational iceberg monitoring program. The first combines RCM imagery and in-situ observations to evaluate the applicability of existing automated detection and modelling methods for monitoring icebergs and ice islands drifting in open water in the western Arctic. The second explores the use of high-resolution RCM imagery (5M and 16M) for emergency response iceberg monitoring.

9:15am - 9:30am

Automatic Segmentation of SAR imagery Using Mixture Models

Zahra Jafari^1,2, Pradeep Bobby^1,2, Rocky Taylor¹, Ebrahim Karami¹

¹Memorial University of Newfoundland; ²C-Core, Canada

Synthetic Aperture Radar (SAR) image segmentation underpins target detection, land cover classification, and environmental monitoring, yet remains challenging due to speckle, non-Gaussian backscatter statistics, and outliers. This paper presents a comparative evaluation of mixture-model–based segmentation tailored to SAR, with a focus on Radarsat Constellation Mission (RCM) imagery. We propose a segmentation algorithm that selects one of three statistical mixture models—Rayleigh, Gamma, or Lognormal—to model SAR backscatter and produce soft (posterior) segmentations, followed by posterior thresholding and optional MRF‑ICM post‑processing to enhance spatial coherence and suppress speckle-induced errors. We compare against traditional threshold-based methods (CFAR, multi-threshold Otsu) and conventional mixture-model labeling that designates the largest-scale component as the target.

On RCM data, the Rayleigh Mixture Model (RMM) is the strongest: at target pixels, the posterior probability of the largest-mean component is typically very close to 1, allowing a single Rayleigh component to capture the main body of the iceberg reliably. Unlike threshold-based baselines that yield hard segmentations, our Mixture Model (MM) approach outputs soft posteriors, enabling principled HH/HV fusion and downstream machine learning (ML). These results underscore the promise of RMM for robust iceberg detection; future work will integrate Rayleigh-based posterior features with lightweight ML classifiers to further improve performance across sensors and conditions.

9:30am - 9:45am

Cross - Sectional Morphology of Sea Ice features from IPS observations across the Newfoundland and Labrador shelf

Alka Dash¹, Rocky Taylor¹, Ian D. Turnbull²

¹Memorial University of Newfoundland, Canada; ²C-Core, St. John's, Canada

Sea ice on the Newfoundland and Labrador shelf can create major risks for ships and offshore structures. This study uses Ice Profiling Sonar and upward looking ADCP data from three moorings on the Northeast Newfoundland Shelf to examine the cross sectional morphology of important sea ice features. The data were converted from time series to spatial draft profiles using measured ice drift. From these profiles, level ice, keel features, and floes were extracted and compared across the three locations.

The results show that level ice and keels form clearly different morphological populations. Keels are generally deeper, narrower, rougher, and more peaked, while level ice is wider, smoother, lower in relief, and more rectangular in cross section. Maximum draft, mean draft, width, relief range, aspect ratio, rectangularity, and roughness provide the clearest separation between the two classes. The study also examines floe size to better understand how local ice features form. Small floes contain a higher proportion of keel features, while medium, big, and vast floes are more strongly dominated by level ice, although this pattern varies by site. NENS3 shows a higher keel fraction across floe size classes than NENS2, suggesting stronger and more persistent deformation. These findings provide new regional information for sea ice characterization and ice interaction studies.

8:30am - 10:00am

Forum2A: The Future of Space- based Earth Observation
Location: 716B

8:30am - 10:00am

ICWG III/IVb: Remote Sensing Data Quality
Location: 717A

8:30am - 8:45am

MAPSRNet: Task-Oriented Super-Resolution Network for Building Detection in Urban Area

Yuwei Cai, Zhimeng He, Meiliu Wu, Brian Barrett

University of Glasgow, United Kingdom

High-resolution (HR) satellite imagery is essential for urban monitoring and disaster management, but its use is constrained by high cost and limited accessibility. Super-resolution (SR) offers an efficient alternative by reconstructing high-quality images from low-resolution (LR) inputs, making large-scale geospatial analysis more feasible. We propose the Multi-Attention Pyramid Super-Resolution Network (MAPSRNet), which delivers two main innovations:

1. A multi-attention model that integrates a Pyramid Vision Transformer for long-range spatial dependencies with a cross-channel Involution+ module to enhance feature interactions, generating SR images with superior structural preservation and sharper boundaries.

2. The first SR network to surpass the performance of original HR images in downstream tasks, demonstrated through building detection with a ConvNeXtV2 backbone and U-Net decoder. MAPSRNet reduces false positives and negatives and, across multiple datasets, exceeds HR performance in IoU, F1-score, and overall accuracy.

Extensive experiments on the Massachusetts building dataset, the Wuhan University building dataset, and the Waterloo building datasets confirm that MAPSRNet consistently outperforms representative SR methods in both image fidelity (PSNR, SSIM) and task-level metrics. Its ability to preserve fine structural details, suppress background noise, and learn resolution-invariant features through multi-resolution training makes the reconstructed images more task-aware than raw HR data. Beyond buildings, this flexibility suggests strong potential for generalization to other land-cover classes such as roads, vegetation, and water bodies.

These results establish MAPSRNet as a cost-effective alternative to HR acquisitions and a milestone in task-driven SR research, advancing both image reconstruction and downstream geospatial analysis.

8:45am - 9:00am

Automated Monitoring of Geolocation Consistency in Micro-satellite SAR Imagery

Angel Caroline Johnsy¹, Eyrin Kim², Qiaoping Zhang¹, Valentyn Tolpekin¹, Michael Wollersheim¹

¹ICEYE, Finland; ²Stanford University, USA

High revisit-rate SAR constellations generate large volumes of imagery that require consistent geolocation accuracy to support applications such as change detection and interferometry. However, variations in orbit determination, attitude knowledge, and external factors such as Global Navigation Satellite System (GNSS) interference can introduce geolocation errors that vary across acquisitions, making large-scale validation challenging. This study presents an automated approach to detect and quantify geolocation offsets in ICEYE SAR imagery by aligning orthorectified scenes with reference images using feature-based matching and correlation-based refinement. The method is validated against independently derived absolute geolocation measurements from corner reflector calibration sites in the United States, Canada, Australia, and Poland. Evaluation across 726 acquisitions demonstrates strong agreement with reference measurements, achieving an overall root-mean-square error (RMSE) of 1.39 m, with RMSE values of 1.18 m for Spotlight mode and 1.93 m for Stripmap mode. Operational applicability is demonstrated through large-scale acquisition campaigns, including nationwide Stripmap coverage over Japan and coherent image stack analysis. The results show that the proposed method can reliably estimate geolocation offsets, detect anomalies, and monitor geometric consistency across large SAR archives, providing a practical and scalable solution for automated geolocation quality control in micro-satellite SAR constellations.

9:00am - 9:15am

Calibrated U-Net with HELIX-Based Label Enrichment for Ageing-Aware Spatio-Temporal Urban Change Detection

Sarah Hauser^1,2,3, Stephanie Dachsberger², Andreas Schmitt^2,3, Stefan Hinz¹

¹Karlsruher Institut für Technologie (KIT), Germany; ²Geoinformatics Department, Munich University of Applied Sciences (HM); ³Institute for Applications of Machine Learning and Intelligent Systems (IAMLIS)

Urbanisation and land-use change increase the demand for temporally consistent urban maps from high-resolution Earth observation imagery. A key obstacle is label ageing: benchmark annotations are often years older than current true orthophotos (TOP), causing semantic and geometric mismatches (e.g., demolished/new buildings, shifted vegetation boundaries) that degrade supervised learning, calibration, and transfer. This paper presents a probabilistic, quality-aware segmentation framework based on a compact U-Net. Legacy annotations are converted into edge-adaptive soft labels to encode boundary uncertainty. A HELIX-derived per-pixel supervision quality score Q is computed and integrated as a weight in a Q-weighted Kullback--Leibler objective with an agreement-focal component, reducing the influence of unreliable or outdated regions. Global temperature scaling is then applied to obtain calibrated per-class probability fields with comparable confidence magnitudes. Experiments on ISPRS Potsdam and Vaihingen combined with recent (2024) TOPs evaluate temporal transfer (archival supervision vs. updated imagery of the same area) and spatial transfer (cross-city application). Finally, calibrated probability fields are used to derive probabilistic semantic transitions and temporal reliability scores, supporting uncertainty-aware mapping of urban change such as construction, sealing, and vegetation loss.

9:15am - 9:30am

The survivorship bias in remote sensing

Laurent Polidori¹, Saeid Pirasteh²

¹UFPA, Brazil; ²Shaoxing University, China

Survivorship bias refers to the fact that conclusions are drawn from a non-representative sample limited to cases that have survived a selection process. This article shows that this bias affects scientific literature, which tends to select successful experiments and hide failures. Remote sensing, like other data-driven sciences, is affected by survivorship bias, making it difficult to have a clear idea of the data's and methods' actual potential and limitations. A typology of failure causes is proposed to encourage critical reading of the bibliography, and perspectives are outlined to overcome survivorship bias by improving practices within the academic and industrial remote sensing communities.

9:30am - 9:45am

A dynamically weighted framework for adaptive reference-based super-resolution

Chae-Eun Kim¹, Junhwa Chi²

¹Department of Data Engineering, Pukyong National University, Busan, Republic of Korea; ²Major of Big Data Convergence, Division of Data Information Sciences, Pukyong National University, Busan, Republic of Korea

Satellite remote sensing is inherently constrained by a fundamental spatio-temporal trade-off by physical sensor limitations. Super-Resolution (SR) techniques are required to overcome these constraints and obtain high-resolution time-series data. However, Single Image Super-Resolution (SISR) provides insufficient information for robust restoration. To address this, Reference-Based Super-Resolution (Ref-SR), which utilizes a high-resolution (HR) reference (Ref) image, has been investigated. Nonetheless, Ref-SR introduces the challenge of reference misuse, stemming from the temporal mismatch (or inconsistency) between the target low-resolution (LR) image (e.g., clouds, seasonal changes) and the Ref image (often a long-term median composite). To address this reference misuse problem, this study proposes an adaptive Ref-SR framework that incorporates a similarity weight map derived from the LR and Ref information. This weight map is computed solely from the pixel-wise similarity between the LR and Ref inputs, requiring no ground truth HR, and functions as a gating mechanism. This allows the network to dynamically control Ref reliability, guiding it to suppress Ref influence in mismatched regions and leverage its textures in similar ones. Validation experiments using Sentinel-2 data (LR 240m, Ref/HR 60m) demonstrate that the proposed method achieves significant performance improvements over SISR in both spatial (Peak Signal-to-Noise Ratio, Structural Similarity Index) and spectral (Spectral Angle Mapper, Error Relative Global Dimensionless Synthesis) metrics. Furthermore, qualitative analysis confirms that the framework effectively suppresses artifacts caused by the blind injection of Ref textures in inconsistent areas. This framework could contribute to the future fusion and quality enhancement of heterogeneous LR sensor data, such as GOCI-II.

9:45am - 10:00am

Ground Based Observation for Validation (GBOV): Extension Of The Analysis Ready Validation Data Service

Christophe Lerebourg¹, Rémi Grousset¹, Jean-Sébastien Carrière¹, Jadu Dash², Somnath Paramanik², Finn James², Zaib Un Nisa², Ana Pérez-Hoyos³, Darren Ghent⁴, Jasdeep Anand⁴, Ritika Shukla⁴, Jan-Peter Muller⁵, Rui Song⁶, Marco Clerici⁷, Nadine Gobron⁷

¹ACRI-ST, France; ²University of Southampton; ³Albavalor; ⁴University of Leicester; ⁵Blue Sky Imaging; ⁶EarthRayView; ⁷7EC-JRC

The Copernicus Land Monitoring Service (https://land.copernicus.eu) has been providing geophysical data derived from Earth Observation (EO) at a global scale for several decades. This global dataset includes temperature and reflectance, vegetation, soil moisture, snow and water bodies variables. To ensure the quality of these dataset, yearly validation assessment is performed. The collection and processing of ground data for the purpose of validating Copernicus products represents in itself a huge task. In 2018, the European Commission (EC) has established a new service to ensure the independent production of these data: Ground-Based Observations for Validation (GBOV) https://gbov.land.copernicus.eu). The prime objective of GBOV has been for the last 8 years, to provide high-quality validation data for seven Copernicus Land Monitoring Service core products:

• Top Of Canopy Reflectance (TOC-R),

• Albedo (ALB),

• Leaf Area Index (LAI),

• Fraction of Absorbed Photosynthetically Available Radiation (FAPAR),

• Fraction of Vegetation Cover (FCOVER)

• Surface Soil Moisture (SSM) and

• Land Surface Temperature (LST).

In its third phase, new product have been included to support the growing Copernicus land products portfolio, namely:

•GPP and NPP

•Phenology

•Evapotranspiration

GBOV includes three components in the service:

•Component 1: consists of using data from existing in situ networks to generate EO validation datasets. Multi-year ground-based observations of high relevance for EO are collected from these global networks.

•Component 2: consists of upgrading existing monitoring sites with new instrumentation or establishing entirely new monitoring sites to close thematic or geographic gaps.

•Component 3: deals with data distribution of the validation dataset to the user community.

10:00am - 10:30am

Morning Coffee Break
Location: Exhibition Hall "E"

10:00am - 5:30pm

Exhibition
Location: Exhibition Hall "F"

Showcase Theatre

ISPRS Summer School Design Competition Winners
IEEE GRSS IDEA Three-Minute Thesis (3MT®) Competition

10:30am - 12:00pm

Plenary Session 2
Location: Exhibition Hall "G"

Keynote 1: Jean-Claude Piedboeuf (CSA)
Awards Ceremony:

The Jack Dangermond Award
The Gottfried Konecny Award

Keynote 2: Dr. Eleni Paliouras (ESA)

12:00pm - 1:30pm

Lunch
Location: Exhibition Hall "E"

1:30pm - 3:00pm

WG III/1B: Remote Sensing Data Processing and Understanding
Location: 713A

1:30pm - 1:45pm

Multi-modal semantic segmentation for open vocabulary interactions with remote sensing images

Jinkun Dai, Tao Peng, Yuhang Xue, Xianping Ma, Yuanxin Ye

Southwest Jiaotong University, Chengdu 611756, China

Semantic segmentation of multi-modal remote sensing imagery plays a pivotal role in land use/land cover (LULC) mapping, environmental monitoring, and precision earth observation. Current multi-modal approaches mainly focus on integrating complementary visual modalities (e.g., optical and synthetic aperture radar (SAR) imagery), yet neglect the incorporating of non-visual textual data a rich source of knowledge that can bridge semantic gaps between visual patterns and real-world concepts. To address this limitation, we propose TSMNet, a text supervised multi-modal open vocabulary semantic segmentation network that synergistically integrates textual supervision with visual representation for open-vocabulary semantic segmentation. Unlike conventional multi-modal segmentation frameworks, TSMNet introduces a dual-branch text encoder to extract both scene-level semantic and object-level label information from various textual data, enabling dynamic cross-modal fusion. These text-derived features dynamically interact with visual embeddings through the proposed text-guided visual semantic fusion module, enabling domain-aware feature refinement and human-interpretable decision-making. Moreover, integrating text opens pathways for open-vocabulary semantic segmentation, enabling systems to recognize and classify unseen categories through natural language descriptions, thereby overcoming the rigid constraints of predefined class taxonomies. To verify our method, we innovatively construct two new multi-modal datasets, and do a lot of extensive experiments are carried out to make a comprehensive comparison between the proposed method and other state-of-the-art (SOTA) semantic segmentation models. Results demonstrate that TSMNet achieves superior segmentation accuracy while exhibiting robust generalization capabilities across diverse geographical and sensor-specific scenarios. This work establishes a new paradigm for explainable remote sensing analysis, demonstrating that textual knowledge integration significantly enhances model generalizability.

1:45pm - 2:00pm

Meta-Prompting with Open-Source Language Models for Zero-Shot Scene Classification in Remote Sensing

Antonis Promponas¹, Eirini Baltzi¹, Valsamis Ntouskos², Konstantinos Karantzalos¹

¹Remote Sensing Lab, National Technical University of Athens, Greece; ²Department of Engineering and Sciences, Universitas Mercatorum, Rome, Italy

Zero-shot visual recognition with vision-language models (VLMs) has shown strong generalization to unseen categories in natural-image benchmarks, yet its effectiveness in remote-sensing (RS) imagery remains less explored. In this paper, we investigate whether meta-prompting with large language models (LLMs) can improve zero-shot scene classification in RS by automatically generating semantically rich class descriptions. Building on the Meta-Prompting for Visual Recognition (MPVR) framework, we evaluate three open-source LLMs, Mixtral-8x7B, Qwen 2.5 7B, and LLaMA 3.1 8B, as prompt generators across five RS benchmark datasets. The resulting descriptions are encoded with several VLMs, including CLIP, MetaCLIP, RemoteCLIP, and CLIP-LAION-RS, and compared against generic single-template and handcrafted domain-specific prompting baselines. Our results show that LLM-generated prompts are competitive with, and in several cases improve upon, manually designed templates, while revealing that the gains depend on both the dataset and the visual backbone. Overall, the study highlights the potential of open-source LLMs as scalable prompt generators for zero-shot remote-sensing recognition and provides insight into the transferability of meta-prompting beyond natural-image domains.

2:00pm - 2:15pm

Knowledge graph enhanced for zero-shot semantic segmentation in remote sensing imagery

Wubiao Huang¹, Huchen Li¹, Shuai Zhang¹, Haibing Liu¹, Zizhen Chen¹, Shihan Chen¹, Fei Deng^1,2

¹School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China; ²Hubei Luojia Laboratory, Wuhan 430079, China

Zero-shot semantic segmentation (ZSSS) is a crucial task in remote sensing image understanding, yet existing methods still suffer from limited generalization to unseen classes. To address this issue, we propose a Knowledge Graph (KG) enhanced ZSSS framework, which introduces explicit hierarchical and relational information into class embeddings to achieve more structured and semantically consistent representations. Specifically, a KG class encoder is designed, consisting of the class enhanced query (CEQ) and class enhanced embedding (CEE) modules, which extract class-relevant subgraphs from a self-constructing Remote Sensing Semantic Class Knowledge Graph (RSSCKG) and generate knowledge-enriched embeddings through a text encoder. Experiments on three public remote sensing datasets demonstrate that the proposed method consistently improves performance across seven state-of-the-art ZSSS frameworks. The integration of KG-based embeddings yields significant gains in the evaluation metrics, with particularly strong improvements on unseen classes, while maintaining accuracy on seen classes. Compared with enhancement strategies based on large language model (LLM) generated descriptions, the proposed KG class encoder exhibit superior semantic separability and stability. These results validate the effectiveness, generalization, and scalability of the proposed framework for ZSSS in remote sensing imagery.

2:15pm - 2:30pm

Segmentation-driven statistics-aware workflow for detailed scene description of UAV images using Mistral and LORA fused model

Bhargav Parulekar, Anandakumar M Ramiya

Indian Institute of Space Science and Technology, Thiruvananthapuram, Kerala, India

In the era of explainable AI, rapid data processing, analysis, and generation have become essential. Over the past few years, many approaches have been developed to process such heavy data and present it in an explainable manner, including in the field of remote sensing. One of such applications is remote sensing scene description. Many established workflows and models exist, but these models either fail to incorporate essential geospatial information or suffer from hallucination. We present a hybrid multimodal captioning methodology that tightly couples semantic segmentation outputs (via a LoRA-adapted Segment Anything Model) with a small, high-quality LLM- Mistral to produce descriptive, interpretable, and data-grounded scene captions. Rather than relying on direct image-to-text pipelines, our approach first extracts structured scene statistics (class proportions), spatial context (quadrant dominance and object localization), and color fingerprints (dominant colors per semantic class). These structured signals are converted into compact, factual prompts that the LLM consumes to generate coherent, informative, and verifiable captions. A comparison with the established Florence-2 model in terms of quantitative description demonstrates a significant improvement, with the Precision Vocabulary Index increasing from 0.077 to 0.232 due to the proposed workflow.

2:30pm - 2:45pm

Evaluating the Adaptation Potential of SAM2 for Glacier Segmentation in Severe Weather

Bindusara Nagathihalli Lokesh, Laura Camila Duran Vergara, Hans-Gerd Maas, Anette Eltner

Dresden University of Technology, Germany

Ground based time lapse cameras provide continuous, high frequency observations of glacier dynamics; however, automated analysis of these image streams remains challenging due to fog, snowfall, lens contamination, and variable illumination. This study investigates the potential of adapting the foundation segmentation model Segment Anything Model 2 (SAM2) for glacier segmentation from ground-based monitoring. To enable integration into automated pipelines, SAM2 is configured in image mode with a learned prompt generation strategy, while fine-tuning is restricted to the prompt encoder and mask decoder. In addition, the internal Intersection over Union (IoU) prediction head is utilized as a confidence estimator to assess segmentation reliability. Experimental results demonstrate that the adapted model achieves stable segmentation under moderate environmental variability, while degrading under severe visibility loss. This stability is consistent across model scales and input resolutions. The confidence estimation further provides a meaningful signal for identifying uncertain predictions, supporting reliability-aware processing in downstream workflows.

2:45pm - 3:00pm

Reasoning-guided ego-path segmentation for autonomous trains using vision–language models

Mohammadjavad Ghorbanalivakili, Ashley Varghese, Gunho Sohn

York University, Canada

Autonomous train perception must identify the train’s valid path under complex railway geometry, particularly at merging and diverging switches where multiple candidate tracks coexist. Existing approaches are primarily trained as purely visual predictors and typically do not provide justification for route selection, despite the fact that valid paths depend on structured cues such as blade–stock contact, rail gaps, and track continuity. In this work, we adapt the Large Language Instructed Segmentation Assistant (LISA) to railway ego-path perception and formulate the task as reasoning-guided segmentation: given a forward-facing railway image and a natural-language query, the model predicts the valid ego-path mask and, when prompted, generates a textual explanation grounded in visible switch geometry. Our approach integrates railway-specific prompting, a tailored annotation scheme, and efficient finetuning, along with semantic segmentation supervision to support general scene understanding. Experiments on a RailSem19-based evaluation set show improved ego-path segmentation performance over the original LISA checkpoint and increased robustness to prompt variation, while qualitative results indicate that the model can produce plausible, though not always consistent, reasoning. Notably, these capabilities emerge despite the reasoning-specific dataset consisting of only 54 samples, highlighting the data efficiency of the approach. These results highlight the potential of vision-language models for more interpretable railway perception, while also underscoring the need for stronger supervision and evaluation in safety-critical settings. Code and reasoning segmentation data are available at https://github.com/mvakili96/Railway_Perception_FoundationModel.

1:30pm - 3:00pm

WG II/9B: Vision Metrology
Location: 713B

1:30pm - 1:45pm

Quantization-Aware Training for Efficient Object Detection on FPGAs: Case Studies

Xuanshu Luo, Gabor Fogarasi, Alan Syrgak, Paul Walther, Martin Werner

Technical University of Munich, Germany

Deploying object detection models for resource-constrained remote sensing applications necessitates on-board model inference capabilities. While Field Programmable Gate Arrays (FPGAs) offer massive parallelism as energy-efficient hardware platforms, model quantization remains essential to further balance computational efficiency with detection accuracy. Compared to post-training quantization methods that involve multiple-stage development with consistent dependency on domain datasets, quantization-aware training (QAT) integrates quantization constraints into training, providing a simpler pipeline for model compression. However, QAT introduces quantization errors to which smaller objects are more vulnerable. To address this issue, we propose object-scale-aware (OSA) regularization that amplifies quantization error penalties for smaller targets. Our approach is validated through two case studies: bird detection at airports and aerial-view building detection. We perform 8-bit QAT on YOLOX series models using the MVA2023 dataset and the Bavarian Building Dataset for the respective studies. Our method achieves up to 50.2 times inference acceleration with minimal accuracy loss on Xilinx Kria KV260 FPGAs compared to full-precision models. The ablation study and detection examples further demonstrate the effectiveness of OSA regularization in small object detection.

1:45pm - 2:00pm

Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics

Dennis Haitz¹, Athradi Shritish Shetty¹, Michael Weinmann², Markus Ulrich¹

¹Karlsruhe Institute of Technology, Germany; ²Delft University of Technology, Netherlands

A broad evaluation of state-of-the-art Visual Place Recognition methods is presented. The evaluation focuses on tasks where a fast image pair retrieval is of high importance, such as image-driven scene registration, SLAM or Structure-from-Motion correspondence search. This implies, that the focus of the study is geared away from typical Visual Place Recognition and towards scenarios of interest in computer vision and robotics. A sophisticated evaluation pipeline for retrieval and runtime performance is presented. Prepared datasets based on widely used benchmarks from different domains are utilized, such as indoor-SLAM, outdoor object-centric as well as autonomous navigation in urban and sub-urban areas.

2:00pm - 2:15pm

MVM-IOD: An Industrial Object-Centric Benchmark Dataset for the Evaluation of 3D Reconstruction Methods

Robert Langendörfer, Markus Hillemann, Markus Ulrich

KIT, Germany

3D object reconstruction, camera pose estimation, and novel view synthesis in industrial applications are challenging tasks, as errors are costly while the timewindow for solving these tasks is often limited. The complexity of typical industrial objects further complicates these tasks. Different datasets that can be used to evaluate current methods on these tasks exist, however, most of them do not depict realistic industrial scenarios. We introduce the Machine Vision Metrology Industrial Object Dataset (MVM-IOD)

that addresses this lack of datasets. The hardware setup to acquire the dataset consists of a camera, mounted upside down due to space restrictions, at the end effector of an industrial robot arm. Images of typical industrial objects are captured systematically, by moving the camera on a hemisphere around the objects. MVM-IOD contains the camera poses, the acquired RGB images, and the 3D point cloud of 9 objects and 2 background choices resulting in 18 scenes, which allows evaluation of all image based

methods that compute a 3D reconstruction, camera poses, and/or novel views. Based on our dataset, we extensively evaluate current state-of-the-art 3D reconstruction and camera pose estimation methods, such as Structure from Motion, Multi-View Stereo, Visual Geometry Grounded Transformer (VGGT), π3, as well as 2D Gaussian Splatting and report our findings to create a baseline for future research.

2:15pm - 2:30pm

A Critical Synthesis of Uncertainty Quantification and Foundation Models for Semantic Segmentation

Steven Landgraf, Joceline Hinz, Markus Ulrich

Karlsruhe Institute of Technology, Germany

Foundation models are increasingly breaking what seemed to be impossible not long ago by enabling unprecedented accuracy and

cross-domain generalization. Yet their lack of interpretability, tendency to be overconfident, and sensitivity to real-world domain

shifts pose critical challenges for safety- and mission-critical applications. Uncertainty quantification (UQ) offers a principled way

to address these issues, but its integration into segmentation foundation models has yet to be explored. In this paper we present the

first systematic evaluation of UQ methods applied to a foundation model for semantic segmentation. We fine-tune a lightweight

DPT decoder on top of the pretrained SAM2 encoder to establish a simple yet competitive baseline and benchmark four representative

UQ approaches – Monte Carlo Dropout, Deep Sub-Ensemble, Test-Time Augmentation, and Evidential Deep Learning – across

Cityscapes, NYUv2, and two challenging out-of-domain settings. Our analysis compares segmentation accuracy, calibration, uncertainty

quality, and inference time, revealing clear trade-offs between predictive performance, reliability, and computational cost.

These results highlight both the promise and the current limitations of uncertainty-aware foundation models, pointing to the need

for future work that jointly optimizes accuracy, robustness, and efficiency for real-world deployment.

2:30pm - 2:45pm

The Impact of CutMix on Reliability and Robustness in Semantic Segmentation

Steven Landgraf, Markus Ulrich

Karlsruhe Institute of Technology, Germany

Ensuring not only high accuracy but also reliable and robust predictions is critical for the deployment of semantic segmentation

models in safety-critical applications such as autonomous driving. Despite the widespread use of CutMix – a simple yet powerful

data augmentation strategy – its effect on the reliability and robustness in dense predictions tasks remains unexplored. Motivated

by recent findings that semi-supervised segmentation methods, where CutMix is a core component, can severely degrade reliability,

this study isolates and systematically analyzes the influence of CutMix on segmentation accuracy, calibration, and uncertainty

quality. We evaluate two representative architectures, the CNN-based DeepLabV3+ and the transformer-based SegFormer, across

both in-domain and out-of-domain scenarios. Our results show that CutMix has only a minor impact on segmentation accuracy

but consistently improves the reliability, particularly under distribution shifts. These improvements indicate that CutMix primarily

enhances the trustworthiness of the model’s calibration and uncertainty rather than the raw segmentation prediction itself. This

distinction is crucial for safety-critical deployment, where reliable confidence estimates are as important as raw performance.

2:45pm - 3:00pm

Uncertainty Quality of VGGT: An Analysis on the DTU Benchmark Dataset

Markus Hillemann, Robert Langendörfer, Steven Landgraf, Markus Ulrich

Karlsruhe Institute of Technology, Germany

Visual Geometry Grounded Transformer (VGGT) has already attracted a great deal of attention in a short period of time, not least due to the Best Paper Award at CVPR-2025. Similar to DUSt3R and MASt3R, VGGT aims to bring about a paradigm shift by replacing established methods like bundle adjustment and feature matching with a simple, unified, feed-forward neural network that predicts camera poses, depth maps, and dense 3D structure directly from multiple images of a scene in a few seconds. A key aspect is its ability to process an arbitrary number of views consistently in a single forward pass without any post-processing or iterative optimization. For photogrammetry, this opens new possibilities for real-time, scalable, and accessible 3D reconstruction. In this context, not only high reconstruction accuracy but also high-quality uncertainty estimates are crucial, as they foster trust and enable robust quality assurance. This paper therefore investigates the quality of VGGT’s uncertainty predictions. The analysis identifies an effective confidence threshold for filtering VGGT’s raw output and demonstrates that enhancing uncertainty quality holds strong potential for improving the accuracy of its 3D reconstructions.

1:30pm - 3:00pm

WG I/3: Multispectral, Hyperspectral and Thermal Sensors
Location: 714A

1:30pm - 1:45pm

First Field Validation of a New VNIR/SWIR-Based Six-Band Multi-Camera System for UAVs over Winter Wheat

Alexander Jenal¹, Fabian Reddig², Andreas Bolten², Leon Vehlken², Hubert Hüging³, Thuy Huu Nguyen³, Jens Bongartz¹, Georg Bareth²

¹Application Center for Machine Learning and Sensor Technology (AMLS), University of Applied Sciences Koblenz, Germany; ²Institute of Geography, GIS & Remote Sensing Group, University of Cologne, Germany; ³Institute of Crop Science and Resource Conservation (INRES), University of Bonn, Germany

Shortwave infrared (SWIR) imaging from uncrewed aerial vehicles (UAVs) remains rare despite strong sensitivity to canopy water and protein. We present the first field validation of a six-band VNIR/SWIR multi-camera system designed for plot-scale monitoring of winter wheat using mid-sized UAVs. The payload utilized narrow bandpass filters (910, 980, 1100, 1200, 1510, and 1650 nm; FWHM 10–12 nm) and was operated at an altitude of approximately 30 meters above ground level, achieving a ground sampling distance of approximately 4 cm. Empirical line calibration, employing in-scene gray panels, was validated against material-distinct panels and spectroradiometer measurements. The spectral response functions were approximated using Gaussian convolution due to the narrow passbands. Five bands (980–1650 nm) exhibited excellent performance: empirical line model fits achieved R² values approaching 1.000 (RMSE = 0.003–0.009), independent panel validation demonstrated near-unity slopes (R² = 0.998–0.999; RMSE = 0.005–0.013), and plot-level canopy measurements (n=36) maintained strong agreement between camera and spectroradiometer (slopes = 0.943–1.079; R² = 0.58–0.85; RMSE = 0.010–0.023). Two SWIR normalized ratio indices exhibited robust cross-sensor agreement: NRI[1100,1200] (R² ≈ 0.93) and NRI[1650,1510] (R² ≈ 0.90). The 910 nm channel displayed systematic errors (slope = 0.442±0.040 for plots; MAPE ≈ 33%) due to identified out-of-band leakage from incomplete long-wave blocking, leading to its exclusion from accuracy claims. Mitigation strategies include higher optical density short-pass blocking and system-level spectral response function verification. The filter-reconfigurable payload provides quantitative reflectance and robust SWIR indices at the plot scale by integrating panel-anchored empirical line modeling with bandpass-aware harmonization, thereby advancing operational SWIR monitoring capabilities for precision agriculture applications.

1:45pm - 2:00pm

PanX.4: A Gyrocopter‑Borne Six‑Band VNIR Multicamera System for Sentinel-2‑Aligned Multitemporal Vegetation Monitoring

Alexander Jenal¹, Felix Kröber^2,5, Christopher Frank³, Lina Krisztian⁴, Markus Metz⁴, Ribana Roscher⁵, Jens Bongartz¹

¹Application Center for Machine Learning and Sensor Technology (AMLS), University of Applied Sciences Koblenz, Germany; ²Institute of Bio- and Geosciences, Forschungszentrum Jülich, Germany; ³CISS TDI GmbH, Germany; ⁴mundialis GmbH & Co. KG, Germany; ⁵Institute of Geodesy and Geoinformation, University of Bonn, Germany

This contribution presents PanX.4, a gyrocopter-borne six-band VNIR multicamera system developed within the KIBI project on AI-based identification and classification of protected plant communities (mFUND, FKZ 19F2276) to support cross-scale monitoring at Natura 2000 sites. The system is designed for spectral alignment with Sentinel-2 MSI bands B02–B06 and B08 and is integrated into a tri-sensor airborne suite on the FlugKit carrier platform together with a high-resolution RGB camera and a complementary six-band VNIR–SWIR imaging system. Using system-level spectral response characterization and spectral band adjustment factor (SBAF) analysis based on 1,057 ECOSTRESS spectra, the study quantifies the harmonization quality between PanX.4 and Sentinel-2A, S2B, and S2C. All bands achieved R² > 0.99, while comparative screening of alternative spectral configurations showed that careful band design is critical, particularly in the red-edge region. An additional inter-satellite sensitivity analysis further indicates that harmonization should account for band-dependent differences between Sentinel-2 units when multitemporal airborne and satellite observations are combined. To support multitemporal habitat monitoring, the paper also analyzes 86,947 first-mowing observations from 2017 to 2024 and derives a three-window acquisition concept synchronized with pre-mowing, post-regrowth, and senescence phases. This creates an operationally relevant framework for planning repeated airborne campaigns that can support validation, boundary refinement, and future machine-learning workflows for habitat classification. The contribution therefore establishes the sensor-design, spectral-harmonization, and temporal-planning basis for Sentinel-2-consistent airborne monitoring at sub-meter resolution. Operational airborne image products and in-flight validation are beyond the present contribution and form the next step for future deployment.

2:00pm - 2:15pm

Atmospheric correction of aerial imagery using satellite-derived reflectance data

Alexane Nghien, Manchun Lei, Mathieu Brédif

Univ Gustave Eiffel, Géodata Paris, IGN, LASTIG

Atmospheric correction of large-scale aerial imagery remains a major challenging, mainly due to the difficulty of accurately estimating atmospheric parameters within the images. This study proposes a novel atmospheric correction method based on satellite-derived Surface Reflectance (SR). The method is a semi-empirical linear correction approach that leverages Pseudo-Invariant Features (PIFs) as reference points. Experimental results show that, the proposed method achieves performance comparable to radiative transfer models approach when accurate atmospheric parameters are available, and provides more reliable corrections when such parameters are uncertain or unavailable.

2:15pm - 2:30pm

Abundance Estimation Methods in Spectral Unmixing for Real Data

Daniele Cerra, Miguel Pato, Emiliano Carmona

German Aerospace Center (DLR), Germany

Spectral unmixing estimates the fractional abundances of materials, having associated spectra called endmembers, in pixels acquired by imaging spectrometers. Validation of abundance estimation methods typically relies on synthetic data or comparisons to results obtained by other algorithms. This study considers results of typical abundance estimation algorithms on the DLR HySU (HyperSpectral Unmixing) benchmark dataset, which contains actual imaging spectrometer data acquired over several arrangements of known-size material patches for physically traceable validation. Abundance estimates are compared against measured target areas in pixels with different degrees of mixtures. We evaluate least squares and sparse unmixing methods across different noise scenarios on real data, and by contaminating the library through addition of non-relevant endmembers. Additionally, as a way to approximate hard sparsity constraints, we enforce cardinality constraints on endmember subsets, identifying those minimizing abundance errors relative to the full library. Results suggest that fully constrained least squares yields usually the best results, but struggles in cases of highly mixed pixels. Finally, we test quantization of abundance values as a way to enforce sparsity in non-negative least squares with limited but encouraging results. Overall, the increase in accuracy of results enforcing sparse solutions support the use of computationally efficient sparse unmixing methods in practical scenarios, part of which may become feasible if quantum computing capabilities improve in the future.

2:30pm - 2:45pm

Operational Band-to-Band Correction and Attitude Refinement of Pelican-2: dual-panchromatic Attitude Restitution and selective Bundle Adjustment with preliminary Application to Earthquake Displacement and DEM Generation

Saif Aati, Antonio Martos, Eric L. Peters, Frank Warmerdam, Graham Mills, Adam Weber, Luna Gray, Minh Radel

Planet Labs PBC

The Pelican satellite constellation, first launched by Planet Labs in 2025, continues the high-resolution imaging capability established by the SkySat program. The change to pushbroom sensor in Pelican presents new geometric challenges: satellite attitude variations and platform instabilities during acquisitions can produce band misregistration and geolocation errors that degrade downstream products. This paper presents an operational workflow developed for Pelican imagery, validated on Pelican-2, a technology demonstration satellite. The approach exploits the dual-panchromatic focal plane configuration to independently measure satellite wobble to greater accuracy than on onboard attitude sensors, combined with selective bundle adjustment and B-spline spatial correction to achieve sub-pixel band alignment without dense ground control points. Validation on 963 Pelican-2 scenes demonstrates sub-pixel band-to-band registration accuracy (RMSE < 0.12 px) and 4 m CE90 geolocation accuracy. Applications illustrate the potential for operational geoscience workflows: earthquake surface displacement mapping of the March 2025 Myanmar M7.7 rupture detects 4.0 m co-seismic offsets on the Sagaing Fault with minimal post-processing, and digital surface model generation from an opportunistic multi view acquisition yields preliminary elevation products free of jitter artifacts, demonstrating operational feasibility for constellation-scale processing.

Initial applications showcase operational potential: earthquake surface displacement mapping detects 4.0 m co-seismic offsets from the March 2025 Myanmar M7.7 rupture with minimal post-processing; digital surface model generation yields elevation products free of jitter artifacts. Results establish feasibility for constellation-scale processing and inform next-generation Pelican development.

1:30pm - 3:00pm

WG III/8B: Remote Sensing for Agricultural and Natural Ecosystems
Location: 714B

1:30pm - 1:45pm

Estimating the leaf area index of urban trees using terrestrial LiDAR and the PATH method: sensitivity analysis and comparison with optical and direct methods

Camille Taufflieb¹, Nathalie Breda², Ronghai Hu³, Tania Landes¹, Vincent Lecomte¹, Françoise Nerry⁴, Georges Najjar⁴

¹1 Université de Strasbourg, CNRS, INSA Strasbourg, ICube Laboratory UMR 7357, Photogrammetry and Geomatics Group, 67000, Strasbourg, France; ²Université de Lorraine, AgroParisTech, INRAE, UMR Silva, Nancy, France; ³College of Resources and Environment, University of Chinese Academy of Sciences, Beijing, 100049, China; ⁴Icube Laboratory (UMR 7357), University of Strasbourg, Strasbourg, France

Urban trees play a crucial role in mitigating urban heat islands through shading and transpiration, processes directly linked to Leaf Area Index (LAI). However, estimating LAI for individual urban trees remains challenging due to their geometric and temporal heterogeneity.

This study evaluates the PATH (Path length distribution) method, a terrestrial laser scanning (TLS) based approach, to estimate LAI for three urban tree species in Strasbourg, France. The PATH method models foliage area volume density from point clouds, accounting for non-random foliage arrangements and woody structure contributions, unlike traditional optical methods.

TLS campaigns were conducted in three streets at three phenological. The sensitivity of PATH to geometric reconstruction parameters was assessed to optimize LAI estimation. Results show that envelope geometry significantly influences PAI estimates, with concave shapes (of at least 3000 facets) yielding more accurate values, while leaf angle distribution has minimal impact.

The obtained LAI estimates varied by species, reflecting species-specific crown densities. PATH-derived PAI was compared to LAI-2000 optical sensor measurements and direct LAI obtained by leaf collection. PATH estimates aligned more closely with true LAI than LAI-2000, especially during early leaf expansion, though discrepancies arose due to branch pruning and polycyclic flushing. The study highlights the importance of adapting scanning protocols and PATH parameters to species-specific morphology.

In conclusion, this work highlights the potential of TLS-based methods for providing robust PAI estimates for urban trees. Future research will link these species-specific estimates to urban microclimate benefits.

1:45pm - 2:00pm

Evaluation of Machine Learning Methods for Estimation of Leaf Chlorophyll Content (LCC) Across 15 Soybean Cultivars During Early Reproductive Stage

Carli Kriek¹, Philemon Tsele¹, George Chirima², Adolph Nyamugama²

¹Department of Geography, Geoinformatics and Meteorology, University of Pretoria, Pretoria 0002, South Africa; ²Agriculture Research Council Natural Resource & Engineering (NRE), Pretoria, 0001, South Africa

South Africa is the leading soybean producer in Africa, contributing approximately 35% of the continent’s total production. Soybean is important for national food security and agricultural sustainability–– serving as a key nitrogen-fixing crop that support soil fertility and economic growth. Whilst monitoring biochemical parameters such as leaf chlorophyll content (LCC) is essential for assessing the soya bean health, cultivar-level variability can complicate the use of remote sensing–based approaches. This study evaluates the performance of four machine-learning algorithms, XGBoost, Random Forest, Partial Least Squares Regression, and Artificial Neural Network, using unmanned Aerial Vehicle based data across 15 soybean cultivars during the early reproductive phase. Results show that model performance is strongly cultivar dependent. Tree-based models achieved the highest accuracy, with XGBoost and Random Forest reaching RMSE values as low as 2.9 µmol m⁻² for PHIP62T16R and R² values up to 0.96 for RA655R, while ANN and PLSR performed substantially worse for cultivars with more complex spectral responses, such as PAN1555R. Residual results from generalised models revealed systematic over- and under-prediction in several cultivars, indicating that models developed using pooled data are unable to fully account for cultivar-specific spectral differences. Variable-importance analyses identified red-edge, NIR, and greenness-enhancing indices as the most influential predictors of LCC, highlighting their strong sensitivity to canopy structure and chlorophyll variation. Overall, the study shows that cultivar-specific, ensemble-based modelling delivers stronger predictions of chlorophyll in soybean. Incorporating cultivar information and using stratified model calibration improves the reliability of UAV-based chlorophyll monitoring in heterogeneous soybean canopies.

2:00pm - 2:15pm

Potential of very high Resolution Pléiades Neo Satellite Data to monitor Crop Traits

Georg Bareth¹, Christoph Hütt¹, Alexander Jenal², Andreas Bolten¹, Fabian Reddig¹, Leon Vehlken¹, Annika Klee¹, Jan Wolf¹, Hannah Firl¹, Hubert Hüging³

¹Institute of Geography, GIS & Remote Sensing Group, University of Cologne, Germany; ²AMLS, University of Applied Sciences Koblenz, Remagen, Germany; ³INRES - Crop Sciences, University of Bonn, Germany

The monitoring of crop traits on a landscape scale is of key interest in the context of precision farming and food production. Many studies use moderate-resolution satellite data like Sentinel-2, Landsat for crop monitoring. However, enhanced spatial resolution is improving monitoring quality significantly. In this context, commercial but expensive very high resolution (VHR) satellite data from Ikonos, Quickbird, Formosat-2, and WorldView-2 have been successfully applied for crop monitoring over the last two decades.

The focus is on the research question “Can Pléiades Neo data quantify plot-scale variation in dry biomass and N uptake?” and on developing an analysis workflow which could support precision farming on a landscape scale using VHR satellite data.

In this contribution, we propose the application of pansharpened Pléiades Neo satellite data for the monitoring of crop traits like dry biomass and N uptake - in our study for winter wheat. The very high spatial resolution of 0.3 m even allows to investigate field experiments with plot sizes of several m2 and therefore, would be suitable for crop phenotyping.

2:15pm - 2:30pm

Development of a transferrable hybrid retrieval model for mapping sweet potato chlorophyll at matured growth stage using ultra high-resolution UAV data

Philemon Tsele¹, Abel Ramoelo^2,1, Lucy Moleleki¹, Sunette Laurie³, Whelma Mphela¹, Natasha Tshuma¹

¹University of Pretoria, South Africa; ²South African National Space Agency, South Africa; ³Agricultural Research Council, South Africa

Smallholder farmers play a critical role in the growing of underutilized crops, such as sweet potato. Obtaining accurate maps of sweet potato biophysical variables is essential for farmers to assess and monitor crop health at different growth stages. Integrating radiative transfer model (RTM) data with vegetation indices (VIs) based on unmanned aerial vehicle (UAV) data, may have the potential for accurately estimating leaf chlorophyll concentration (LCC) across multiple crop varieties. Firstly, in this paper we developed and tested varying hybrid retrieval models by combining PROSAIL RTMs with broadband, narrowband and leaf-pigment VIs applied to 2-cm resolution UAV imagery, to retrieve LCC over 20 sweet potato varieties at 120 days i.e. matured growth stage. Secondly, the best hybrid retrieval model was transferred to a different site which contain similar sweet potato varieties at matured growth stage for the estimation of sweet potato LCC. Results show that the most accurate retrievals of LCC were achieved by integrating a larger database containing 11000 PROSAIL simulated reflectance samples with broadband indices, particularly the enhanced vegetation index (EVI) with coefficient of determination (R2) of 0.85, root mean squared error (RMSE) of 5.93 µg/cm2, and relative RMSE (RRMSE) of 9.87%. Furthermore, when transferred to a different site containing similar sweet potato varieties at matured growth stage, this model achieved 60% agreement with field LCC measurements and responded fairly well by capturing LCC variability. These findings have significant implications in sweet potato breeding programmes for developing new cultivars.

2:30pm - 2:45pm

Principal component analysis of UAV-derived vegetation indices and laboratory tissue nutrients for crop health assessment

Oluibukun Gbenga Ajayi^1,2, Fatima Abiola Ogunlesi³

¹Namibia University of Science and Technology, Namibia; ²University of Pretoria, South Africa; ³Federal University of Technology, Minna

Remote sensing and laboratory assays can improve field-scale crop assessment and management. This exploratory pilot study analyses relationships between leaf tissue nutrients and UAV-derived normalised difference vegetation index (NDVI) using seventeen paired samples collected across a mixed crop trial. Tissue measures for nitrogen, phosphorus and potassium were standardised and entered into principal component analysis to reduce pairwise correlation and extract orthogonal nutrient axes. The first principal component explained 54.79% of variance, the second explained 34.10%, together accounting for 88.9%. Principal component scores for the first two axes were used in linear and polynomial regression models to predict NDVI. Model skill was assessed on training data and with leave-one-out cross-validation, and bootstrap resampling produced empirical confidence intervals for component loadings. Linear models built on principal components provided the most stable cross-validated performance, while polynomial expansions improved training fit but generalised poorly. These findings indicate that a low-dimensional nutrient representation can predict NDVI with reasonable stability and that combining spectral and biochemical data supports spatially explicit nutrient assessment. The study recommends expanded and stratified sampling, reflectance calibration and targeted spectral bands for follow-up studies, and external validation before wider applications.

2:45pm - 3:00pm

Multiscale Multispectral–Hyperspectral Data for Estimating Coffee Yield Using Machine Learning Algorithms

George Deroco Martins, Lucas Carvalho, Filipe Silva, Rayssa Barbosa, Maria Cecília Santos

Federal University of Uberlândia, Brazil

This study integrates multispectral (UAV) and hyperspectral (ground-based) remote sensing data to estimate coffee (Coffea arabica) yield using machine learning algorithms. Forty field plots were analyzed with multispectral Mavic 3M imagery and hyperspectral Blue Wave spectroradiometer data. Spectral indices such as NDVI, NDRE, GNDVI, CIRE, and PRI were correlated with yield, revealing distinct responses across spectral domains. Neural networks achieved the best predictive performance (R = 0.93; RMSE = 7.9%), followed by SVM models (R = 0.90). The Red Edge and Green bands were most sensitive to productivity variations in multispectral data, while hyperspectral narrowband indices provided superior correlations with canopy physiological traits. The integration of both datasets highlights the complementary strengths of spatially extensive multispectral imagery and the spectral precision of hyperspectral sensing. This multiscale approach enables more accurate and operational yield estimation for perennial crops and supports the development of precision agriculture protocols for coffee production systems.

1:30pm - 3:00pm

ICWG III/IVa-B: Disaster Management
Location: 715A

1:30pm - 1:45pm

Mapping flood footprints: a review of remote sensing approaches for quantifying physical asset information extraction

Wei Jiang¹, Quan Liu¹, Xiaohui Ding², Zhiguo Pang¹, Denghua Yan¹, Yizi Shang¹, Rong Li¹, Akiyuki Kawasaki³

¹China Institute of Water Resources and Hydropower Research, Beijing, 100038, China; ²School of Software and Internet of Things Engineering, Jiangxi University of Finance and Economics, Nanchang, 330032, China; ³Institute for Future Initiatives, The University of Tokyo, Tokyo, Japan

Flooding stands as one of the world's prominent natural hazards, which exerts severe threats to sustainable socioeconomic development. Physical asset information in flood disasters refers to the location, quantity, and damage severity of exposed elements within the affected area. Rapid and accurate extraction of such information is crucial for flood disaster emergency management. To achieve this goal, a remote sensing-based framework for extracting physical asset information in flood disasters is proposed in this paper. This framework summarizes extraction methods for flood damage to typical asset types including cropland, buildings, and roads, and comparatively analyzes the advantages and limitations of multi-source remote sensing data, geographic data, and social media data in physical asset information extraction. This study further investigates the differences between statistical analysis, shallow learning methods, deep learning, and transfer learning approaches, with respect to three key dimensions, namely extraction accuracy, scenario applicability, and computational efficiency. Future research should focus on: (1) Development of operational technologies for flood emergency response and disaster mitigation; (2) multi-source data fusion and dynamic simulation based on digital twin technology; (3) intelligent mining of multi-modal information and development of generalized extraction models driven by foundation models, with the aim of providing technical support for rapid flood emergency response.

1:45pm - 2:00pm

Rapid flood damage assessment in detention basins using multi-source remote sensing: a case study of the 2023 dongdian flood event in china

Wei Jiang¹, Quan Liu¹, Denghua Yan¹, Wenlong Song¹, Zhiguo Pang¹, Xiaohui Ding²

¹China Institute of Water Resources and Hydropower Research, China, People's Republic of; ²School of Software and Internet of Things Engineering, Jiangxi University of Finance and Economics, Nanchang, 330032, China

Rapid flood damage assessment is essential for emergency response and post-disaster recovery. Following catastrophic flooding in the Haihe River Basin on July 28, 2023, the Dongdian flood detention basin was activated on August 1, with inundation persisting until early October. This study integrates satellite remote sensing, UAV imagery, and field surveys to develop a rapid multi-source approach for comprehensive flood loss assessment. The methodology comprises: (1) extraction of inundation characteristics (spatial extent, depth, duration); (2) classification of exposed assets (agricultural land, forests, residential and industrial areas); (3) comprehensive damage and economic loss evaluation. Results show that 301.49 km² (79.55% of the basin) was inundated from August 1 to October 5, 2023, with an average depth of 2.64 m. The central-western zone sustained the most severe damage, with prolonged residential inundation. Complete corn crop failure occurred, and agricultural-forestry production suffered near-total losses. Direct economic losses exceeded 17.5 billion yuan. Compared to traditional field methods, this approach demonstrates superior efficiency and accuracy, providing scientific support for flood risk management in detention basins.

2:00pm - 2:15pm

Shoreline extraction and coastal change detection from satellite SAR using thresholding-based methods

Corienne Erasmus^1,3, Shelley Haupt², Bolelang Sibolla^1,2

¹Department of Geography, Geoinformatics and Meterology, University of Pretoria, Pretoria, South Africa; ²Next Generation Enterprises and Institutions, Council for Scientific and Industrial Research, Pretoria, South Africa; ³AOS-SAMOS, Department of Oceanography, University of Cape Town, Rondebosch 7700, South Africa

Coastal environments provide various economic, ecological and societal benefits. Coastal erosion which is the gradual loss of sediment over time, poses a significant threat to South Africa’s coastline. The monitoring and detection of coastal erosion is essential for the effective management of coastal environments. One way to quantify coastal erosion is the delineation of coastal boundaries. Remote sensing techniques such as Synthetic Aperture Radar offers a unique opportunity to extract shoreline positions over large areas of the coast. Furthermore, thresholding and edge detection methods have been successfully used to extract land-water boundaries. In this study, C-band SAR data was used to derive backscatter coefficients for three different areas of interest in the Eastern Cape province in South Africa over a ten year period. The coastal erosion and accretion trends were calculated from the results indicated that the Linear Regression Rate (LRR) for the three different study area showed various coastal erosion seasonality trends. The shoreline LLR ranged between -0.01 and -3.28 m/year for the Cape Recife area and -0.17 and -4.78 m/year for the Nelson Mandela Bay beach front. The overall pattern was erosion during the winter months and accretion during the summer months. In contrast, for the Kings Beach area, there was a consistent accretion trend where the LRR values ranged between 0.94 and 1.68 m/year. The findings confirm that SAR remote sensing is suitable for detecting and monitoring coastal changes in three different coastal environments.

2:15pm - 2:30pm

Enhancing Oil Spill Interpretation Through Multisensor Fusion and Temporal Reconstruction: A Case Study Near the Strait of Gibraltar

Tom Avikasis Cohen, Dror Angel, Anna Brook

University of haifa, Israel

Oil spills in confined maritime corridors often evolve faster than any single satellite mission can observe. This often complicates the interpretation of individual images and create gaps in understanding how a spill progresses between satellite overpasses. This study examines whether combining Sentinel-1 and Sentinel-2 observations can provide a more coherent picture of its development of a spill event, using the case of an oil spill occurred near the Strait of Gibraltar in late August 2022 after a collision between the OS35 and the Adam LNG.

The preliminary analysis evaluated each sensor separately. Sentinel-1 highlighted changes in surface roughness, while Sentinel-2 revealed reflectance anomalies linked to modified optical properties of the water. Since neither dataset on its own offered a complete account of the surface conditions, a fusion procedure was applied to the closest pair of post-event images. The fused map displayed sharper boundaries and more spatial detail than the radar scene alone, offering a clearer outline of the affected area. To address the temporal mismatch between acquisitions, intermediate surfaces were also reconstructed for both sensors, producing estimated representations of the marine conditions at dates not directly observed.

Taken together, the fused and reconstructed products formed a more continuous sequence of the spill’s evolution, capturing both its fragmentation and its short-term reorganisation. Although the approach does not replace dedicated operational monitoring, it demonstrates that combining complementary satellite data can reduce ambiguity in single-sensor interpretation and strengthen situational awareness in regions where surface conditions change quickly and unpredictably.

2:30pm - 2:45pm

Windstorm hazard index development for malaysia

Nur Hidayah Zakaria¹, Nur Atiqah Hazali², Siti Aekbal Salleh^3,4, Nurul Amirah Isa¹, Nini Nurdiana Johari¹, Mohd Badrul Hafiz Che Omar¹, Arnis Asmat⁴, Andy Chan⁵

¹Faculty of Asia Built Enviroment and Surveying, Universiti Geomatika Malaysia (UGM), Malaysia; ²Geospatial Science & Technology College (GSTC), Malaysia; ³Institute for Biodiversity and Sustainable Development (IBSD),Universiti Teknologi MARA; ⁴Center of Studies Surveying Science and Geomatics, Faculty of Built Environment, Universiti Teknologi MARA (UiTM) , Malaysia; ⁵Southampton Solent University, England

Windstorms in Peninsular Malaysia have increased in both frequency and severity, posing growing risks to communities, infrastructure, and the national economy. Despite these escalating threats, the region currently lacks a comprehensive, location-specific index capable of evaluating and categorizing windstorm hazards for effective planning and mitigation. This study develops a Windstorm Hazard Index (WHI) tailored to Peninsular Malaysia to assess spatial patterns of windstorm risk and support evidence-based decision-making. Four objectives were addressed: (1) identifying key environmental and geographical factors influencing windstorm occurrences; (2) quantifying these parameters using windstorm records from 2008–2018, numerical simulations generated via WRF-ARW, and urban morphology modelling using Envi-MET; (3) formulating the WHI through the integration of Analytic Hierarchy Process (AHP) and Principal Component Analysis (PCA); and (4) validating the index using documented windstorm events from 2020–2024.The WHI categorizes the peninsula into six hazard levels ranging from very low (0.1–0.5) to extreme (0.901–1.0). Southern and central states, including Negeri Sembilan and Pahang, generally exhibited very low hazard levels, while Kelantan and Terengganu showed moderate risk. High-risk zones were concentrated in northern and coastal regions such as Penang, Kedah, and Perlis, with extreme-risk areas detected in parts of Kedah and Perlis. Results indicate that wind speed, temperature, humidity, precipitation, land use, and urban density strongly influence windstorm intensity, particularly in coastal and densely built environments. Validation confirmed the WHI’s reliability, as extreme-risk classifications aligned with recorded damage patterns. Overall, the WHI serves as a robust framework for regional hazard assessment and disaster-resilient infrastructure development across Peninsular Malaysia.

2:45pm - 3:00pm

FRI-R: A Data Driven Flood Risk Index for Resilience Decision-Making

Bandana Kar¹, Margaret Glasscoe²

¹ResIntSoft LLS, United States of America; ²University of Colorado, Boulder, United States of America

Flooding is one of the most frequent and costliest hydro-meteorological hazards, impacting every nation and causing significant societal and economic disruption. Despite the abundance of Earth Observation (EO) datasets and hydrodynamic models available to map, monitor, and forecast flood events, decision-makers and first responders often struggle to translate these resources into actionable insights. To bridge this gap, we’ve developed the Flood Risk Index for Resilience (FRI-R), a data-driven machine learning model designed to support resource planning, emergency response, and downstream analytics. FRI-R is powered by the Model of Models (MoM), an operational, open-source ensemble framework that integrates outputs from hydrologic models and EO data from optical imagery. Leveraging historical MoM outputs, FRI-R analyzes the spatial and temporal patterns of past flood events and classifies sub-watersheds from high to low risk based on flood frequency and duration, offering a dynamic lens into vulnerability hotspots. MoM has proven effective in disseminating early flood warnings. Building on this success, FRI-R is designed to enable targeted interventions for at-risk populations and critical infrastructures, thereby empowering communities and decision-makers to proactively mitigate and improve long-term resilience.

1:30pm - 3:00pm

WG II/3C: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

1:30pm - 1:45pm

CityLangSplat: Integrating CityGML Semantics into 3D Language Gaussian Splatting for Urban Scene Understanding

Qilin Zhang^1,2,3, Jinyu Zhu¹, Olaf Wysocki⁴, Boris Jutzi^1,3

¹Technical University of Munich; ²Munich Center for Machine Learning; ³Karlsruhe Institute of Technology; ⁴University of Cambridge

Combining visual semantics with language representations has made 3D interpretation more flexible and intuitive. Recent advances in Gaussian Splatting extend this to efficient 3D language fields supporting open-vocabulary queries. However, existing approaches show limited generalization in large urban scenes, especially for detailed building segmentation. Semantic 3D city models such as CityGML, by contrast, provide hierarchical and geometry-aligned structural semantics that complement appearance driven visual cues. We introduce CityLangSplat, which integrates CityGML semantics into 3D Language Gaussian Splatting for urban environments. CityLangSplat rasterizes CityGML into pixel-aligned semantic maps, extracts vision-language features from SAM-derived segments and CityGML regions, and compresses both sources into a shared latent space via a lightweight autoencoder. 3D Gaussians are then optimized with a coverage-aware loss that balances accurate, building-focused CityGML supervision with broader SAM supervision, enabling geometry-aligned open-vocabulary reasoning in urban scenes. Experiments on TUM2TWIN and ZAHA datasets show consistent gains over LangSplat, with relative improvements of 22.9% in 2D and 15.1% in 3D evaluation while preserving real-time rendering. CityLangSplat provides a practical framework for combining semantic city models with language-embedded 3D Gaussian Splatting for geometry-aligned urban scene interpretation. Code will be released at https://github.com/zqlin0521/CityLangSplat.

1:45pm - 2:00pm

RoofVIP benchmark dataset: 2D roof planar polygons and very high-resolution digital orthophoto pairs

Chaikal Amrullah, Daniel Panangian, Guneet Mutreja, Youssef Abdelhedi, Ksenia Bittner

German Aerospace Center (DLR), Germany

Accurate building roof modeling is fundamental to urban analytics, digital twins, and 3D city reconstruction; however, progress in deep learning–based reconstruction is constrained by the limited availability of diverse, high-resolution datasets with detailed geometric annotations. This study introduces the RoofVIP dataset, a large-scale benchmark featuring very high-resolution RGB orthophotos paired with 2D roof vectors that capture diverse urban morphologies across Munich, Germany. Following Level of Detail (LoD) 2.0 principles, RoofVIP encompasses a wide range of roof geometries and architectural complexities, enabling evaluation of both segmentation- and vectorization-based reconstruction methods. Two paradigms are examined: a two-step segmentation-based approach (Cascade Mask R-CNN, Mask R-CNN, SOLOV2, YOLACT) and a one-step direct vector prediction approach (HEAT, PolyRoof). ImageNet-pretrained region-based models, particularly Mask R-CNN and Cascade Mask R-CNN, achieve the highest segmentation accuracy, effectively delineating complex roof boundaries while revealing limitations on small or irregular structures. Geometry-based models show complementary strengths, with HEAT emphasizing topological regularity and PolyRoof focusing on geometric precision. Although performance is lower than on simpler datasets such as HEAT and Roof Intuitive, RoofVIP exposes challenges related to geometric diversity and scale variation, serving as a rigorous benchmark. The dataset includes predefined training, validation, and test splits, enabling consistent benchmarking across methods. By providing a challenging and diverse geometric landscape, RoofVIP aims to advance geometry-aware deep learning approaches and support scalable, high-fidelity 3D urban modeling. The dataset is publicly available through the project page at https://chaikalamrullah.github.io/RoofVIP/.

2:00pm - 2:15pm

Evaluating 3D Scene Representations for Aerial Photogrammetry across Diverse Cityscapes

Shihan Chen¹, Zhaojin Li², Qingsong Yan¹, Haibing Liu¹, Huchen Li¹, Wubiao Huang¹, Fei Deng^1,3

¹School of Geodesy and Geomatics, Wuhan University, Wuhan, China; ²Technology and Engineering Center for Space Utilization, University of Chinese Academy of Sciences, Beijing, China; ³Hubei Luojia Laboratory, Wuhan, China

The proliferation of continuous Neural Radiance Field (NeRF) and 3D Gaussian Splatting (3DGS) has shifted the paradigm of 3D aerial reconstruction from relying solely on geometric stereo matching to inverse rendering optimization. However, while these emerging rendering-based frameworks excel in synthesizing photo-realistic novel views, their capability to extract accurate surfaces in complex aerial scenarios remains ambiguous compared to traditional methods. To establish a clearer understanding, this study presents a comprehensive evaluation of five representative frameworks spanning traditional Structure from Motion (SfM), purely Signed Distance Field (SDF) representations, unstructured 3D Gaussians, hybrid voxel-Gaussians, and strictly explicit sparse voxels. By systematically standardizing identical computational environments, inputs, and unified mesh-extraction pipelines on both real-world airborne LiDAR datasets and synthetic cityscapes, we assess their performance regarding visual fidelity, geometric accuracy, and resource efficiency. The experimental results reveal that while traditional MVS produces the highest overall geometric precision by strictly enforcing multi-view rigid geometry, it is prone to failures in texture-less regions. Among rendering-based representations, a fundamental trade-off exists: highly flexible, unstructured 3DGS achieve highest visual scores but degrade the underlying geometric surfaces; conversely, explicitly structured techniques demonstrate distinct superiority in regularizing topological coherence and floating artifact suppression. Furthermore, we observe that integrating structured voxels avoids the severe memory bottlenecks associated with extracting geometries from chaotic unorganized primitives. These empirical findings emphasize that for large-scale aerial photogrammetry, integrating explicit spatial structuralization into differentiable rendering pipelines is imperative for achieving scalable operations and bridging the geometric accuracy gap with traditional methods.

2:15pm - 2:30pm

Development of a 3D City Model-Based System for Pre-Flight Evaluation and Optimization of Aerial Image Acquisition Plans

Lixian Zhao, Saki Kato, Kenta Imai, Maki Itazu, Aya Matsui

Kokusai Kogyo Co., Ltd., Japan

In dense urban environments, aerial image acquisition often suffers from occlusions and redundant data due to the lack of quantitative evaluation tools at the flight-planning stage. To address this issue, this study develops a flight-planning support system that enables pre-acquisition visibility analysis for both terrain and building surfaces using existing 3D city models. The system performs ray-casting simulations based on user-defined flight parameters to quantify and visualize occluded and visible regions before flight, allowing planners to evaluate data quality and optimize image acquisition efficiency. Experiments were conducted using real flight plans with two representative aerial cameras: the Leica CityMapper-2 for multi-directional texture mapping and the Vexcel UltraCam Eagle 4.1 for nadir-based topographic mapping. The results show that the system effectively visualizes occlusions on roofs and walls, predicts building lean in nadir imagery, and assesses the influence of overlap ratios on ground visibility. These analyses enable users to design more cost-effective and geometrically consistent flight plans by identifying redundant overlaps and ensuring sufficient coverage for DSM and true-orthophoto generation. The proposed framework provides a quantitative and objective approach to improving the transparency and reliability of aerial survey planning, and it offers a foundation for integrating visibility simulation with subsequent photogrammetric workflows such as surface reconstruction and texture mapping.

2:30pm - 2:45pm

Image LiDAR based change detection and updating for urban 3D reconstruction

Teng Wu, Bruno Vallet

Univ Gustave Eiffel, Géodata Paris, IGN, LASTIG, F-77454 Marne-la-Vallée, France

There is a high demand for accurate and up-to-date territorial digital twins for human activities, but their production and updating costs remain prohibitive for many applications. Their generation relies on acquiring LiDAR and/or image data over the territory of interest. Each modality has its advantages: LiDAR is more accurate but more costly, while images are noisier but less costly and more easily accessible. Combining these two technologies to produce and update digital twins is thus a promising avenue.In this paper, we propose a pipeline based on 3D change detection to update a LiDAR point cloud using newer aerial imagery. First, triangle meshes are generated from LiDAR data and image-based dense matching. Then, 3D ray tracing is used to detect changes. After removing the changed parts, the point clouds are fused to update the scene.The proposed method is demonstrated on two datasets in France.The code will be open source on Github: https://github.com/whuwuteng/ChangeUpdateJN.

2:45pm - 3:00pm

SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting

Zihan Li, Tengfei Wang, Wentian Gan, Hao Zhan, Xin Wang, Zongqian Zhan

School of Geodesy and Geomatics, Wuhan University, China PR.

Lightweight building surface models are crucial for digital city, navigation, and fast geospatial analytics, yet conventional multi-view geometry pipelines remain cumbersome and quality-sensitive due to their reliance on dense reconstruction, meshing, and subsequent simplification. This work presents SF-Recon, a method that directly reconstructs lightweight building surfaces from multi-view images without post-hoc mesh simplification. We first train an initial 3D Gaussian Splatting (3DGS) field to obtain a view-consistent representation. Building structure is then distilled by a normal-gradient–guided Gaussian optimization that selects primitives aligned with roof and wall boundaries, followed by multi-view edge-consistency pruning to enhance structural sharpness and suppress non-structural artifacts without external supervision. Finally, a multi-view depth–constrained Delaunay triangulation converts the structured Gaussian field into a lightweight, structurally faithful building mesh. Based on a proposed SF dataset, the experimental results demonstrate that our SF-Recon can directly reconstruct lightweight building models from multi-view imagery, achieving substantially fewer faces and vertices while maintaining computational efficiency.

1:30pm - 3:00pm

IvS2: Canadian Advances in Geospatial AI for Intelligent and Resilient Mobility
Location: 716A

1:30pm - 1:45pm

Toward a Unified Geospatial Intelligence Framework Utilizing Edge Computing, IoT, and Multimodal Generative AI for Climate Risk Mitigation and Adaptive Evacuation Planning

Truong Thanh Hung Nguyen, Hung Cao

Analytics Everywhere Lab - University of New Brunswick, Canada

Climate-induced hazards are increasing in frequency and complexity, creating a pressing need for real-time, adaptive, and spatially aware decision-support systems. Existing climate monitoring and evacuation planning approaches often rely on centralized analytics and static geospatial products, which limit their ability to respond to rapidly evolving conditions. This research introduces a Unified Geospatial Intelligence Framework that integrates Edge Computing, Internet of Things (IoT) sensor networks, and Multi-Generative AI (GenAI) models to enhance climate risk mitigation and adaptive evacuation planning. The framework is conceptualized as an extension of the Intelligence Everywhere paradigm, which promotes pervasive, context-aware intelligence across distributed sensing and computational environments.

The framework fuses satellite imagery, UAV data, environmental IoT streams, mobility traces, and other geospatial sources into a multi-layer analytics ecosystem. IoT and edge nodes perform decentralized, low-latency inference for early hazard detection, ensuring resilience even under degraded network conditions. Multi-GenAI models—including generative geospatial models, large language models, and graph neural networks—provide predictive hazard analytics, uncertainty quantification, and scenario simulation to support proactive decision-making.

An adaptive evacuation module integrates real-time transportation data, connected vehicles, and mobility models to dynamically optimize evacuation routes as conditions evolve. Mobile platforms, such as drones and emergency vehicles, act as intelligent edge nodes, enriching situational awareness and enabling distributed coordination.

The proposed framework advances geospatial AI and disaster informatics by demonstrating how pervasive intelligence can significantly improve hazard detection, evacuation efficiency, and climate resilience.

1:45pm - 2:00pm

A Theoretical Framework for Environmental Similarity and Vessel Mobility as Coupled Predictors of Marine Invasive Species Pathways

Gabriel Spadon¹, Vaishnav Vaidheeswaran¹, Claudio DiBacco²

¹Faculty of Computer Science, Dalhousie University, Halifax - NS, Canada; ²Fisheries and Oceans Canada, Bedford Institute of Oceanography, Dartmouth - NS, Canada

Marine invasive species spread through global shipping and generate substantial ecological and economic impacts. Traditional risk assessments require detailed records of ballast water and traffic patterns, which are often incomplete, limiting global coverage. This work advances a theoretical framework that quantifies invasion risk by combining environmental similarity across ports with observed and forecasted maritime mobility. Climate-based feature representations characterize each port's marine conditions, while mobility networks derived from Automatic Identification System data capture vessel flows and potential transfer pathways. Clustering and metric learning reveal climate analogues and enable the estimation of species survival likelihood along shipping routes. A temporal link prediction model captures how traffic patterns may change under shifting environmental conditions. The resulting fusion of environmental similarity and predicted mobility provides exposure estimates at the port and voyage levels, supporting targeted monitoring, routing adjustments, and management interventions.

2:00pm - 2:15pm

Congestion-aware multi-agent reinforcement learning for wildfire evacuation routing

Bahareh Raei, Reza Safarzadeh, Xin Wang

University of Calgary, Canada

Wildfires are increasing in frequency and severity, placing growing pressure on communities and emergency management systems. When evacuations are ordered, large populations must move simultaneously over road networks never designed for such concentrated demand, particularly in small towns with only a few access corridors where delays or closures can sharply increase exposure to roadway hazards.

Evacuees often rely on everyday navigation apps that compute a fastest route for each driver. Although effective for routine travel, these systems optimise individual convenience rather than collective performance. When widely used during an emergency, they concentrate traffic onto the same nominally optimal links and offer little ability to reflect fire progression, road closures, or rapidly evolving congestion. As a result, standard navigation tools can unintentionally channel evacuees toward capacity-limited roads near advancing fire fronts.

This paper introduces a congestion-aware multi-agent reinforcement learning framework for wildfire evacuation. Operating on an OpenStreetMap-derived road graph and parcel-level building data for Lytton, British Columbia, each road junction hosts a Q-learning agent that learns exit-directed navigation policies and, during deployment, adjusts its decisions using penalties based on real-time edge usage and mapped fire zones. The framework formulates parcel-based evacuation as a distributed decision process and incorporates evolving congestion through traffic-aware batch routing. Through a detailed case study, we demonstrate substantial reductions in peak edge loading and fire-zone incursions compared with fastest-path routing while maintaining competitive travel distances.

2:15pm - 2:30pm

Exploring Bus Stop Passenger Ridership Using explainable Machine Learning

Ge Cui

University of New Brunswick, Canada

Over the past decade, promoting sustainable urban transportation has become increasingly important in North America due to growing populations and rising traffic congestion. Public transit, particularly bus systems, plays a critical role in reducing reliance on private vehicles. This study examines bus stop ridership in Fredericton, Canada, considering several explanatory variables, including public transit infrastructure, socio-economic factors, and local amenities. XGBoost was used to model the relationship between ridership and these variables, and SHAP was applied to quantify the contribution of each feature for enhancing interpretability. Results indicate that higher levels of bus service, specifically the number of bus routes and service frequency, are the most influential factors, showing strong positive associations with ridership. Other transportation infrastructure features, such as the availability of shelters, also have a positive impact. The findings suggest that strategically locating bus stops near high-amenity areas and well-planned bus transfer hubs can attract more passengers. Additionally, distributing bus hubs more evenly could help alleviate the exceptionally high volume at the current bus hub at Kings Place. By combining XGBoost and SHAP, this study provides both accurate predictions and transparent insights, supporting urban planners in optimizing public transit systems and promoting sustainable mobility.

2:30pm - 2:45pm

Advancing Geospatial Analysis with Foundation Models and LLMs in ArcGIS

Mohamed Ahmed

Esri Canada, Canada

Foundation models and large language models (LLMs) are rapidly transforming geospatial artificial intelligence, yet their effective use in operational remote sensing and GIS workflows remains insufficiently defined. Although these models offer strong generalization capabilities, a key challenge is translating them into robust, domain-relevant tools that support practical analysis and decision-making. This presentation addresses that gap by showing how foundation models and LLMs can be integrated into ArcGIS workflows to improve the extraction, interpretation, and use of information from Earth observation imagery and unstructured geospatial content.

Using examples based on models such as the Segment Anything Model (SAM), Prithvi, and other foundation models for image segmentation and Earth observation analysis, the session demonstrates how these architectures can support feature extraction, land-cover classification, hazard mapping, and related remote sensing tasks with reduced reliance on large labelled datasets. In parallel, the presentation examines how LLMs extend geospatial analysis beyond imagery through natural-language interaction, geospatial reasoning, entity extraction, and the synthesis of spatially relevant information from unstructured sources.

A central focus of the session is the adaptation of general-purpose models to geospatially specific problems. The presentation therefore highlights efficient fine-tuning strategies, including Low-Rank Adaptation (LoRA), as practical mechanisms for customizing foundation models to local environments, imagery characteristics, and application domains without the computational burden of full retraining. Through applied examples in ArcGIS, the session illustrates how these models can be combined into scalable workflows that reduce manual effort, accelerate analysis, and enhance the quality and usability of geospatial outputs for research and operational practice.

1:30pm - 3:00pm

Forum2B: The Future of Space- based Earth Observation
Location: 716B

1:30pm - 3:00pm

Forum7A: Entrepreneurship in the Industry 4.0 Geospatial Landscape
Location: 717A

1:30pm - 3:00pm

InS3: Industry Tech Session
Location: 717B

1:30pm - 5:00pm

General Assembly (tentative session)
Location: 701A

3:00pm - 3:30pm

Afternoon Coffee Break
Location: Exhibition Hall "E"

3:30pm - 5:15pm

ThS12: TLS-based Deformation Analysis
Location: 713A

3:30pm - 3:45pm

Complementing and validating uncertainty of terrestrial laser scanning via interval analysis

Reza Naeimaei, Steffen Schön

Institut für Erdmessung (IfE), Leibniz University Hannover, Hannover, Germany

Terrestrial laser scanning (TLS) enables dense spatial sampling; however, millimeter-level deformation analysis is limited by uncertainty rather than resolution, as inter-epoch differences can arise from actual change or residual systematic effects. Classical methods capture random variability under distributional assumptions but do not guarantee bounds for persistent systematic effects.

This paper presents a complementary interval-based framework that provides reliable, distribution-free bounds for TLS uncertainty and integrates seamlessly with least-squares workflows.

Starting from a measurement and instrumental correction model for high-end panoramic scanners, deviations of effective parameters are propagated to TLS observations and represented as interval radii at the observation level. We then extended the Least-Squares Adjustment, which linearly maps observation-level interval bounds to residuals and parameter estimates, providing conservative first-order enclosures alongside stochastic covariances.

Validation without a trusted nominal is addressed via a residual-based strategy that exploits two-face (Face 1/Face 2) acquisitions. This paper proposes a framework to validate intervals without existing nominal values. It begins with challenges and also guides addressing these challenges to ensure fair validation and test the proposed method on real TLS data. Overall, the proposed framework provides guaranteed bounds for remaining effects, improves discrimination between actual deformation and systematic effects, and offers actionable diagnostics for TLS-based monitoring.

3:45pm - 4:00pm

Point-based, profile-based and 3D point cloud-based vibration monitoring of structures: comparisons based on a lab experiment

Oliver Geißendörfer¹, Victoria Rosa², Hans-Berndt Neuner², Christoph Holst¹

¹Technical University of Munich, Germany; ²Technical University of Vienna, Austria

The safety and longevity of civil infrastructure rely on robust structural health monitoring (SHM), yet conventional methods are

often constrained by the high cost and impracticality of contact-based sensors. On the other hand, existing non-contact technologies

typically specialize in either static geometric mapping or spatially limited dynamic vibration analysis, leading to fragmented data

and complex post-processing. This research presents a unified non-contact methodology that addresses this challenge by simul-

taneously acquiring high-resolution 3D geometry time-series vibrational data using a single Light Detection and Ranging (LiDAR)

device. For this purpose, we compare point-based measurements using a total station, an iPhone along with a profile-based LiDAR

and 3D LiDAR point clouds for an experimental analysis. Sensor observations are recorded and analyzed at the same location

on the experimental surface showing flexibility in input dimensionality as well as robustness in resulting scalograms. The core

of the analysis is our developed method, a directional wavelet transform, a signal processing technique uniquely suited handling

non-stationary signals as multidimensional unstructured data. This method enables the characterization of oscillations across the

unstructured 3D surface, a capability beyond traditional modal analysis with one-dimensional time-frequency localization, but using

LiDAR point cloud time series. The result is a richer and more integrated understanding of structural behavior, capable of revealing

vibration behavior in high spatial detail. The study demonstrates that spatio-temporal LiDAR data contains embedded dynamic

information, offering a more comprehensive and efficient way to assess the health and integrity of a structure in the future.

4:00pm - 4:15pm

From tensor-product to truncated hierarchical B-splines: Enhancing spatial Resolution in space-continuous Deformation Analysis based on 3D point clouds

Elisabeth Ötsch, Hans Neuner

TU Wien, Department of Geodesy and Geoinformation, Austria

The quasi-continuous capturing of our environment by terrestrial laser scanning (TLS) in form of 3D point clouds provides the basis for numerous spatial analyses, including space-continuous deformation analysis. In times of aging infrastructure and climate

change-induced, cumulative mass movements, statistically-sound methods for determining areal deformations are becoming increasingly important. However, the lack of reproducibility of absolute point positions between consecutive scans and the resence of measurement noise demand approaches that retrieve credible comparison statements. The representation of point clouds by geometric surfaces supports noise reduction and serves as basis for successive analysis. Tensor-product B-spline surfaces have proven to be particularly versatile geometric representations to derive spatially consistent deformation estimates. This paper extends this concept by investigating the use of truncated hierarchical B-splines for statistically sound deformation analysis. We show that deformation is detectable when partition of unity is preserved through truncation. In a simulated environment, significant deformations between two point clouds were successfully detected. Results indicate that coarse surface representations lead to type-1 errors and underestimated deformation magnitudes, whereas more refined surface representations yield consistent deformation estimates, providing a potential termination criterion for adaptive model refinement.

4:15pm - 4:30pm

Towards a Framework for Benchmarking Dense 3D Displacement Estimation Approaches for Geomonitoring Using Long-Range TLS Data

Nicholas Meyer, Tomislav Medic, Andreas Wieser

Institute of Geodesy and Photogrammetry, ETH Zurich, Switzerland

Accurate and spatially dense 3D displacement estimation can contribute to a better understanding of geomorphological processes, while long-range terrestrial laser scanning (LR-TLS) has emerged as a promising technique for generating such observations. However, selecting the most effective algorithms for dense 3D displacement estimation remains challenging due to the lack of benchmarking. This study introduces an open and extensible benchmarking framework for 3D displacement estimation and provides an initial validation through a systematic comparison of representative 2D projection-based and 3D point cloud--based methods for estimating 3D displacements from LR-TLS scans. The evaluation includes 252 combinations of algorithmic and hyperparameter configurations, covering cross-correlation, optical flow, and salient feature tracking approaches, as well as the 3D displacement estimation method F2S3. All methods were benchmarked on a single common LR-TLS dataset, using sparse GNSS and manually derived displacements as ground truth. Results show that F2S3 achieves the highest agreement with the ground truth, while the top-performing configurations of the 2D approaches reach comparable accuracy, albeit slightly lower than that of F2S3. Our findings further highlight key sensitivities of current methods to parameter choices and data characteristics. The presented open and extensible evaluation framework enables reproducible performance assessment and could provide a foundation for future large-scale benchmarking and further development of 3D displacement estimation techniques for LR-TLS data.

4:30pm - 4:45pm

Joint Stone Segmentation and Feature Driven Deformation Analysis at Water Dams

Annika Tobies, Judith Foth, André Cornelißen, Eike Koller, Lasse Klingbeil, Heiner Kuhlmann

Institute of Geodesy and Geoinformation, University of Bonn, Germany

Structural health monitoring of water dams is crucial to ensure their long-term safety and operational reliability. Traditional geodetic techniques, although precise, are limited to sparse observation points and cannot capture spatially heterogeneous deformations. Laser scanning enables comprehensive, area-wide acquisition, overcoming this limitation. Subsequent deformation analysis often relies on comparisons along the local surface normal, which are limited in detecting in-plane movements. To address this, this study presents an approach that combines image-based stone segmentation with point-cloud-based deformation analysis to estimate both in-plane and out-of-plane displacements across masonry dam surfaces. Individual stones are detected in unmanned aerial vehicle (UAV) imagery using a deep learning segmentation model (Mask R-CNN) and subsequently projected into corresponding point clouds acquired by terrestrial laser scanning (TLS) and UAV laser scanning. By establishing consistent stone correspondences across multi-epoch point clouds via centroid-based matching and local iterative closest point (ICP) alignment, the proposed method enables deformation analysis on a stone-by-stone level. Simulated deformations were applied to TLS- and UAV-based point clouds of a dam to evaluate the method. Results demonstrate that the approach achieves sub-centimeter accuracy for the TLS and low-centimeter accuracy for the UAV point cloud, as measured by the RMSE between the estimated and true deformation. Our approach outperforms conventional model-to-model comparison methods, such as Multiscale Model to Model Cloud Comparison (M3C2), for in-plane displacements. The integration of image segmentation and geometric analysis provides a powerful framework for full-field deformation monitoring of masonry structures, supporting the detection of instabilities and improving dam safety.

4:45pm - 5:00pm

Reducing Non-rigidity in TLS Point Clouds Induced by Inhomogeneous Systematic Errors Using Free-form Surface Modeling

Yihui Yang¹, Corinna Harmening², Daniel Czerwonka-Schröder³, Christoph Holst¹

¹Chair of Engineering Geodesy, TUM School of Engineering and Design, Technical University of Munich, Germany; ²Geodetic Institute, Karlsruhe Institute of Technology, Germany; ³Department of Geodesy, Bochum University of Applied Sciences, Germany

In geodetic monitoring, terrestrial laser scanning (TLS) point clouds are typically assumed to be accurate and true-to-scale, implying that data acquired from different epochs or stations differ only by rigid transformations. Consequently, systematic errors related to scanner or platform variations can be mitigated through rigid point cloud registration. However, variations in the propagation speed and path of laser beams due to atmospheric refraction, as well as ranging biases induced by surface properties, can introduce non-rigid distortions in the generated point clouds. These effects are particularly pronounced under complex meteorological and topographic conditions, such as in mountainous areas. As a result, the acquired point clouds exhibit inhomogeneous and non-linear deviations that cannot be effectively compensated by simple distance corrections or rigid transformations. In this study, robust rigid registration is first performed to minimize the effects of platform offsets. A data-driven approach is then employed to generate sparse stable points, providing distance deviations that incorporate spatially varying systematic errors. Finally, a free-form surface is fitted to these sparse point-wise distance deviations, thereby establishing a 3D correction field for the entire point cloud. For a dataset collected by a permanent TLS monitoring system in the Vals Valley (Tyrol, Austria), the proposed method effectively reduces the registration residuals in TLS point clouds caused by inhomogeneous systematic errors.

5:00pm - 5:15pm

Calibration of Panoramic Terrestrial Laser Scanners using Planar Patches

Eike Koller¹, Lasse Klingbeil², Heiner Kuhlmann³

¹University of Bonn, Germany; ²University of Bonn, Germany; ³University of Bonn, Germany

Using point clouds captured by Terrestrial Laser Scanners for measurement tasks with high-quality requirements is well established in engineering geodesy. However, geometric imperfections within the scanners introduce systematic deviations into the captured point clouds. These deviations often reach several millimeters in magnitude, exceeding the impact of random measurement noise. Calibrating the scanners by estimating these internal imperfections allows these systematic errors to be corrected, thereby preventing misinterpretations of the measurement results. In this work, we develop a methodology that allows users of Terrestrial Laser Scanners to independently determine calibration parameters for panorama scanners and to correct the resulting point clouds using planar patches extracted directly from the captured data. This approach requires no additional hardware or specialized measurement equipment. We evaluate the methodology using an independent point cloud of a water dam and demonstrate that it achieves a substantial reduction in systematic deviations. Furthermore, by estimating calibration parameters in a dedicated state-of-the-art calibration field, we show that our method delivers results comparable to these established calibration procedures—yet without the need for such specialized calibration environments.

5:15pm - 5:30pm

Methodological framework for determining vertical angular variances of terrestrial laser scanners

Jakob Hummelsberger¹, Omar AbdelGafar¹, Derek Lichti², Christoph Holst¹

¹Chair of Engineering Geodesy, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany; ²Department of Geomatics Engineering, Schulich School of Engineering, University of Calgary, Calgary, Canada

Information on the precision of TLS observables is limited. While the range measurement precision can be modeled with respect to the intensity measurement nowadays, the precision of the angular observations still relies on the claims of the manufacturer. This contribution proposes a method to determine the vertical angular variance of a TLS using profile measurements. Supported by a simulation, which serves as proof-of concept, the methodology is laid out. In the end, measurements with a Z+F IMAGER® 5016A are evaluated. A dependency of the angular standard deviation on the rotational speed of the beam deflection unit is observed. The estimation precision of the angular standard deviation is high with consistent values for differing ranges. The estimated angular standard deviations are much lower than the claims of the manufacturer starting with roughly 2" for the slowest rotating settings, up to 4" for the fastest. All this can be achieved by scanning a reflectivity target with at least two adjacent fields of different homogeneous reflectivity. This needs to be aligned to the scanner to reduce and eliminate as many contributing error sources as possible. The target itself provides the fields and the transitions needed to perform the in-situ estimation of the angular precision.

3:30pm - 5:15pm

WG IV/2B: Artificial Intelligence and Uncertainty Modeling in Spatial Analysis
Location: 713B

3:30pm - 3:45pm

Chat2Map: A ReAct-based Agent Framework for Automated Web Map Generation from Natural Language Instructions

Hongping Zhang¹, Peilong Ma², Cong Wang¹, Lei Ding¹, Zhen Wang¹, Heng Li¹

¹National Geomatics Center of China, China, People's Republic of; ²Nanjing Normal University, School of Geography, Nanjing, Jiangsu,China

WebGIS platforms have revolutionized geospatial data dissemination, yet their adoption remains constrained by the steep learning curve of mapping library APIs. Frontend libraries like Leaflet, OpenLayers, and platforms such as Tianditu contain hundreds of classes and methods, requiring substantial programming expertise. This technical barrier prevents domain experts—urban planners, environmental scientists, public health officials—from independently creating the visualizations they need for analysis and decision-making.While Large Language Models (LLMs) have revolutionized code generation, they struggle with domain-specific, low-resource APIs common in geospatial applications. When applied to specialized geospatial APIs, these models exhibit critical failures: they frequently "hallucinate" non-existent functions, misuse parameters, or generate syntactically plausible but semantically incorrect code. This unreliability stems from the underrepresentation of domain-specific libraries in LLMs' training corpora, creating a "last mile" problem that renders them unsuitable for professional geospatial development. This study proposes a ReAct-based agent framework for automated web map generation from natural language instructions. The framework constructs a stateful, cyclic workflow and enables human–AI interactive WebGIS code generation based on the Tianditu JavaScript API. Its effectiveness and generality are validated through multi-model evaluation (GPT-4, Claude 3, Llama 3, Qwen-Max), demonstrating robust performance across diverse application scenarios. Experimental results show that the framework achieves professional-grade quality in both directive-driven and data-driven geospatial visualization tasks.

3:45pm - 4:00pm

Bridging Human Intent and Geospatial Services: A Conceptual Framework and Feasibility Study for Text2GeoAPI

Lei Ding, Heng Li

National Geomatics Center of China, 100830 Beijing, China

With the proliferation of online geospatial services, Geospatial Application Programming Interfaces (GeoAPIs) have become the backbone of modern spatial data interoperability. However, the high technical barriers of GeoAPIs, characterized by complex RESTful syntax and deterministic parameter requirements, create a significant "digital divide" for non-expert users. To bridge the gap between intuitive human spatial intent and technical service execution, this study proposes Text2GeoAPI, a novel conceptual framework for the automatic invocation and composition of geospatial services via natural language. We introduce the Intent-Entity-Operation (IEO) model to formalize spatial tasks, decoupling high-level semantic goals from atomic technical operations. We developed a modular prototype leveraging Large Language Models (LLMs) as cognitive engines to perform structured intent parsing, dynamic workflow planning, and multi-source result synthesis. Experimental evaluations using 100 diverse spatial queries demonstrate an overall task success rate of 86%, with the system effectively orchestrating multi-hop service chains (e.g., Geocoding → Isochrone Analysis → POI Search). The results confirm that Text2GeoAPI significantly lowers the threshold for accessing professional geospatial analysis, shifting the GIS paradigm from "tool-centric" to "intent-centric" intelligence.

4:00pm - 4:15pm

AI for Inclusive Winter Mobility: Multimodal Integration for Detecting Barriers Affecting People with Disabilities

SARA SHAHSAVARANI, Mir Abolfazl Mostafavi

¹Center for Research in Geospatial Data and Intelligence (CRDIG), Department of Geomatics Sciences, Université Laval, 1055, Avenue du Séminaire, Quebec City, QC G1V 0A6, Canada; ²Center for Interdisciplinary Research in Rehabilitation and Social Integration (Cirris), Quebec City, QC G1M 2S8, Canada

Winter accessibility poses critical challenges in cold-climate cities such as Québec, where snow and ice accumulation restrict the mobility of people with disabilities. This study presents an AI-driven multimodal framework designed to detect, classify, and

map winter barriers affecting pedestrian accessibility in Québec City. Building upon the SNOWMAN project, synthetic image and textual datasets were developed to represent seven major snow- and ice-related obstacle categories, including icy ruts, deep

snow, and uncleared sidewalks. The visual modality employed a self-supervised SimCLR model for snow-barrier classification (F1-score = 0.93), while the textual modality used a fine-tuned BERT classifier, achieving a perfect F1-score = 1.00 on validated

synthetic descriptions. Canonical Correlation Analysis (CCA) aligned the two modalities into a shared latent space, enabling spatial fusion of visual and semantic embeddings for integrated analysis within the MobiliSIG Winter Mobility platform. The fused data

produced dynamic accessibility maps revealing clusters of recurring winter hazards in known high-risk zones. The results confirm the feasibility of using synthetic multimodal data to simulate pedestrian-scale winter conditions and demonstrate the potential of

multimodal AI for inclusive, data-driven mobility management in cold-climate cities.

4:15pm - 4:30pm

Assessing residential Land Efficiency with spatial–contextual GMM and human Activity big Data: a Case Study of Shenzhen

Shihao Liang¹, Yixin Liu¹, Renzhong Guo¹, Weixi Wang¹, Ding Ma¹, Ye Zheng², Shengjun Tang¹, Linfu Xie¹, Xiaoming Li¹

¹Research Institute for Smart Cities & MNR Key Laboratory of Urban Land Resources Monitoring and Simulation, School of Architecture and Urban Planning, Shenzhen University; ²Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, 315211, China

As China’s urban development shifts toward stock-based optimisation, identifying inefficient residential land has become important for urban regeneration. Existing approaches often rely on subjective weighting, linear analytical structures, or homogeneous treatment of different residential types, which weakens robustness and transferability. To address these limitations, this study proposes a data-driven framework that integrates mobile-phone signaling and other multi-source spatiotemporal big data in Shenzhen. Two dominant residential forms—formal residential communities and urban villages—are evaluated separately through a four-dimensional framework covering built form, activity vitality, economic efficiency, and environmental livability. Principal component analysis is used to estimate intrinsic dimensionality and initialize a parametric autoencoder. A spatially constrained Gaussian mixture model is then employed to identify inefficient residential clusters while preserving local coherence. The clustering results are interpreted using a random forest model and TreeSHAP, and externally validated by street-view imagery interpretation and limited field surveys. PCA retained five components for urban villages and six for formal residential communities, and the BIC selected six and five clusters for the two residential types, respectively. The results indicate that inefficient formal residential communities show scattered and island-like spatial patterns, whereas inefficient urban villages tend to form more continuous clusters along the edges of larger village agglomerations. Random forest and TreeSHAP further reveal that inefficient urban villages are more strongly associated with deficiencies in service accessibility and local socioeconomic conditions, whereas inefficient formal residential communities are more closely associated with lower residential vitality and relatively high development intensity. External validation indicates acceptable agreement with observed residential conditions.

4:30pm - 4:45pm

Reproducing Geospatial Crowdsourcing: How Consistent Is the Crowd?

David Collmar, Volker Walter, Uwe Sörgel, Roland Ullmann

University of Stuttgart, Germany

This paper investigates the long-term consistency and reliability of paid geospatial crowdsourcing on the online platform Microworkers.com. Over a five-month period, we conducted three crowdsourcing campaigns, each representing a task typical for remote sensing, i.e., pixel classification, point selection, and geometric outline acquisition, to assess whether repeated worker participation enhances data quality and reproducibility. Beyond individual task performance, we examine the broader question of whether crowdsourcing campaigns can yield reproducible results over extended periods. Despite the large and heterogeneous workforce of Microworkers.com, a substantial share of tasks was completed by recurring workers who consistently outperformed one-time participants. Furthermore, across all campaigns, data quality remained largely stable, with only minor variability between epochs. Additionally performed statistical analyses confirm that reproducible outcomes are achievable, highlighting the potential of reliable and reproducible crowdsourcing results for geospatial data acquisition.

4:45pm - 5:00pm

Shaping the Colonial Port: Urban Networks and Spatial Form in the Early Modern Era

Chaoqun Wang, Jie He

Harbin Institute of Technology, Shenzhen, China, People's Republic of

This abstract presents a comprehensive research framework examining the interplay between colonial trade networks and the spatial form of port cities during the early modern era. Firstly, the study constructs a geographic database of nearly 300 colonial port cities, using intercity trade data from East India Company archives as network edges to analyze their structural and morphological evolution. Secondly, it processes historical maps of colonial ports through a fine-tuned multimodal large language model to extract and classify spatial morphological features, establishing a systematic typology of urban form patterns. Thirdly, the research develops regression models to reveal correlations between network status and morphological patterns. Preliminary findings highlight Batavia's dominant yet volatile role within the network and reveal a trend toward decentralization over the 18th century. The research contributes to both urban historical studies and digital humanities by offering a scalable, comparative approach to interpreting colonial port cities as spatial manifestations of global economic and political forces, while establishing empirical relationships between network status and urban form characteristics. It further provides a refined framework for contextualizing their cultural heritage significance within trans-colonial networks.

5:00pm - 5:15pm

Vector generalization of the drainage network

Thalles Aquino¹, Edilson Bias¹, Maurício Paulo², Raul Feitosa³, Felipe Ferrari²

¹University of Brasília, Brazil; ²Institute of Engineering, Rio de Janeiro, Brazil; ³Pontifical Catholic University, Rio de Janeiro, Brazil

This study explores the application of Graph Convolutional Networks (GCNs), specifically the GraphSAGE model, to the cartographic generalization of hydrographic networks in the state of Santa Catarina, Brazil. The generalization of river segments is critical for transitioning from detailed (1:25,000) to generalized (1:100,000) scales. It's traditionally a manual, rule-based process. By modeling drainage systems as graphs and training deep learning models with data from the Brazilian Army's Geospatial Database (BDGEx), this research evaluates how geometric and semantic attributes influence generalization outcomes. This data follows Brazilian Technical Specifications of the Geospatial Vector Data Structure (ET-EDGV), therefore they figure as a systematic data from Brazilian institutions.

GraphSAGE model was trained four times, each incorporating varying combinations of attributes such as segment length, sinuosity, polygon containment, and river flow regime. The model trained with all attributes achieved the highest accuracy (99.98%). Even models using geometric features surpassed 93% accuracy. These results highlight the effectiveness of GCNs in capturing structural patterns.

This study compares GraphSAGE model outputs to those generated by the GeoData Loader for Mapserver (GDLMS), the current operational system for generalization, developed and used by the Geographic Service of the Brazilian Army. It also compares those generalization to reference data acquired by manual generalization using the same 1:25.000 scale input. Visual analysis in GIS environments reveals that GCNs can be an alternative for generalization tasks.

This research demonstrates the viability of using GeoAI methods for automating complex cartographic processes, offering a scalable and data-driven solution aligned with national geospatial data standards.

3:30pm - 5:15pm

WG III/4A: Landuse and Landcover Change Detection
Location: 714A

3:30pm - 3:45pm

ChangeDINO: DINOv3-Driven Building Change Detection in Optical Remote Sensing Imagery

Ching-Heng Cheng¹, Chih-Chung Hsu²

¹National Cheng Kung University, Tainan, Taiwan; ²National Yang Ming Chiao Tung University, Hsinchu, Taiwan

Remote sensing change detection (RSCD) aims to identify pixel-wise surface changes from co-registered bi-temporal images. However, many deep learning–based RSCD methods rely solely on change-map annotations and underuse the semantic information in non-changing regions, which limits robustness under illumination variation, off-nadir views, and scarce labels.

This paper presents ChangeDINO, an end-to-end multiscale Siamese framework for optical building change detection. The model fuses a lightweight backbone stream with features transferred from a frozen DINOv3, yielding semantic- and context-rich pyramids even on small datasets. A spatial–spectral differential transformer decoder then exploits multi-scale absolute differences as change priors to highlight true building changes and suppress irrelevant responses. Finally, a learnable morphology module refines the upsampled logits to recover clean boundaries. Experiments on four public benchmarks demonstrate that ChangeDINO achieves strong accuracy and robustness under cross-temporal appearance variations, yielding cleaner building boundaries with improved data efficiency.

3:45pm - 4:00pm

Hie-DinoMamba: Hierarchical DINOv3 and Mamba Architecture for Multi-Class Building Change Detection

Youngwoong Yoon¹, Jangwoo Cheon¹, Hwiyoung Kim¹, Impyeong Lee²

¹Geospatial Team, Innopam, Seoul, Republic of Korea; ²Department of Geoinformatics, University of Seoul, Seoul, Republic of Korea

Multi-class building change detection in high-resolution aerial imagery is essential for urban monitoring, yet remains challenging due to severe class imbalance and the limited representational capacity of encoders trained from scratch. We propose Hie-DinoMamba, a novel architecture that integrates a frozen 1.1B-parameter DINOv3-L encoder—pre-trained on the SAT-493M satellite dataset—with a newly designed Hierarchical Mamba FPN decoder. To bridge the domain gap between satellite pre-training and aerial imagery without incurring prohibitive computational costs, we adapt the encoder using parameter-efficient Low-Rank Adaptation (LoRA), updating only a small fraction of parameters while preserving the encoder's rich pre-trained knowledge. The decoder fuses multi-scale feature pairs from both time points via channel-wise concatenation and 1×1 projection, then refines them in a top-down manner using Visual State Space Model (VSSM) blocks that capture long-range spatial context with linear complexity. A dual-loss strategy decouples semantic classification (Focal Loss) from boundary delineation (Focal Tversky + Dice Loss), optimizing each objective at a different hierarchical level. On a 4-class aerial building change detection benchmark (41,548 image pairs, 0.1 m resolution, Seoul), Hie-DinoMamba achieves a state-of-the-art mIoU of 65.12% and Kappa of 75.77%, improving over the strongest baseline by 2.1 percentage points. An ablation study confirms that LoRA adaptation is the most critical component. Qualitative analysis further demonstrates robust generalization to geographically unseen regions.

4:00pm - 4:15pm

Stepwise Optimization and Ensemble Pipeline for Building Change Detection in High Resolution Satellite Imagery Using Mamba-Based Model

DongHyuk Jin¹, Junhwa Chi²

¹Department of Data Engineering, Pukyong National University, Busan, Republic of Korea; ²Division of Data Information Sciences, Pukyong National University, Busan, Republic of Korea

This study presents a stepwise optimization pipeline for high-resolution building change detection in dense urban environments using imagery from CAS500-1, Korea’s national land observation satellite. A dataset of 3,816 bi-temporal patch pairs from 29 urban regions was constructed to support model development and evaluation. A Mamba-based architecture, incorporating efficient global context modeling, was adopted as the baseline for binary change detection.

To enhance performance, the pipeline introduced three sequential optimization stages. First, normalization techniques suited for 12-bit radiometric imagery were compared, including percentile-based scaling, gamma adjustment, and logarithmic transformation. Second, augmentation strategies were evaluated, contrasting standard geometric augmentation with extended optical and temporal augmentation designed to improve generalization in structurally complex urban environments. Third, multiple ensemble strategies, ranging from simple averaging to confidence-weighted and hierarchical aggregation, were examined to overcome the limitations of individual model sizes.

Model performance was assessed using a comprehensive set of pixel-level, change-pixel-level, contour-based, and object-based metrics to ensure robust evaluation of both spatial precision and structural consistency. Experimental results showed that gamma-based normalization, comprehensive augmentation, and selected ensemble strategies each contributed measurable improvements. Combining these optimized components yielded a final hierarchical ensemble that improved the F1-Score from 0.7629 to 0.8070, representing a substantial gain over the baseline model.

Overall, this work provides a validated and extensible optimization strategy for high-resolution satellite-based change detection, offering practical guidance for operational applications and adaptability to future ensemble configurations across diverse architectures.

4:15pm - 4:30pm

Leveraging Geospatial Foundation Models for Bi-Temporal Land-Cover Change Detection

Mozhdeh Shahbazi, Mikhail Sokolov, Charles Authier, Marjan Asgari

Canada Centre for Mapping and Earth Observation, Natural Resources Canada, Canada

Recent advances in geospatial foundation models have enabled scalable and transferable solutions for Earth observation (EO) tasks, which can make them good candidates to achieve the requirements mentioned above. Foundation models are types of large-scale artificial intelligence (AI) models trained on massive and diverse datasets. In the EO domain, these datasets may include imagery, elevation models, geographic coordinates, temporal tags, sensors spectral information, and descriptive metadata. These models excel at representation learning through self-supervised training, enabling them to capture rich descriptive features without requiring labelled data. Consequently, they can serve as powerful backbones for downstream tasks such as land-cover change monitoring.

Accordingly, this paper provides an overview of the development process of a geospatial foundation model, Planaura. It demonstrates how this model is best adapted to Canadian landscapes and how it is used to achieve the task of land-cover change detection. Planaura is now accessible publicly via the model hub at HuggingFace: [Link hidden for blind review process]

4:30pm - 4:45pm

A Transformer-Based Framework for Spatiotemporal Unmixing of Land–Water Mixtures in Multispectral Satellite Data

An Bao Nguyen¹, Andreas Schenk², Stefan Hinz²

¹KU Leuven, Leuven, Belgium; ²Karlsruhe Institute of Technology, Karlsruhe, Germany

This paper presents a novel transformer-based framework for spatiotemporally dynamic spectral unmixing of multispectral satellite imagery. Spectral unmixing is essential for analyzing mixed pixels in remote sensing, especially in analyzing small objects such as narrow rivers when using coarse-resolution observations such as Sentinel-2 data. Most deep-learning based unmixing models typically account for a single scene and ignore the tempo-spatial variation of spectra and land-cover proportions.

To address this challenge, we introduce a unified deep learning architecture that leverages transformer attention mechanisms to exploit both spectral and auxiliary information causing spectral variations. The framework models the temporal and spatial evolution of abundances while simultaneously learning representative endmember spectra. By integrating cross-attention between spectral inputs, auxiliary variables, and temporal embeddings, the model can adapt to seasonal changes, illumination conditions, and scene-specific variability. The method is trained using synthetic mixtures derived from Sentinel-2 surface reflectance data.

Applied to monitoring small rivers with strong temporal, and spatial, and intrinsic variability, the proposed approach demonstrates improved accuracy in estimating water abundances and extracting water spectra in highly mixed river pixels (mixed with water and riverbank). The model effectively captures tempo-spatial transitions in water extent and sediment-laden river inflows, offering a more consistent representation than conventional unmixing techniques.

This work contributes a generalizable and end-to-end framework for handling dynamic unmixing scenarios in multispectral remote sensing. It provides new insights into the use of transformers for modeling spatiotemporal interactions and supports applications in environmental monitoring and water resource assessment.

4:45pm - 5:00pm

Land Cover Classification of Optical–SAR Imagery via Cross-Modal Interaction and Feature Alignment

Junqi Zhao, Min Chen, Wei Guo, Jinbo Zhang, Zelan Fu, Xuming Ge, Han Hu, Bo Xu, Qing Zhu

Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu, 611756, China

Land cover classification (LCC) plays a crucial role in geoscientific research and resource monitoring applications. Compared

with traditional single-modal classification methods, multimodal fusion models can more effectively leverage the complementary

information of optical and synthetic aperture radar (SAR) imagery, thereby improving classification performance in complex scen-

arios. However, due to the significant differences in the imaging mechanisms of the two sensors, inconsistencies in radiometric

properties and spatial structures arise between optical and SAR images, posing challenges for cross-modal feature interaction and

fusion. To address this issue, we propose a multimodal optical–SAR fusion network (MOSFNet) for high-precision LCC, which

incorporates two core modules: the Feature Interaction Module (FIM) and the Feature Fusion Module (FFM). The FIM achieves

complementary feature interaction between optical and SAR images through channel splitting and cross concatenation, while in-

corporating a coordinate attention mechanism to enhance the responsiveness of key land cover regions. The FFM leverages a 2D

selective scan (SS2D) mechanism to implement bidirectional cross-modal feature alignment and gated fusion in the hidden state

space, enabling deep correlation and adaptive integration of optical and SAR features. Experiments on the WHU-OPT-SAR dataset

demonstrate that MOSFNet significantly outperforms existing methods in terms of classification accuracy and model generalization,

providing an efficient and robust solution for high-precision land cover mapping with multi-source remote sensing imagery.

5:00pm - 5:15pm

Seasonal-Aware Scale-Semantic Consistency Alignment Change Detection Network

Bing Shao¹, Hanchao Zhang¹, Mingzhu Li², Yunkun Zou³, Ruiqian Zhang¹, Xiaogang Ning¹, Hao Wang¹

¹Chinese Academy of Surveying and Mapping Beijing, China; ²Liaoning Technical University Geomatics and Geographical Sciences, Fuxin, China; ³Joint Laboratory of Spatial Intelligent Perception and Large Model Application, Nanjing, China

Change detection in remote sensing imagery is a crucial method for obtaining dynamic information about land cover. However, pseudo-changes caused by seasonal variations pose a significant challenge to detection accuracy. Seasonal variations, such as vegetation phenology and snow cover, introduce global appearance differences that are often mistaken for actual land cover changes. This phenomenon is particularly prominent in long-term monitoring tasks, where pseudo-changes dominate the detection results. Addressing the issues of global appearance differences and multi-scale feature fusion induced by seasonal changes, We propose a novel Seasonal-Aware Scale-Semantic Consistency Alignment Change Detection Network (SSCANet) for remote sensing image change detection. This approach incorporates a Seasonal-Aware Scale Alignment (ASA) module and a Seasonal-Aware Semantic Guided Fusion (SGF) module. By employing spatial scale transformation and semantic alignment, it reduces information mismatch in multi-scale feature fusion and enhances the perception of details in change regions. Experiments conducted on the GZ-CD and CDD datasets demonstrate that SSCANet achieves overall accuracy with F1 scores of 89.21% and 97.82%, with precision rates of 89.02% and 98.37%, respectively. These results represent significant improvements over other methods, demonstrating that SSCANet outperforms its counterparts in both overall accuracy and seasonal robustness. The findings confirm that this approach effectively suppresses seasonal false changes, enhancing the accuracy and reliability of change detection.

3:30pm - 5:15pm

WG I/2B: Mobile Mapping Technology
Location: 714B

3:30pm - 3:45pm

Mitigating trajectory drift in tunnel mapping: evaluation of conventional and novel approaches applied to SLAM-based mobile mapping solution

Antonio Gualtiero Mainardi¹, Simone Marmaglio², Luca Perfetti¹, Giorgio Paolo Maria Vassena¹

¹Università degli Studi di Brescia, Dept. of Civil Engineering, Architecture, Territory, Environment and Mathematics (DICATAM), Italy; ²Università degli Studi di Brescia, Dept. of Information Engineering (DII), Italy

In Indoor Mobile Mapping Systems (iMMS) the trajectory estimation is implemented by the SLAM (Simultaneous Localization and Mapping) algorithm. By assuming a fixed environment surrounding the instrument, the algorithm relies on stable geometries to establish the trajectory. Drift effects represent the main source for errors and affect the trajectory estimation. These effects can be magnified in feature-deficient or degenerate environments, where the variation of geometrical elements can be minimal, as in the case of tunnels. In this context, difficult environments such as tunnels are suitable for the implementation of alternative algorithms for the trajectory estimation. Considering this kind of scenario, the contribution has the twofold objective of evaluating the results of two trajectory estimation methods, in terms of trajectory drift, with reference to an indoor SLAM-based MMS, and to establish a repeatable methodology to do so. A novel algorithm for the trajectory estimation, not just relying on geometrical SLAM algorithm, but also taking advantage of reflectance images coming from LiDAR sensors mounted on the system, is considered.

The case study is a 200 m long branch of a motor-way tunnel, with a diameter of 15 m. The test is further subdivided by computing all trajectories with different constraining strategies, first without any constraints, then considering global optimisation, loop closure and static control scans, to replicate typical realistic scenarios in tunnel mapping. The results of this work highlight how the novel reflectance-aided SLAM algorithm is beneficial in terms of drift reduction in the estimated trajectories.

3:45pm - 4:00pm

Range Error Detection and Evaluation for retroreflective Road Signs in Phase-Shift MMS Point Clouds

Saori Fukushi¹, Yoji Takahashi¹, Hirofumi Chikatsu²

¹Aero Toyota Corporation; ²Tokyo Denki University

This presentation addresses the challenge of range errors in point clouds of road signs captured by Mobile Mapping Systems (MMS) equipped with phase-shift laser scanners.

Under certain conditions, retroreflective materials cause range errors in point clouds. Previous studies have proposed mitigation techniques for range errors caused by sensor saturation in TOF systems, but similar studies on phase-shift systems are scarce. In addition, existing road sign detection methods assume accurate point representation, making them ineffective when sign points are displaced.

To overcome this limitation, we developed a detection method that first extracts road signs through point cloud visualization and then identifies range errors based on the standard deviation of relative distances from reference emission points.

The proposed approach was validated using 5 km of driving data collected on general roads. Results show that 32 road signs were extracted, and 26 were correctly detected as exhibiting range errors, achieving 100% agreement with manual visual assessment.

This study demonstrates the effectiveness of the proposed detection method and its potential for improving the reliability of identifying range errors of road signs on general roads.

4:00pm - 4:15pm

An RTK-SLAM Dataset for Absolute Accuracy Evaluation in GNSS-Degraded Environments

Wei Zhang, Vincent Ress, David Skuddis, Uwe Soergel, Norbert Haala

University of Stuttgart, Germany

RTK-SLAM systems integrate simultaneous localization and mapping (SLAM) with real-time kinematic (RTK) GNSS positioning, promising both relative consistency and globally referenced coordinates for efficient georeferenced surveying. A critical and underappreciated issue is that the standard evaluation metric, Absolute Trajectory Error (ATE), first fits an optimal rigid-body transformation between the estimated trajectory and reference before computing errors. This so-called SE(3) alignment absorbs global drift and systematic errors, making trajectories appear more accurate than they are in practice. We present a geodetically referenced dataset and evaluation methodology that expose this gap. A key design principle is that the RTK receiver is used solely as a system input, while ground truth is established independently via a geodetic total station. This separation is absent from all existing datasets, where GNSS typically serves as (part of) the ground truth. The dataset is collected with a handheld RTK-SLAM device, comprising two scenes. We evaluate LiDAR-inertial, visual-inertial, and LiDAR-visual-inertial RTK-SLAM systems alongside standalone RTK, reporting direct global accuracy and SE(3)-aligned relative accuracy to make the gap explicit. Results show that SE(3) alignment can underestimate absolute positioning error by up to 76\%. RTK-SLAM achieves centimeter-level absolute accuracy in open-sky conditions and maintains decimeter-level global accuracy indoors, where standalone RTK degrades to tens of meters. The dataset, calibration files, and evaluation scripts are made publicly available. The dataset, calibration files, and evaluation scripts are publicly available at https://rtk-slam-dataset.github.io/

4:15pm - 4:30pm

Novel View Synthesis Under Rainy Conditions with Neural Radiance Fields and Gaussian Splatting

Ivana Petrovska, Boris Jutzi

Karlsruhe Institute of Technology, Germany

Scene reconstruction and novel view synthesis from calibrated multi-view images still attracts a lot of attention in computer vision and graphics. However, the assumption that images are noise-free rarely holds in real-world scenarios where adverse weather conditions are inevitable. Being a part of our environment, we are particularly interested in rain as dynamic semi-transparent occlusion which imposes challenges to a complete and accurate geometry of the underlying features. More precisely, we qualitatively and quantitatively analyze the photometric image quality under rainy conditions generated by radiance field methods, namely: Neural Radiance Fields (NeRFs), 3D Gaussian Splatting (3DGS) and 2D Gaussian Splatting (2DGS) due to the different geometric representation. To assess the impact of rain to the scene reconstruction we consider raindrops and streaks captured with illumination variation as well as occlusion masks with different coverage. The evaluation is based on comparing 2D image metrics of the rendered novel views without and with masks. The experiments and results show that 3DGS achieves highest rendering fidelity in all scenarios without and with masks with SSIM of 0.724 and LPIPS of 0.291, followed by 2DGS with slightly lower scores, while NeRF exhibits lowest correspondence with the input images with SSIM of 0.584 and LPIPS of 0.384. We demonstrate the effectiveness of using masks to handle rain as transient element and radiance field methods’ ability to reliably approximate the geometry behind rain occlusions.

4:30pm - 4:45pm

Toward Seawall Monitoring via Tracking Model-Derived Feature Points of Tetrapods from 3D Point Clouds

Ting On Chan¹, Derek Lichti²

¹School of Geography and Planning, Sun Yat-sen University, China, People's Republic of; ²Department of Geomatics Engineering, University of Calgary, Canada

In recent years, many coastlines worldwide have retreated under the influence of storm surges and other extreme events, exacerbated by intensifying wave conditions in certain regions and seasons. Consequently, wave-dissipating units (e.g., tetrapods) have been widely deployed for coastal protection. In this paper, we propose a novel three-dimensional geometric method for extracting robust feature points from 3D point clouds to track tetrapod displacements and assess seawall safety. The model represents a tetrapod as four cylinders sharing a common center. By fitting this geometric model to the point cloud, we obtain parameters that allow us to derive multiple feature points—such as the intersections of conical surfaces—which can also be verified through alternative measurement techniques. These feature points serve as stable references for position comparison and displacement estimation. As this research is at an early stage, we have not yet collected field data from full-scale tetrapods. Instead, we conducted indoor experiments using a 3D depth camera (Microsoft Azure) in place of LiDAR, utilizing several high-fidelity resin tetrapod scale models (approximately 10 cm in height) as test subjects. The results demonstrate the feasibility of our method: when compared against total-station measurements, our approach yields highly accurate displacement estimates (averaging approximately 3 mm). This provides a solid foundation for the future deployment of 3D laser scanning in seawall monitoring.

4:45pm - 5:00pm

Application of Side-Scan Sonar and Multibeam Echosounder for the Investigation of Underwater Cultural Heritage – A Case Study of a Wreck in the Baltic Sea

Klaudia Pasternak, Paulina Jaczewska, Patryk Wróblewski

Military University of Technology in Warsaw, Poland

As the technology of hydroacoustic sensors advances, there is a growing trend in the use of generated sonar images and point clouds in the analysis of the seabed and objects of anthropogenic origin in water bodies. In the context of cognitive and practical dimensions, obtaining data on sunken ships is of particular importance. Based on the data obtained from hydroacoustic sensors, it is possible to extract their geometric features. As a result, it is possible to develop digital repositories of wrecks, based on sonar and bathymetric data, among others, which in the future may enable the construction of integrated knowledge bases on underwater heritage. The purpose of the work was to extract the geometric features of the wreck of the Zawiszaczek located in the Puck Bay of the Baltic Sea. As part of the work, bathymetric measurements were planned, side-scan sonar and multibeam echosounder data were collected. Based on the acquired data, the geometric features of the wreck were extracted. The differences in the wreck's dimensions, as determined by sonar images obtained from different routes, did not exceed 0.25 m.

3:30pm - 5:15pm

WG II/6: Cultural Heritage Data Acquisition and Processing
Location: 715A

3:30pm - 3:45pm

Open Technologies for the 3D Cultural Heritage Digitisation Pipeline

Fotis Arnaoutoglou¹, Peter Bonsma², Elisa Mariarosaria Farella³, Anestis Koutsoudis¹, Niki Kyriakou⁴, Marco Medici⁵, Vangelis Nomikos⁴, Anthony Pamart⁶, Fabio Remondino³

¹ATHENA Research Centre, Greece; ²RDF Ltd, Bulgaria; ³3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK), Trento, Italy; ⁴Talent S.A., Greece; ⁵INCEPTION, Spin-off of the University of Ferrara, Italy; ⁶MAP CNRS, Marseille, France

This paper introduces the 3D-4CH project and its open framework, i.e. a sustainable ecosystem of tools designed to overcome the fragmentation and limited maintainability of previous EU-funded 3D heritage initiatives. Aligned with the European Collaborative Cloud for Cultural Heritage (ECCCH), the framework integrates an end-to-end pipeline for 3D data generation and processing, semantic enrichment and long-term dissemination, including metadata and paradata inclusion. The 3D-4CH initiative bridges the gap between ICT research and operational heritage practices, ensuring the scalability and reproducibility of 3D digital assets for cross-institutional data sharing and preservation. All software components, including GitHub repositories and online processing frameworks, are openly available, in accordance with open science principles and FAIR data practices. Further information is available at https://www.3d4ch-competencecentre.eu/en/tools/.

3:45pm - 4:00pm

Metric Reliability and Operational Adaptability in the context of the Integrated 3D Metric Survey of the Genete Leul Palace (Addis Ababa, Ethiopia)

Elisabetta Colucci, Giacomo Patrucco, Mahtab Fallah, Andrea Demartis

Department of Architecture and Design (DAD), Laboratory of Geomatics for Cultural Heritage, Politecnico di Torino, Italy

The paper presents the integrated 3D metric survey of the Genete Leul Palace in Addis Ababa, demonstrating how metric reliability and operational speditivity can coexist through an adaptive hybrid TLS–MMS workflow that supported the restoration project and heritage documentation in a low-infrastructure context.

4:00pm - 4:15pm

Photogrammetry Laser Scanning and Reverse Engineering Conrad’s Jewel

Adam Weigert, Mario Santana, Stephen Fai, Sena Kurcenli Koyunlu

Carleton Immersive Media Studio, Canada

Laser scanning, photogrammetry, and other technical tools are staples for cultural heritage documentation and reverse engineering projects. However, manufacturers and even researchers often conflate the data capture process with reverse engineering itself, even though the data alone cannot provide the insight needed for a full reverse engineering or understanding of the historic site. This paper illustrates how laser scanning and photogrammetric applications were used in reverse engineering the construction and details of Conrad’s Jewel, a 1908 Silver/Gold mill in the Yukon, Canada. Analogous to systems and software engineering fields, the reverse engineering process is framed by considering related designs, existing documentation, personal experience, and general external knowledge.

4:15pm - 4:30pm

Modelling Transparent Surfaces in Heritage Artifacts with Gaussian Splatting

Marco Medici¹, Andrea Sterpin^1,2, Stefano Settimo¹, Matteo Bevilacqua¹, Gianluca Bertolasi³, Simone Rigon³, Elisa Mariarosaria Farella³, Fabio Remondino³

¹INCEPTION s.r.l., Spin-off of the University of Ferrara, Italy; ²Department of Architecture, University of Ferrara, Italy; ³3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK), Trento, Italy

The 3D reconstruction of cultural heritage artefacts plays a crucial role in documentation, conservation and dissemination. While recent advances in photogrammetry, laser scanning and neural rendering techniques have significantly improved the geometric accuracy and visual realism of digitised assets, the reconstruction of transparent and reflective materials - typical in museal collections - remains a major challenge. Materials such as glass, glazes and varnishes exhibit complex optical behaviours, leading to incomplete or inaccurate 3D models. Recent developments in Gaussian Splatting (GS) offer a potential alternative by enabling efficient, high-fidelity scene representation without explicit surface modelling. However, their application to non-Lambertian and transparent heritage objects remains largely unexplored. This paper presents a study on GS methods for the 3D digitisation of transparent cultural heritage artefacts. Through a series of experimental reconstructions, the work investigates the potential and limitations of GS, highlight the opportunities of hybrid pipelines for addressing long-standing challenges in the digitisation of non-collaborative materials.

4:30pm - 4:45pm

Evaluating generative AI for museum artifacts documentation

Elisa Mariarosaria Farella, Simone Rigon, Gianluca Bertolasi, Fabio Remondino

3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK)

In recent years, the European Commission (EC) identified the 3D digitization of cultural heritage sites and artifacts as one of its priorities and promoted numerous initiatives and recommendations to accelerate documentation campaigns. However, current digitization targets remain far from being achieved, and heritage institutions have been increasingly encouraged to explore faster and cost-effective 3D documentation solutions. Moreover, traditional image- and range-based 3D surveying techniques frequently struggle when reconstructing objects featuring non-collaborative surfaces (such as reflective or transparent objects), are time-consuming, and require specialized knowledge. Generative AI methods, able to generate 3D models also from a single input image, have recently emerged as a potentially faster alternative, yet their performance on heritage assets remains mostly unexplored. This paper evaluates three state-of-the-art and recent single-image GenAI frameworks - SAM3D, Tripo3D and Trellis2 - on several museum artifacts featuring diffuse, reflective, transparent, and mixed-material surfaces of varying scale and geometric complexity, for which accurate ground truth is available. The aim is to analyze whether these frameworks can act as complementary or alternative solutions for fast heritage documentation.

4:45pm - 5:00pm

LiDAR-Guided Illumination-Aware 3D Gaussian Splatting for Cultural Heritage

Xiao LIU¹, Xinyi LI¹, Wan LI², Tao LIU¹, Wei SUN¹, Sheng Zhang³

¹Wuhan Geomatics Institute; ²Hubei Surveying and Mapping Quality Supervision and Inspection Station; ³Langfang Natural Resources Comprehensive Survey Center, CGS

To address the issues of geometric distortion and loss of details in 3D modeling for complex cultural heritage scenes, this paper proposes an improved 3D Gaussian Splatting (3DGS) reconstruction method that integrates LiDAR and illumination-awareness. First, high-precision 3D coordinates from LiDAR point clouds are utilized to guide the initialization of Gaussian Primitives, establishing a precise geometric foundation and effectively overcoming deformation on weakly textured surfaces. Second, an illumination-aware network is constructed to dynamically adjust appearance parameters by combining global illumination from images with LiDAR reflectance intensity. This decouples complex lighting from material properties, accurately reproducing the unique textures of artifacts. Finally, a multi-dimensional joint loss function incorporating photometric, scale, and appearance smoothness constraints is introduced to collaboratively optimize scene geometry, appearance, and camera poses. Experimental results on indoor and outdoor cultural heritage preservation scenarios demonstrate that the proposed method significantly outperforms various comparative algorithms in terms of both visual fidelity and geometric accuracy. The quantitative and qualitative evaluations confirm that our approach effectively eliminates geometric distortions and recovers fine texture details, providing an efficient and reliable technical solution for the digital preservation of cultural heritage.

5:00pm - 5:15pm

Usability and Potential of Historical Glass Plate Images for 3D Object Reconstruction and Comparison to current Monitoring Data

Heidi Hastedt¹, Ferdinand Maiwald², Silke Wiedmann³, Till Sieberth¹

¹Jade University of Applied Sciences, Institute for Applied Photogrammetry and Geoinformatics, Oldenburg, Germany; ²Chair of Optical 3D-Metrology, Dresden University of Technology, Germany; ³German Maritime Museum – Leibniz Institute for Maritime History, Bremerhaven, Germany

Cultural Heritage assets as the Bremen Cog at the German Maritime Museum are often subject to long-term preservation processes and being monitored over time. The Bremen Cog, a clinker-build vessel from 1380, was found in the River Weser in 1962 and thereafter salvaged and reconstructed until 1981. Prior to conservation efforts (1981 to 1999), a photogrammetric 3D measurement campaign was conducted using a stereometric camera SMK 120. Due to deformation a permanent support system was installed in 2003 including the application of local corrections using pressure plates to correct the hull to its measured one from 1980. Since 2020 a long-term geometric monitoring of the cog has been carried out in order to detect deformation. With the analyses of the monitoring data in connection with the measurement conditions, it is of high interest whether the cog in its current shape corresponds to the one estimated in 1980. Historic SMK 120 stereo image pairs on glass plates are analysed in order to estimate their usability and potential for 3D object reconstruction and subsequently comparing the results to the current monitoring data. The proposed workflow includes an optimized digitization process of the glass plate and reconstruction of the interior and exterior orientations. Feature detection and matching methods as well as robust orientation tasks are analysed in order to allow for a 3D hull reconstruction. The reconstruction at least in parts of the cog and with lower precision is desirable and promising in terms of evaluating changes of the hull over time.

5:15pm - 5:30pm

Full Object Photogrammetry for Architectural Artefacts using the “Mask Model Method”

Adam Weigert¹, Miquel Reina Ortiz², Chloe Dennis³, Lauren Daniels¹, Yesmine Bennani⁴, Mario Santana Quintero¹, Stephen Fai¹

¹Carleton Immersive Media Studio (CIMS), Carleton University, Ottawa, Canada; ²Université de Montréal, Montréal, Canada; ³Bytown Museum, Ottawa, Canada; ⁴University of Hong Kong, Pok Fu Lam, Hong Kong

Photogrammetry and laser scanning are widespread tools for documenting movable and immovable cultural heritage assets. Documenting the entire surface of an object presents a set of specific challenges, with various solutions currently available. Complete object documentation relies on established capture techniques that utilize the registration method for different model orientations. This paper presents the “Mask Model Method,” a semi-automatic approach for seamlessly documenting entire objects while seeking high-quality results. This workflow works well for most objects that would be considered viable for general photogrammetric capture. The advantages are also in capturing small and large objects (with and without a turntable) with hinge-type moving parts. This method of documenting full architectural artefacts is useful in heritage conservation, repairs, and restoration; specifically, digital patternmaking, virtual reconstruction, digital annotation of historic materials & geometry, and applied experimental archaeology.

3:30pm - 5:15pm

WG II/3D: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

3:30pm - 3:45pm

CARS: A Photogrammetric Pipeline for Global 3D Reconstruction using Satellite Imagery

David Youssefi¹, Valentine Bellet¹, Yoann Steux², Mathis Roux², Cédric Traizet², Marian Rassat², Tommy Calendini²

¹CNES, France; ²CS GROUP, France

We present CARS, a multiview stereo pipeline developed by CNES. This pipeline will be integrated into the CO3D mission processing chain, a mission whose goal is to generate a 3D model of the Earth in less than four years. Because this is an operational mission involving massive production, particular attention has been paid to ensuring that the software is robust, efficient and includes a set of advanced automatic processing features. The paper will provide a comprehensive overview of all the features developed since its creation to achieve this goal.

3:45pm - 4:00pm

SatGeo-NeRF: Geometrically Regularized NeRF for Satellite Imagery

Valentin Wagner¹, Sebastian Bullinger¹, Michael Arens¹, Rainer Stiefelhagen²

¹Fraunhofer Institute of Optronics, System Technologies and Image Exploitation (IOSB); ²Karlsruhe Institute of Technology (KIT)

We present SatGeo-NeRF, a geometrically regularized NeRF for satellite imagery that mitigates overfitting-induced geometric artifacts observed in current state-of-the-art models using three model-agnostic regularizers. Gravity-Aligned Planarity Regularization aligns depth-inferred, approximated surface normals with the gravity axis to promote local planarity, coupling adjacent rays via a corresponding surface approximation to facilitate cross-ray gradient flow. Granularity Regularization enforces a coarse-to-fine geometry-learning scheme, and Depth-Supervised Regularization stabilizes early training for improved geometric accuracy. On the DFC2019 satellite reconstruction benchmark, SatGeo-NeRF improves the Mean Altitude Error by 13.9% and 11.7% relative to state-of-the-art baselines such as EO-NeRF and EO-GS.

4:00pm - 4:15pm

HDR Radiance Learning and Shadow Regularization for Satellite NeRF 3D Reconstruction

Yongjun Song, Pablo d’Angelo

German Aerospace Center (DLR), Germany

High dynamic range (HDR) variations in satellite optical imagery arise from extreme differences in surface reflectance and illumination conditions. Conventional satellite NeRF frameworks are typically trained on tone-mapped or radiometrically enhanced images, where nonlinear preprocessing alters the physical relationship between measured pixel values and true scene radiance. This leads to biased photometric optimization and loss of geometric fidelity, especially under strong illumination contrasts. To address these limitations, we propose an HDR-consistent learning framework that integrates RawNeRF-style radiance supervision with shadow regularization. The method trains directly on raw satellite imagery using a logarithmic, tone mapping–aware loss that preserves linear radiance and stabilizes optimization under high dynamic range conditions. In parallel, a soft shadow regularization constrains network-predicted shadows using geometric cues derived from solar ray casting, promoting physically consistent irradiance decomposition. Experiments on four AOIs from the DFC2019 dataset demonstrate that HDR-aware radiance learning significantly improves DSM accuracy by maintaining linear radiometric consistency. The proposed shadow regularization also improves geometric consistency in structure-dominated urban scenes, although its effect is limited in vegetation-dominant areas where shadow cues are less informative. Although performance gains are smaller in vegetation-dominant areas, the results confirm that combining HDR radiance learning with geometric shadow regularization yields more radiometrically consistent and geometrically accurate 3D reconstruction from satellite imagery.

4:15pm - 4:30pm

EOGS++: Earth Observation Gaussian Splatting with Internal Camera Refinement and Direct Panchromatic Rendering

Pierrick Bournez¹, Luca Savant Aira², Thibaud Ehret³, Gabriele Facciolo¹

¹Universite Paris-Saclay, CNRS, ENS Paris-Saclay, Centre Borelli, 91190, Gif-sur-Yvette, France; ²Politecnico di Torino, Corso Duca degli Abruzzi, 24, 10129 Torino TO, Italia; ³AMIAD, Pôle Recherche, France

Recently, 3D Gaussian Splatting has been introduced as a compelling alternative to NeRF for Earth observation, offering competitive reconstruction quality with significantly reduced training times.

In this work, we extend the EOGS framework to propose \namemodel, a novel method tailored for satellite imagery that directly operates on raw high-resolution panchromatic data %and multispectral data

without requiring external preprocessing.

Furthermore, we embed bundle adjustment directly within the training process with optical flow techniques, avoiding reliance on external optimization tools while improving camera pose estimation.

We also introduce several improvements to the original implementation, including early stopping and TSDF post-processing, all contributing to sharper reconstructions and better geometric accuracy.

Experiments on the IARPA 2016 and DFC2019 datasets demonstrate that EOGS++ achieves state-of-the-art performance in terms of reconstruction quality and efficiency, outperforming the original EOGS method and other NeRF-based methods while maintaining the computational advantages of Gaussian Splatting. Our model demonstrates an improvement from 1.33 to 1.19 mean MAE errors on buildings compared to the original EOGS models.

4:30pm - 4:45pm

Evaluating multi-view geometry for satellite-based 3D city modeling: towards 1+N constellation configurations

Xu Cheng, Xianfeng Huang, Yingdong Pi, Xinsheng Wang, Mi Wang

State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, 430079, China

The emergence of satellite constellations enables near-synchronous multi-view optical imaging, offering new opportunities for large-scale 3D city modeling. Yet a practically promising configuration, in which a primary near-nadir view is complemented by multiple oblique side-looking viewpoints, remains under-examined. This study develops a controlled semi-simulation framework to analyze how multi-view imaging geometry affects the recoverability of urban 3D structures. Under idealized conditions with imaging perturbations removed, e.g., radiometric, illumination, and sensor model errors, the experiments focus on three practical factors: the number of side-looking views, view obliqueness, and the constellation’s azimuthal orientation relative to the scene. With parameter sweep analysis, it reveals an asymmetric U-shaped trend between reconstruction performance and both the view count and the obliqueness: moderate angular diversity markedly strengthens urban scene recoverability. In contrast, large obliqueness reduces inter-view overlap and destabilizes matching, while excessive redundancy introduces consistency issues that ultimately degrade reconstruction performance. Furthermore, the results shows that geometric accuracy, completeness, and texture appearance each peak at different parameter combinations, revealing intrinsic trade-offs in multi-view urban reconstruction, as different evaluation criteria favor distinct optimal configurations. The study provides practical guidance for the geometric design and mission planning of multi-satellite constellations aimed at improving satellite-based 3D modeling in urban areas.

4:45pm - 5:00pm

Illumination-prior-based high-resolution DEM reconstruction from single-view lunar image constrained with initial DEM

Siyi Qiu¹, Zhen Ye^1,2, Rong Huang^1,2, Yusheng Xu^1,2, Xiaohua Tong^1,2

¹College of Surveying and Geoinformatics, Tongji University, Shanghai, China; ²The Shanghai Key Laboratory of Space Mapping and Remote Sensing for Planetary Exploration, Shanghai, China

This work presents an illumination-prior-based reconstruction model for high-resolution DEM generation from single-view lunar imagery, developed for the extreme illumination conditions and rugged terrain of the lunar south pole. The model integrates an initial DEM prior with multi-scale monocular image features and incorporates illumination priors derived from solar geometry to enhance stability in shadowed, low-texture, and terrain-transition regions. Through cross-modal feature fusion, it effectively aligns geometric structure with shading and photometric cues, enabling accurate recovery of fine-scale topography even when visual information is severely degraded. Experimental evaluations across multiple south-polar regions show that the proposed reconstruction model outperforms existing deep learning approaches and the classical Shape-from-Shading method in elevation, slope, and aspect accuracy, with independent validation using LOLA laser altimetry points confirming its improved geometric reliability. Visual comparisons demonstrate clear advantages in reconstructing crater rims, steep slopes, and permanently shadowed areas where conventional methods often fail or produce blurred terrain structures. The model also maintains robust performance under varying solar azimuths, highlighting the effectiveness of incorporating illumination priors to improve generalization in challenging environments. Overall, the proposed reconstruction model provides a reliable and effective solution for detailed lunar terrain recovery from monocular images and offers valuable support for scientific investigation, resource assessment, landing-site evaluation, and mission planning in the lunar south polar region.

5:00pm - 5:15pm

Construction of Control Network for Multi-temporal LRO NAC Images Based on Matching of Lunar Impact Craters

Pengying Liu^1,2, Jiayao Wang^1,2, Xun Geng^1,2, Zhen Peng^1,2, Jin Wang^1,2, Haoyu Zhang^1,2

¹State Key Laboratory of Spatial Datum, Faculty of Geographical Science and Engineering, Henan University, Zhengzhou, China; ²College of Geographic Sciences, Henan University, Zhengzhou, China

To address the critical demand for high-precision mapping of the Lunar South Pole (LSP)—a region pivotal for deep space resource utilization yet plagued by extreme illumination variations, extensive permanent shadow regions (PSRs), and weak texture—this study proposes a control network construction method for multi-temporal Lunar Reconnaissance Orbiter (LRO) Narrow Angle Camera (NAC) images, anchored in lunar impact crater matching. Leveraging the morphological stability and spatial consistency of impact craters, we first created a dedicated dataset: 94 multi-temporal LSP orthophotos (1 meter/pixel resolution) with manual annotations, allocating 70% for YOLOv8 model training and 30% for validation to ensure accurate crater detection (extracting center coordinates and semi-major/semi-minor axes). For virtual feature point matching, we integrated crater geometric attributes (coordinates, aspect ratio) and inter-crater topological relationships (distance, azimuth angle) to build local descriptors, enhanced by KD-tree indexing for efficient neighborhood queries, multi-attribute similarity measurement, and bidirectional voting to eliminate mismatches. For large craters, normalized cross-correlation (NCC) was used for secondary matching to refine accuracy. Post-matching, tie points were back-projected from orthophoto to original image space via ground coordinates. Experiments on 1,208 LRO NAC images showed the method outperforms SIFT and SuperPoint: it generated 938,029 tie points (even in dark shadows) with 2,347,629 measurements, and bundle adjustment achieved a sigma naught of 0.68. This work enables automatic high-quality control network construction, supporting reliable LSP topographic mapping for deep space exploration.

3:30pm - 5:15pm

ApS: Applied Session
Location: 716A

3:30pm - 3:45pm

A Multi-Stage Framework for Remote Sensing-Based Detection of Mining Disturbances Across British Columbia to Inform Salmon Habitat Conservation

Chen Shang¹, Olivier Tsui¹, Marc Porter², François-Nicolas Robinne³

¹Hatfield Consultants, 200-850 Harbourside Drive, North Vancouver, BC, V7P 0A3, Canada; ²Salmon Watersheds Program, Pacific Salmon Foundation, 300-1682 West 7th Avenue, Vancouver, BC, V6J 4S6, Canada; ³Forest Operations Branch, Alberta Forestry and Parks, J.G. O’ Donoghue Building, 7000-113 Street, Edmonton, AB, T6H 5T6, Canada

Mining activities constitute a major source of land disturbance in British Columbia and pose long-lasting risks to salmon-bearing watersheds through sedimentation, habitat fragmentation, and water quality degradation. However, existing mining inventories often lack spatial precision and consistency, limiting their usefulness for cumulative effects assessment. This study presents a new multi-stage remote sensing framework designed to systematically detect and map mining disturbances across the province using Landsat time series (1984–2023), Sentinel-2 imagery, and provincial mining databases.

The workflow integrates spectral–temporal change detection (LandTrendr), land cover and disturbance history from the Satellite-Based Forest Inventory, Sentinel-2 spectral clustering, and final visual interpretation using very high-resolution imagery. This approach effectively distinguishes mining disturbances from wildfires, harvesting, and other land surface changes common in BC’s diverse landscapes.

Applied province-wide, the framework identified 1,037 mining sites with a 92% thematic accuracy, producing the most spatially explicit and consistent inventory of mining disturbances currently available for British Columbia. Results highlight persistent mining hotspots and reveal that mineral mines—especially coal, gold, and silver—dominate the cumulative disturbance footprint, with peak activity occurring between 1970 and 1990.

The resulting dataset provides a critical foundation for evaluating the cumulative impacts of mining on salmon habitats and supports ongoing efforts toward transparent, data-driven land-use planning. The framework is scalable, updateable, and transferable to other regions where large-area monitoring of mining activity is needed.

3:45pm - 4:00pm

Compact Polarimetry Data for Estimation of Relative Oil Thickness

Gordon Staples, Ji Chen

MDA Space, Canada

The objective of this study was to investigate the application of RADARSAT Constellation Mission (RCM) CP data for the estimation of relative oil thickness. On July 25, 2020, the bulk carrier MV Wakashio ran aground off the coast of Mauritius with 1000 tonnes of oil was estimated to have spilled into the Indian Ocean. RCM CP data were acquired on August 9, 12, and 13, 2020. CP data entails the acquisition of two phase-preserving channels, CH and CV. A 5x5 polarimetric filter was applied and CP discriminators, Degree of Linear Polarization (DLP), Degree of Polarization (DOP), and Entropy (H), were extracted. For the three images, the DLP, DOP, and H were calculated for “thick” and “thin” oil, and oil-free regions. The performance of the DLP, DOP, and H was consistent with the expected results for both thin and thick oil and oil-free regions. The correlation between the thick, thin, and oil-free regions was calculated based on an Area-based Classification-by-Histogram (ACH). The results for H (August 13) show a strong negative correlation between thick oil/oil free, a small positive correlation between thin oil/oil-free, and a negative correlation between thick/thin oil. The results of the CP discriminators were consistent with theoretical expectations, with H providing the best overall performance. The results of the CP discriminators were consistent with theoretical expectations, with H providing the best overall performance. The results suggest that CP data is a viable option for the estimation of relative oil thickness.

4:00pm - 4:15pm

Automatic detection of eelgrass (Zostera marina) from multispectral satellite data along Canada’s Pacific coast to support conservation and restoration efforts

Anne Webster¹, Weigang Tang¹, Maycira Costa², Nic Dedeluk³, Olivier Tsui¹

¹Hatfield Consultants LLP, 200-850 Harbourside Dr, North Vancouver, Canada V7P 0A3; ²Spectral Lab, Geography, University of Victoria, Victoria, Canada; ³‘Namgis First Nation, 49 Atli St, Alert Bay, Canada

Eelgrass (Zostera marina) is the primary native seagrass species in intertidal areas across North America and plays an important role in marine ecosystems. Current eelgrass mapping is primarily limited to localized areas using various field and remotely piloted aerial systems (RPAS) methods, resulting in limited coverage and update frequency. To support more frequent, wide area monitoring of eelgrass along Canada’s Pacific coast, we are developing Eelgrass Explorer (E2), an automated system to provide eelgrass distribution maps across British Columbia’s (BC) intertidal zones from either Sentinel-2 or Planet SuperDove multispectral data. The deep learning approach central to the system is based on a DenseNet architecture developed for seagrass detection elsewhere in the world, modified for BC conditions. Our proof of concept used training data across 6 sites along the BC coast and obtained 95% accuracy for test points within training sites, a 12% percent improvement over a Random Forest approach using the same data. Future work will include more rigorous validation in new sites, refining the model for even better generalization, and incorporating it into an automated processing pipeline. The resulting 10-meter eelgrass extent maps across BC’s intertidal zone will be made openly available to the research community.

4:15pm - 4:30pm

Autonomous Driving in a GNSS-Denied Environment using Real-Time Sensor Fusion

Elena Liang, Xue-Fen Zhang, Jason Kun, Benjamin Brunson, Vi Huynh, Mohamed Mostafa

Trimble Applanix, Canada

Ensuring robust and precise navigation in GNSS-denied or degraded environments remains a core challenge for autonomous systems. The demand for precise, real-time positioning is critical across various applications, including fleet management, automotive, rail, pavement, and airport safety, particularly within GNSS-limited operational settings. This paper presents a novel approach to integrating Visual Odometry (VO) and Map-Based Localization (MBL) as external aiding sources for inertially-aided navigation. This integrated solution is specifically designed for land mobile mapping applications and leverages a high-precision inertially-aided GNSS solution inherent to the mobile mapping system.

This paper is structured as follows:

• Overview of VO and MBL Techniques: A detailed review of the theoretical principles underpinning the Visual Odometry (VO) and Map-Based Localization (MBL) techniques.

• Real-Time Deployment Strategies: Examination of the specific strategies required for real-time operational deployment, including handling delayed measurements, managing out-of-sequence updates, and implementing dynamic uncertainty adaptation.

• Kalman Filter Framework Design: Development of the Kalman filter framework to accommodate the delta pose data (derived from VO) and absolute pose data (derived from MBL) as distinct aiding sources. This includes modelling specific measurement errors and introducing dedicated state components.

• Theoretical and Practical Accuracy Analysis: Evaluation of the integrated system's effectiveness through a rigorous theoretical and practical accuracy analysis under a wide range of operational conditions, including the quantification of positioning performance enhancement when utilizing low-cost IMUs.

4:30pm - 4:45pm

Integrated Multi-Sensor Data Fusion from Land, Air, and Marine Platforms for Enhanced Geospatial Mapping

Michael Koterba¹, Mohamed Mostafa²

¹MJ Engineering, Architecture, Landscape Architecture, and Land Surveying, P.C, 21 Corporate Drive, Clifton Park, NY, USA 12065; ²Trimble Applanix, 85 Leek Cr., Richmond Hill, Ontario, Canada L4B 3B3

Over the last three decades, advancements in sensor and positioning technology have fundamentally transformed geospatial data acquisition, processing, and quality control, enabling surveyors and professionals to collect, interact with, and produce mapping products with unprecedented accuracy and resolution. Sensor Fusion concepts started at the academic level in the early 1990s (c.f., Schwarz et al., 1993; El-Sheimy, 1996; Mostafa and Schwarz, 1997; Ip et al., 2007; Ravi et al., 2018).

The fusion of LiDAR and photogrammetric sensors paired with GNSS, and inertial positioning systems has effectively supplanted many traditional mapping methods that relied heavily on high-accuracy positioning combined with significant data interpolation (c.f., Scherzinger et al., 2018)

Today, geospatial data acquisition is increasingly performed simultaneously using land mobile mapping systems, UAVs, and marine vessels all equipped with multiple LiDARs and diverse imaging sensors (e.g., panoramic, RGB, NIR, thermal, etc.), rapidly becoming the industry standard. These multi-stream datasets are now typically integrated and optimized within a post-processing environment. This paper will highlight the technology and workflows surrounding these synergistic systems, demonstrating how their fusion is yielding an unprecedented level of speed and quality hitherto unseen in the industry.

4:45pm - 5:00pm

From Satellites to Grain Elevators: using NDVI-based Indices to reduce Price Discovery Gaps in non-Futures Prairie Crop Markets

Samuel Scott

Independent, Canada

This contribution examines whether satellite derived crop condition signals can be translated into a practical market indicator for Prairie crops that do not trade on futures exchanges. In Canada, remote sensing programs such as the Crop Condition Assessment Program already provide in season crop monitoring and support official yield and production estimation. This study builds on that foundation, but asks a different question: how crop condition information is incorporated into prices in decentralized cash markets for non futures crops such as peas, lentils, and mustard.

Using Canada’s operational AVHRR and MODIS NDVI archives, the study outlines a simple method for aggregating weekly NDVI composites to key producing regions, deriving seasonal anomalies and phenological measures, and combining them into a normalized regional index for each week of the growing season. The purpose of this index is not to replace official crop condition or yield models, but to provide a transparent and interpretable signal that can be examined alongside observed cash market pricing behavior.

The empirical focus is on market linkage rather than agronomic prediction alone. Specifically, the study compares the relationship between the NDVI based index and weekly changes in benchmark futures prices with its relationship to posted bids for selected non futures crops. The working hypothesis is that crop condition information is incorporated relatively quickly into futures linked markets, while non futures cash bids respond more slowly and less directly. If confirmed, the index could serve as a public benchmark for price discovery in thin and fragmented specialty crop markets.

5:00pm - 5:15pm

Simultaneous LiDAR & Trajectory Data Optimization for Mobile Mapping Systems in GNSS-Denied Environments

Mohamed Mostafa, Vi Huynh

Trimble Applanix, Canada

Accurate mobile mapping, a critical requirement for various applications, is frequently compromised in GNSS-denied environments, resulting in degraded final mapping products. This research investigates the efficacy of simultaneous optimization of mobile mapping system data, specifically encompassing the trajectory, system calibration, and LiDAR point cloud. The study explores the integration of inertially-aided GNSS data with LiDAR data to mitigate trajectory and point cloud errors and refine installation parameter calibration during GNSS outages. Utilizing datasets acquired with a Mobile Mapping System in a suburban setting in Richmond Hill, Ontario, Canada, the performance of this integrated approach was rigorously evaluated. The results demonstrate the capability of Simultaneous LiDAR & Trajectory Data Optimization to effectively and concurrently compensate for diverse error sources using LiDAR data, GNSS/Inertial measurements, and calibration parameters. This highlights the significant potential for achieving enhanced data accuracy in challenging land mobile mapping scenarios where GNSS availability is limited.

3:30pm - 5:15pm

Forum2C: The Future of Space- based Earth Observation
Location: 716B

3:30pm - 5:15pm

Forum7B: Entrepreneurship in the Industry 4.0 Geospatial Landscape
Location: 717A

3:30pm - 5:30pm

InS4: Industry Tech Session
Location: 717B

3:30pm - 5:30pm

P2: Poster Session 2
Location: Exhibition Hall "E"

Refractive Effects of Planar Protective Layers in Stereo Photogrammetry and Their Correction

Zhaoquan Liu¹, Binbin Xu², Wenxing Xu³, Shigang Liu⁴, Yongfeng Ma⁵, Guanqing Li⁶

¹CCCC First Harbor Engineering Company Ltd., 300461 Tianjin, China – liuzhaoquan@ccccltd.cn; ²No.3 Engineering Company Ltd. of CCCC First Harbor Engineering Company, 116011 Dalian, China; CCCC First Harbor Engineering Company Ltd., 300461 Tianjin, China; Key Laboratory of Geotechnical Engineering, CCCC, 300461 Tianjin, China; Key Laboratory of Port Geotechnical Engineering, Ministry of Transport, PRC, 300461 Tianjin, China; Key Laboratory of Port Geotechnical Engineering of Tianjin, Tianjin 300461, China – 2016046927@ccccltd.cn; ³No.3 Engineering Company Ltd. of CCCC First Harbor Engineering Company, 116011 Dalian, China – xuwenxing1@ccccltd.cn; ⁴No.3 Engineering Company Ltd. of CCCC First Harbor Engineering Company, 116011 Dalian, China – liushigang1@ccccltd.cn; ⁵No.3 Engineering Company Ltd. of CCCC First Harbor Engineering Company, 116011 Dalian, China – mayongfeng1@ccccltd.cn; ⁶School of Environment and Spatial Informatics, China University of Mining and Technology, 221116 Xuzhou, China – guanqing.li@cumt.edu.cn

This study addresses the impact of planar protective layers on stereo photogrammetry and introduces a rigorous refractive correction model based on multi-interface ray tracing. Conventional stereo reconstruction assumes a single viewpoint, but planar layers introduce refraction at two interfaces, causing systematic depth-dominated errors. Through simulations and field experiments using an Intel RealSense D455, the study evaluates the influence of target distance, layer thickness, orientation, and layer-to-camera spacing. Simulations with multiple target planes show that conventional stereo produces significant errors—up to several millimeters in depth—even for thin layers, while the refractive model consistently reconstructs points with sub-millimeter accuracy. Layer distance from the camera has negligible effect on the error magnitude, whereas tilts and thicknesses of the layer strongly influence the bias. Field experiments with a 10-mm acrylic plate confirm these findings: conventional reconstruction exhibits systematic lateral and depth errors, whereas the refractive model eliminates bias, achieving near-zero mean errors. The results highlight that even minimal protective layers induce measurable errors if refraction is ignored, emphasizing the necessity of refractive correction in high-precision applications. The study demonstrates that explicitly modeling refraction in stereo photogrammetry significantly improves reconstruction accuracy and robustness. Overall, this work provides a practical framework for accurate 3D measurement in hazardous environments where imaging through protective layers is unavoidable.

Augmenting City Models with Handheld LiDAR and 3D Gaussian Splatting for Inclusive Pedestrian Infrastructure Assessment

Deni Suwardhi^1,2, Wahyunan Andika², Ratri Widyastuti¹, Widiatmoko Azis Fadilah³, Arnadi Murtiyoso³, Pierre Grusennmeyer³, Fabio Remondino⁴, Farhan Helmy⁵

¹Spatial System and Cadastral Research Group, Institut Teknologi Bandung (ITB), Indonesia; ²PT Inovasi Mandiri Pratama, Spatial Information Company, Indonesia; ³Université de Strasbourg, CNRS, INSA Strasbourg, ICube Laboratory UMR 7357, Photogrammetry and Geomatics Group, 67000, Strasbourg, France; ⁴3D Optical Metrology (3DOM) Unit, Bruno Kessler Foundation (FBK), Trento, Italy; ⁵Advanced System Computing, Design and Innovation (ASCODI) Laboratory, Indonesia

Urban digital twins increasingly require pedestrian-scale three-dimensional (3D) representations to support accessibility and inclusiveness assessment. However, existing approaches typically emphasize either geometric accuracy or visual realism, while lacking an integrated framework for analysing pedestrian-level conditions. This study proposes a hybrid workflow integrating handheld LiDAR and 3D Gaussian Splatting (3DGS) within a CityGML-based semantic framework for accessibility assessment. Handheld LiDAR provides centimetre-level geometric measurements, enabling the extraction of key indicators such as slope, surface roughness, and obstacle presence. In parallel, 3DGS reconstruction from 360° video imagery enhance visual realism and perceptual understanding. Both datasets are co-registered and structured within the CityGML 3.0 Transportation model to represent pedestrian environments in a unified spatial and semantic framework. Accessibility assessment was conducted using three approaches: LiDAR-based analysis, field survey observations, and immersive evaluation in a Virtual Reality (VR) environment. The LiDAR-based results were used as a reference. Comparative analysis shows the field survey assessment achieves an agreement of approximately 85.7%, while VR-based assessment reaches approximately 75.4%. The results indicate that while VR does not replace metric-based analysis, it enables perception-driven and participatory evaluation. In particular, VR-based assessment shows potential to involve users, including people with disabilities, in accessibility evaluation through immersive and remote interaction. The proposed approach contributes to the development of human-scale urban digital twins by integrating metric accuracy, semantic structure, and participatory evaluation for more inclusive accessibility analysis

AI-driven extraction of road geometry and asset inventory from mobile LiDAR point clouds

Divya Priya Balasubramani, Zaffar Sadiq Mohamed-Ghouse, Sanjay Khanna D, Ravichandran N, Muthu Kumara Samy S

Institute of Remote Sensing, Department of Civil Engineering, College of Engineering Guindy, Anna University Chennai, India

Rapid urbanization and rising traffic demand are placing significant pressure on transportation infrastructure, necessitating more efficient and accurate approaches to road design auditing and asset management. Traditional survey methods are labor-intensive, time-consuming, and lack comprehensive three-dimensional context. This study presents an end-to-end framework integrating Mobile Light Detection and Ranging (LiDAR) with Artificial Intelligence (AI) for automated extraction of road geometric parameters and asset inventory. Mobile LiDAR data were collected along an urban corridor in Bengaluru, India, and preprocessed using Trimble Business Center. Preprocessing involved statistical outlier removal and progressive morphological ground segmentation. A deep learning model based on the PointNet++ architecture with hierarchical set abstraction layers was developed to classify point cloud data into five categories: road, pole, vehicle, tree, and building. The dataset comprised approximately 45 million points, with 10% manually annotated for training. The trained model enabled large-scale semantic segmentation, achieving a mean Intersection-over-Union (mIoU) of 0.86 and an overall accuracy of 92.4%. Using the classified outputs, key road design parameters—including lane width (8.099 m), road segment length (44.383 m), zebra crossing width (7.336 m), and pole height (7.890 m)—were accurately derived. The proposed workflow reduced manual processing time by approximately 85% (from 40 hours to 6 hours per km) while enhancing measurement consistency and scalability. The results highlight the effectiveness of integrating mobile LiDAR and AI for high-accuracy, data-driven infrastructure assessment, offering a scalable solution for improved planning and management of urban transportation systems.

Rigorous Projection for Image Stitching: a 3D-Informed Approach for Accurate Panoramic Photogrammetry

Riccardo Roncella¹, Luca Perfetti²

¹University of Parma, Department of Engineering and Architecture, 43124, Parma, Italy; ²University of Brescia, Department of Civil Engineering, Architecture, Territory, Environment and Mathematics, 25123, Brescia, Italy

Panoramic image stitching traditionally relies on the assumption that all input images share a single projection centre, a condition rarely satisfied by modern multi-camera rigs composed of multiple fisheye sensors mounted with non-negligible baselines. In confined or close-range environments, these geometric discrepancies introduce significant parallax, limiting the reliability of both classical and “parallax-tolerant’’ stitching techniques based on local warping. Although such methods are simple and efficient, they cannot account for the true camera geometry and therefore degrade the metric quality of the final panorama. At the same time, recent photogrammetric software has begun to accept panoramic imagery directly, yet literature demonstrates that optimal accuracy is still obtained when processing raw multi-camera.

This work presents a new 3D-informed approach for generating panoramic images that fully respects the underlying geometry of the acquisition system. Assuming the availability of a 3D model, derived either from photogrammetric reconstruction or from an external sensor such as LiDAR, the method reprojects each pixel of the desired panorama onto the original multi-camera frames using collinearity equations, mirroring the workflow of precision orthophoto generation. This allows the production of parallax-free panoramas with consistent geometric fidelity even in challenging scenarios.

The method is evaluated on several case studies using both compact panoramic cameras and multi-camera systems with larger baselines. Results demonstrate improvements in stitching accuracy, SfM orientation quality, and final 3D reconstruction, including robustness to varying scene complexity and supporting 3D-model resolution.

Extrusion Segmentation Strategy to improve CAD Reconstruction from Point Cloud

Said Harb, Mehdi Maboudi, Markus Gerke

Technische Universität Braunschweig; Institute of Geodesy and Photogrammetry, Germany

Recovering editable CAD models from point cloud scans is a key challenge in reverse engineering and quality control, where the ability to reconstruct the original modeling history of a physical object enables precise deviation analysis and systematic process optimization. While deep learning has driven significant progress in this area, existing models struggle to generalize to complex CAD models, which feature multiple extrusions and intricate geometric structures.

This paper presents an end-to-end deep learning pipeline that reconstructs CAD models from point clouds as structured CAD sequences, which are series of sketch-and-extrude operations that encode the full modeling history. The model demonstrates high-fidelity reconstruction for non-complex objects, including primitive shapes such as cubes and cylinders, as well as their assemblies.

To address the performance gap on complex shapes, we introduce an extrusion-based segmentation strategy that decomposes CAD models into their constituent extrusions. These partial shapes are incorporated into the training set, increasing data diversity without requiring new data collection. The resulting primitive models feature partially occluded point clouds, surfaces hidden in the original assembly are absent, which forces the model to infer missing regions and learn richer point cloud representations. This increases the complexity of the reconstruction problem and thereby improves generalization.

The strategy is model-agnostic and can be applied to any deep learning approach that reconstructs CAD sequences, making it a broadly applicable tool for the community.

Controlled Multi-source Mapping of Lunar South Polar Regions via Combined Bundle Adjustment

Qionghua You¹, Zhen Ye^1,2, Yusheng Xu^1,2, Rong Huang^1,2, Huan Xie^1,2, Xiaohua Tong^1,2

¹College of Surveying and Geoinformatics, Tongji University, Shanghai, China; ²The Shanghai Key Laboratory of Space Mapping and Remote Sensing for Planetary Exploration, Shanghai, China

Integration of LROC NAC and ShadowCam imagery is essential for meter-scale controlled mapping of the entire lunar south pole including Permanently Shadowed Regions (PSRs), but remains challenging due to extreme radiometric differences, sparse overlap across illumination boundaries, and ill-conditioned bundle adjustment networks. This paper proposes a LOLA DEM-mediated multi-source bundle adjustment framework for controlled lunar polar mapping. A hierarchical cross-modality matching strategy is developed using first- and second-order Gaussian steerable gradient features with multi-scale fusion and phase-correlation-based subpixel refinement. Sensor-specific geometric models are established using second-order polynomial transformations for NAC orthoimages and rational polynomial models for ShadowCam map-projected images. Five types of geometric constraints are formulated to integrate intra-sensor, limited cross-sensor, and image-to-DEM observations, with the LOLA DEM acting as a common geometric mediator. To stabilize the heterogeneous network, a hybrid L1-L2 regularization model with adaptive two-stage weighting is optimized using ADMM algorithm. Experiments in the lunar south polar region demonstrate substantial improvements on intra-sensor, cross-sensor, and image-to-reference positioning accuracy. The final seamless 1 m/pixel orthorectified mosaics achieve approximately 5 m absolute accuracy, validating the proposed framework for geometrically unifying illuminated and permanently shadowed terrain in lunar polar controlled mapping.

Automated and Comprehensive Quality Assessment of Nationwide Aerial LiDAR Data: Insights from the LiDAR-ITA Project

Vittorio Casella, Marica Franzini, Davide Lodigiani

University of Pavia, Italy

National LiDAR programs are increasingly adopted worldwide to support land management, infrastructure planning, and environmental monitoring. Following the examples of large-scale initiatives in the United States and Europe, Italy launched its first nationwide LiDAR survey in July 2025 within the Integrated Monitoring System (SIM) project funded by the National Recovery and Resilience Plan (PNRR). This effort represents the most extensive airborne LiDAR campaign ever conducted in the country, covering over 302,000 km², including coastal zones and major islands. The acquisition plan is designed to ensure a minimum point density of 10 points/m² and produce high-resolution DTMs and DSMs at a 0.25 m grid spacing.

Given the unprecedented spatial and data volume, a robust, standardised, and fully automated quality assurance framework is essential. This paper presents the methodology used to evaluate geometric consistency and spatial accuracy across the national dataset. Congruence between overlapping flight strips is assessed by automatically extracting 100 × 100 m patches at regular intervals and computing point-to-point distances and cross-section profiles to detect horizontal and vertical discrepancies. Plano-altimetric accuracy is further evaluated through comparisons with terrestrial laser scanning (TLS) data collected in dedicated control areas, where robust plane fitting enables rigorous three-dimensional error estimation.

Results from two control areas acquired with different sensors demonstrate the effectiveness, scalability, and reproducibility of the proposed automated workflows. The presented approach provides a reliable foundation for delivering high-precision national LiDAR products and offers a framework applicable to future large-scale geospatial acquisition programs.

Synergy of photogrammetric and ULS data for forestry application through the fusion of bundle adjustment and ICP algorithms

Łukasz Wilk¹, Magdalena Pilarska-Mazurek¹, Wojciech Ostrowski^1,2

¹Warsaw University of Technology, Faculty of Geodesy and Cartography, Department of Photogrammetry, Remote Sensing and Spatial Information Systems, Warsaw, Poland; ²Jagiellonian University, Institute of Archaeology, Krakow, Poland

The study explores a workflow for integrating photogrammetric image blocks with LiDAR point clouds acquired via Unmanned Laser Scanning (ULS) in forestry applications. Hybrid datasets combining UAV imagery and LiDAR data are increasingly used for 3D mapping, yet discrepancies often arise due to independent orientation processes and systematic errors. Traditional solutions rely on numerous ground control points (GCPs), which can be impractical in dense forest environments. To address this, the proposed method fuses Bundle Adjustment and Iterative Closest Point (ICP) algorithms in a joint optimization process, aligning multispectral images with ULS point clouds without additional observations or GCPs. The workflow includes a GPU-accelerated filtering step to extract representative canopy points, reducing computational load and improving correspondence selection. Implemented using Python and C++ extensions, the system leverages the Ceres Solver for non-linear optimization, minimizing reprojection, GNSS, IMU, and point-to-cloud errors iteratively. Tests conducted in Żednia Forest District, Poland, during leaf-on and leaf-off seasons demonstrated significant improvements in alignment accuracy: average horizontal errors decreased by over 50%, and maximum offsets were reduced by more than 1 meter. These results confirm that the proposed hybrid adjustment substantially enhances geometric consistency between photogrammetric and LiDAR datasets, offering a cost-effective solution for forestry mapping and monitoring.

Integrating High‑Fidelity 3D Documentation into Immersive Learning: A VR Serious Game for the Holy Aedicule

Margarita Skamantzari¹, Ioannis Georgoulas¹, Ioannis Rallis¹, Antonia Moropoulou², Anastasios Doulamis¹, Andreas Georgopoulos¹

¹Lab of Photogrammetry, School of Rural, Surveying & Geoinformatics Engineering, National Technical University of Athens– Athens, Greece; ²School of Chemical Engineering, National Technical University of Athens– Athens, Greece

This paper introduces an innovative Virtual Reality (VR) serious game designed to enhance immersive learning in cultural heritage education. The game offers an interactive exploration of the Holy Aedicule in Jerusalem, one of the most sacred monuments of Christianity, based on high-resolution 3D documentation captured before, during, and after its rehabilitation.

By integrating photogrammetric data, textured 3D models, and historical research, the application allows users to navigate the monument virtually, engage with embedded educational content, and participate in interactive learning scenarios. Structured as a multi-phase experience, including virtual tours, a digital classroom, and a quiz mode, the serious game aims to promote transdisciplinary knowledge transfer in a user-friendly, entertaining format.

This contribution outlines the game’s methodological framework, educational objectives, development pipeline, and user evaluation results, highlighting its role in redefining how cultural heritage can be communicated through immersive digital tools. Additionally, it addresses the broader challenge of translating complex heritage documentation into accessible and meaningful experiences for learners, researchers, and the wider audience.

GNSS–Camera Systems for Heritage Documentation. Accuracy assessment of measurements of inaccessible points and preliminary tests in photogrammetric applications.

Lorenzo Teppati Losè, Filiberto Chiabrando, Fabio Giulio Tonolo

LabG4CH, Department of Architecture and Design (DAD) - Politecnico di Torino, Viale Mattioli 39, 10125 Torino (Italy)

The contribution investigates the possibility of using a GNSS receiver equipped with a camera for documenting built heritage. In particular, the possibility of measuring GCPs on vertical surfaces thanks to the combination of satellite observations and digital photogrammetric algorithms will be analysed and metrically validated. Moreover, the use of the acquired images in SfM approaches will be tested and discussed.

Generating Synthetic Image Data with Blender to Address Data Scarcity in Military Applications: Leveraging the RF-DETR Model

Julian Cornel Berndt, Tobias Frisenborg Christensen, Lars Würtz Jochumsen

Systematic A/S, Denmark

Military vehicle recognition faces critical data scarcity due to operational security constraints and prohibitive collection costs.

Classification of vehicles demands extensive training data rarely available in defence contexts. We propose a hybrid approach

combining limited real-world data with scalable synthetic generation. Our methodology comprises: (1) a Blender-based pipeline

generating high-resolution synthetic images with domain randomization across 3D models, lighting, and camera angles; (2) training

transformer-based RF-DETR detectors on real-world and synthetic data, respectively; (3) an in-depth evaluation of the trained

networks to determine the effect of synthetic data. Our approach utilizes a baseline RF-DETR detector trained on real-world

imagery to compare against. Then we utilize the custom-made synthetic data generation pipeline to create an equally large synthetic

dataset. This generated data is added to real data subsets, thus creating a mixed datasets containing varying percentages of real data.

We created five datasets containing 5%, 10%, 25%, 50%, and 100%, respectively. With these new mixed datasets we train another

set of RF-DETR detectors. Afterwards we evaluate the influence of the synthetic data by comparing the detectors across computer

vision metrics.

GDC: Geometric diffusion consistency for weather-robust 3D point cloud segmentation

Jing Du¹, John Zelek¹, Michael A. Chapman², Jonathan Li³

¹Department of Systems Design Engineering, University of Waterloo,; ²Department of Civil Engineering, Toronto Metropolitan University; ³Department of Geography and Environmental Management, University of Waterloo

Semantic segmentation of outdoor 3D point clouds degrades significantly under adverse weather, as rain, fog, and snow corrupt the geometric structure of LiDAR returns through backscatter insertion, range-dependent attenuation, and volumetric scattering. Existing domain generalization methods constrain feature values directly, which becomes less effective when weather-induced perturbations alter the local neighborhood topology that underlies feature aggregation. This work proposes Geometric Diffusion Consistency (GDC), a training-time regularizer that enforces consistent feature propagation behavior across geometrically divergent views of the same point cloud. A dual-view augmentation pipeline generates training pairs through weak and strong perturbations, where the strong branch incorporates dual-mode atmospheric extinction modeling, semantic-aware geometric corruption, and weather-coordinated structural perturbation. A lightweight learnable diffusion operator, implemented via sparse convolutions with a gated residual connection, propagates encoder bottleneck features through local voxel neighborhoods. The consistency loss aligns diffused representations at corresponding points across views, preserving topological relationships essential for dense prediction while allowing feature values to adapt to altered geometry. On the SemanticKITTI to SemanticSTF domain generalization benchmark, GDC achieves 38.6% mIoU, exceeding the previous best method by 3.8%, with consistent improvements across dense fog, light fog, rain, and snow conditions.

Integrated workflow for 3D documentation and spatial analysis of Jewish sepulchral heritage – Project "Stone Witnesses Digital: Space, Form, Inscription".

Lea Puglisi, Michael Groh, Patrizia Hanika, Mona Hess

Digital Technologies in Heritage Conservation, Institute of Archaeology, Heritage Conservation Studies and Art History/ Centre for Heritage Conservation Studies and Technologies (KDWT), University of Bamberg

The project 'Stone Witnesses Digital' ensures the exemplary documentation of a selected number of German Jewish graveyards. This paper presents the results from the first years of the project’s geomatics work, including the development of an integrated multi-sensor workflow for 3D imaging—ranging from geographic-scale documentation of entire graveyards (1:200 scale) to detailed feature imaging of individual gravestones (1:20 scale). The workflow supports the long-term research project on Jewish sepulchral culture "Stone Witnesses Digital".The project brings together expertise from Jewish Studies, Digital Technologies in Heritage Conservation, and Historic Building Research.

The overarching scope is to document the location and context of gravestones, their materiality, decorative elements, inscriptions, and the meanings embedded within them—summarized under the guiding concept 'Space, Form, Inscription.' The aim of the project is to create a comprehensive digital dataset that documents inscriptions as well as the spatial and structural characteristics of gravestones, thereby ensuring their long-term preservation and making them accessible for further academic research.

To achieve this, the work-flow must integrate various sensing and 3D imaging techniques, ensure reliable and sustainable data storage, and support reproducible dataset creation for spatio-temporal analyses and long-term monitoring of grave-yards throughout the 24-year project period. It also enables the combination of advanced sensing technologies with semantic web standards and facilitates the creation of informative Open Access outputs compliant with FAIR data principles.

3d Reconstruction of reindeer antlers using a low-cost optical camera system and gaussian splatting

Julian Robert Stevenson Cramb¹, Derek Lichti¹, John Matyas¹, Shabnam Jabari²

¹University of Calgary, Canada; ²University of New Brunswick, Canada

The research presented in this abstract is a novel, low-cost pipeline for the semi-automated 3D reconstruction of reindeer antlers using an optical camera array and Gaussian Splatting (GS). Traditional antler measurement methods are manual, invasive and prone to errors, while existing 3D scanning techniques struggle with subject motion. Photogrammetric bundle adjustment derived point clouds require well defined points which are generally lacking on antlers. To overcome this a system of 16 synchronized Raspberry Pi cameras was used to capture instantaneous imagery within an animal enclosure. A sparse point cloud along with the oriented network of imagery from a bundle adjustment is fed into a GS algorithm, producing an optimized reconstruction of the scene.

The system was initially validated in a controlled lab environment against a terrestrial laser scanner ground truth point cloud. A sub-centimeter accuracy with mean cloud-to-cloud distance of 4.0mm was achieved. Preliminary live-animal testing demonstrates the systems ability to produce a qualitatively accurate reconstruction under various lighting conditions. This method establishes a non-invasive method for high quality 3D reconstructions of complex reindeer antlers, which has applications in wildlife biology, environmental monitoring and biomechanics. Further work will involve rigorous network and camera calibration along with a comprehensive analysis of live-animal data.

A semi-automated pipeline for extracting architectural plans from 3D LiDAR data of ancient heritage sites

Marianna Bartrick-Krana, Roberto de Lima, Aziliz Vandesande, Maarten Bassier

KU Leuven, Belgium

Automatically generating architectural plans from archaeological sites poses a persistent challenge, particularly when dealing with ancient structures that have experienced severe deterioration. Many heritage contexts—especially those involving rock-cut monuments—present highly irregular geometries, collapsed features, eroded walls, and surfaces obscured by sediment or plaster detachment. These conditions make the extraction of reliable 2D plans or cross-sections from 3D data exceptionally difficult using conventional modeling tools.

In this study, we propose a semi-automated processing workflow tailored to the architectural characteristics of the Sheikh Said tombs. The pipeline converts 3D LiDAR datasets into structured 2D plans and vertical cross-sections, with particular emphasis on documenting deep, narrow shafts and multi-chambered tomb layouts.

Spherical Vision meets 3D Semantics: towards efficient LOD3 Model Generation for Smart Cities

Mohammad Saadatseresht^1,2, Hossein Arefi², Qazaleh Askari¹

¹School of Surveying and Geospatial Engineering, University of Tehran, Tehran, Iran; ²i3mainz - Institute for Spatial Information and Surveying Technology, Mainz University of Applied Sciences, Mainz, Germany

The generation of Level of Detail 3 (LoD3) building models is essential for applications such as urban digital twins, energy analysis, and smart city planning. However, conventional approaches based on terrestrial LiDAR or UAV photogrammetry remain costly, labor-intensive, and difficult to scale. This paper presents a scalable framework for transforming LoD1 building models into LoD3 façade representations using openly available urban data, including OpenStreetMap footprints, street-level spherical imagery, and weak point-cloud priors. The proposed method formulates the reconstruction problem as a facet-based modeling task, where each façade is processed independently in a local coordinate system derived from LoD1 geometry. A rectification strategy is introduced to generate fronto-parallel façade images directly from spherical panoramas, avoiding perspective distortions and facilitating image analysis. To address the challenges of unstructured data acquisition, a visibility-driven view selection scheme and a multi-view fusion framework are developed to construct robust façade evidence maps. The 3D geometry is estimated as a depth field through a multi-resolution optimization framework integrating ray consistency, appearance cues, point-cloud support, and structural regularization. Planar segmentation, polygonization, and geometric regularization are subsequently applied to derive structured façade elements. Openings such as windows and doors are detected using combined geometric and image-based evidence and further refined through architectural constraints. Experimental results demonstrate that the proposed framework enables reliable reconstruction of façade geometry and structural details using only open and low-cost data sources, providing a practical pathway for large-scale LoD3 generation in real urban environments.

LiDAR Point Cloud Oversegmentation via SAM-based Knowledge Distillation

Dening Lu¹, Michael Chapman², Jonathan Li^1,3

¹Department of Systems Design Engineering, University of Waterloo; ²Department of Civil Engineering, Toronto Metropolitan University; ³Department of Geography and Environmental Management, University of Waterloo

Large-scale LiDAR point clouds provide rich geometric information, yet learning effective structural representations remains challenging due to the misalignment between semantic categories and geometric structures. To address this issue, we propose a SAM-guided framework for point cloud oversegmentation. We transfer grouping knowledge from 2D vision by constructing a large-scale oversegmentation dataset using the Segment Anything Model (SAM) on bird’s-eye-view projections.

Based on these grouping priors, a structure-aware point cloud encoder is learned via a distillation objective that enforces intra-region compactness and inter-region separation in the embedding space. The proposed approach does not rely on semantic supervision and directly learns generalizable structural representations.

Experiments on various benchmark datasets (STPLS3D, Toronto-3D, DALES, and S3DIS) demonstrate that the proposed method achieves competitive performance.

In particular, it significantly improves boundary recall (e.g., 92.21% on STPLS3D and 93.47% on Toronto-3D) while maintaining high oracle accuracy (up to 97.62%).

Moreover, the model generalizes well to unseen datasets without retraining, showing strong cross-dataset inference capability.

Shape Representation using Gaussian Process mixture models

Panagiotis Sapoutzoglou, George Terzakis, Georgios Floros, Maria Pateraki

National Technical University of Athens, Greece

In this work we propose an object-specific implicit representation: Functional modeling of surface geometry using Gaussian Processes (GPs). n contrast to neural models, our method leverages the ability of GPs to model continuous functions from irregularly sparse sampled data and apply this concept in the context of a probabilistic model that learns the shape of an object as the mixture of multiple directional distance fields anchored at reference points specially placed in the object’s skeletal outline. The resulting mixture model provides continuity, sparsity, and finer shape detail while avoiding the heavy training burden associated with deep implicit methods

A Deep Learning Model for Tree Species Classification Using Ground-Level RGB Imagery and Automated Annotations

Hristina Hristova, Clemens Blattert, Sunni K.P. Kushwaha, Janine Schweier

Swiss Federal Research Institute for Forest, Snow and Landscape Research WSL, Switzerland

Accurate tree species identification is essential for effective forest management, biodiversity monitoring, and resource estimation. While automated methods relying on aerial and canopy-level remote sensing have become prevalent, they often struggle in dense, multi-layered forest stands, where critical lower-stem and bark features are obscured. To address this limitation, we present a Deep Learning (DL) framework for tree species classification utilizing ground-level RGB imagery. Because manual annotation of terrestrial images in forest environments is labor-intensive and complicated by occlusions, we introduce a new "in-situ" forest image dataset alongside an automated labeling pipeline. This pipeline generates training annotations by projecting tree-species data derived from Mobile Laser Scanning (MLS) onto 2D images based on photogrammetric reconstruction. The proposed DL model leverages these automatically labeled images to effectively recognize tree species based on structural and bark characteristics. The model achieves overall F1-scores of 0.78 and 0.75 for object detection and instance segmentation, respectively. Ultimately, our approach complements existing methods for detecting tree positions and diameters, facilitating the creation of a holistic, cost-effective, and scalable forest inventory dataset.

Pattern recognition approaches for the detection of alteration and degradation phenomena in hyperspectral and UAV multispectral imagery: the case study of a historical masonry water bridge

Alessandra Spadaro, Francesca Matrone, Andrea Maria Lingua, Ramin Rashidi Alavijeh

Geomatics Lab, Department of Environment, Land and Infrastructure Engineering, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129 Turin, Italy

Historical masonry hydraulic infrastructures are affected by complex degradation processes, including vegetation growth, moisturerelated anomalies, and salt efflorescence, whose detection requires non-invasive, repeatable, and scalable diagnostic approaches. This study proposes a multi-scale workflow for the detection and classification of degradation phenomena affecting the Cavour Canal water bridge, a nineteenth-century masonry structure in northern Italy. The methodology integrates UAV-based multispectral orthophotos and close-range hyperspectral imagery within a common Object-Based Image Analysis (OBIA) framework. The multispectral workflow was designed for façade-scale screening, whereas the hyperspectral workflow was used to refine the interpretation of selected sectors through detailed spectral characterisation. Multiple supervised classifiers, including Support Vector Machine (SVM), k-Nearest Neighbours (kNN), Decision Tree (DT), Random Trees (RT), and Naïve Bayes (NB), were tested on both datasets. The results show that the multispectral workflow is effective for the identification of vegetation and broad water-related anomalies, with kNN providing the best overall performance, while the hyperspectral workflow improves the discrimination of subtle surface alterations, particularly efflorescence, with SVM yielding the most stable results across the tested configurations. Overall, the proposed methodology demonstrates the value of integrating multispectral and hyperspectral data within a hierarchical workflow for non-invasive degradation mapping of historical masonry infrastructures.

A Framework for Individual Tree Segmentation from Multi-Resolution LiDAR Data in Complex Tropical Forests

Hazem Hanafy¹, Sangyoon Park¹, Songlin Fei², Ayman Habib¹

¹Lyles School of Civil and Construction Engineering, Purdue University, West Lafayette, USA; ²Department of Forestry and Natural Resources, Purdue University, West Lafayette, USA

The increasing demand for accurate forest inventory in tropical ecosystems requires robust, scalable methods for individual tree segmentation. Tropical forests pose particular challenges due to dense understory, high species diversity, and complex multi-layered canopies, which often lead to tree under- and over-segmentation in LiDAR-based workflows. This study presents a general framework for individual tree segmentation from dense, multi-resolution LiDAR point clouds acquired by a Backpack LiDAR system over a 15-year-old palm stand in Belém, Brazil. After trajectory enhancement and mapping, an adaptive cloth simulation filter is used to derive a Digital Terrain Model and height-normalized points. Woody components are then isolated using Otsu-based intensity thresholding, eigenvalue-derived linearity, and statistical outlier removal. Trunk detection combines DBSCAN clustering on lower-stem points with a dual tree-localization strategy based on sum-of-elevation heat maps and RANSAC circle fitting. A segmentation quality-control module addresses over- and under-segmentation before reattaching canopy and foliage via voxel-based KD-tree retrieval to generate final per-tree segments. Compared with 3DFIN and TreeLearn using point cloud–derived reference tree locations, the proposed framework achieves a precision of 92.85%, recall of 95.97%, and F1-score of 94.38%, substantially outperforming 3DFIN (75.97%) and TreeLearn (15.14%). These results demonstrate the potential of the proposed framework to deliver reliable tree-level inventories in complex tropical forests.

Digital Preservation and Augmented Reality for Historical Surveying Instruments: A Photogrammetric Approach to Cultural Heritage Documentation

Clóvis Andrade, Juyara Bezerra, Simone Sato, Karoline Jamur

Universidade Federal de Pernambuco, Brazil

Historical surveying instruments embody centuries of innovation in cartography and engineering, serving as crucial scientific and pedagogical artifacts. Their fragility, risk of damage, and limited exhibition space restrict access and highlight the need for effective preservation strategies (Duester, 2023). Traditional conservation methods protect material integrity but do not address broader challenges related to accessibility and engagement. Digital technologies now offer transformative alternatives capable of creating accurate and interactive representations of these instruments (Farella et al., 2022).

This study proposes a low-cost, replicable digital preservation pipeline integrating close-range photogrammetry and augmented reality (AR). Photogrammetry provides a non-contact method for generating detailed 3D models using consumer-grade smartphones, democratizing access to advanced documentation techniques (Icardi et al., 2018; Förstner & Wrobel, 2016). AR enables users to interact with these digital surrogates in real environments, fostering deeper engagement and overcoming limitations imposed by fragile originals (Spallone, 2022; Gong et al., 2022).

Image acquisition was conducted with a Xiaomi Poco F5 Pro under controlled lighting, maintaining 30–60% overlap. Processing in Agisoft Metashape included alignment, dense cloud generation, mesh reconstruction, and texturing. Post-processing in Blender optimized the models for real-time visualization. Integration into AR was achieved using Unity and the Vuforia Engine SDK.

Results demonstrate high-fidelity 3D models that preserve fine details and offer immersive AR interaction. This pipeline provides durable digital records, enhances educational experiences, and expands public access. The approach aligns with ISPRS Working Group II/6 objectives and offers a scalable model for cultural heritage institutions seeking accessible and effective preservation strategies.

Synthetic Dataset Generation for Partially Observed Indoor Objects

Jelle Vermandere, Maarten Bassier, Maarten Vergauwen

KU Leuven, Belgium

Learning-based methods for 3D scene reconstruction and object completion require large datasets containing partial scans paired with complete ground-truth geometry. However, acquiring such datasets using real-world scanning systems is costly and time-consuming, particularly when accurate ground truth for occluded regions is required.

In this work, we present a virtual scanning framework implemented in Unity for generating realistic synthetic 3D scan datasets. The proposed system simulates the behaviour of real-world scanners using configurable parameters such as scan resolution, measurement range, and distance-dependent noise. Instead of directly sampling mesh surfaces, the framework performs ray-based scanning from virtual viewpoints, enabling realistic modelling of sensor visibility and occlusion effects. In addition, panoramic images captured at the scanner location are used to assign colours to the resulting point clouds.

To support scalable dataset creation, the scanner is integrated with a procedural indoor scene generation pipeline that automatically produces diverse room layouts and furniture arrangements. Using this system, we introduce the V-Scan dataset, which contains synthetic indoor scans together with object-level partial point clouds, voxel-based occlusion grids, and complete ground-truth geometry. The resulting dataset provides valuable supervision for training and evaluating learning-based methods for scene reconstruction and object completion.

Automatic Segmentation of 3D Gaussian Splatting for Urban Cultural Heritage Sites

Widiatmoko Azis Fadilah, Virgile Gauthier, Arnadi Murtiyoso, Tania Landes, Pierre Grussenmeyer

Université de Strasbourg, CNRS, INSA Strasbourg, ICube Laboratory UMR 7357, Photogrammetry and Geomatics Group, 67000, Strasbourg, France

3D Gaussian Splatting (3DGS) has emerged as a promising method for photorealistic scene reconstructions, yet its application to semantic segmentation in real-world heritage documentation remains underexplored. This study proposes and evaluates an automated semantic 3DGS segmentation pipeline integrating the Segment Anything Model 3 (SAM 3) with per-class prompting for Gaussian reconstruction, applied to a nadiral UAV dataset of the Siti Inggil heritage complex in Cirebon, Indonesia. Segmentation performance of four semantic classes (ground, roofs, vegetations, and water bodies) were assessed against manually segmented 2D and 3D reference data, supplemented by geometric accuracy assessment via the M3C2 analysis. Results reveal both the promise and the inherent challenges of applying 3DGS segmentation to complex real-world heritage scenes, where acquisition geometry, surface characteristics, and foundational model limitations can be observed.

Collaborative Multimodal Drone-Based Remote Sensing for Levee Piping Detection

Tu Hu, Tongqi Wang, Shan Su, Changjun Chen, Haoxiang Liu

Wuhan University, China, People's Republic of

This paper addresses the critical challenge of early and accurate detection of piping, a major failure mode in levee systems. Traditional methods are limited, and even advanced techniques such as infrared thermography struggle to capture weak thermal anomaly signals under complex environmental interference. To overcome these limitations, we propose an innovative intelligent algorithm that achieves breakthroughs by synergistically integrating drone-based infrared imagery and point cloud data.

The methodology follows a rigorous two-stage pipeline. First, potential piping zones are coarsely extracted from thermal infrared images using an enhanced saliency detection model. This involves superpixel segmentation and multi-scale (global and local saliency) analysis to highlight temperature anomalies, followed by adaptive thresholding based on Gaussian distribution fitting for automatic segmentation. Second, a fine discrimination step is introduced, which integrates multimodal prior information from point clouds to significantly reduce false alarms. This is achieved by applying a series of physical constraints: area filtering, temperature variance filtering, terrain-based filtering, and overlap analysis between the infrared and point cloud data.

Validation with field data collected during the flood season demonstrates that this method achieves high-precision localization of piping zones. Its key advantage lies in its ability to effectively suppress false positives caused by environmental clutter while ensuring that the detection results align with physical principles. This study provides a practical and reliable technical solution for enhancing the safety inspection and early warning systems of levee structures.

An Open-Source Pipeline for Runtime-Optimized Heritage Photogrammetry in Game Engines

Arkoun Merchant¹, Adam Weigert¹, Chloe Dennis², Stephen Fai¹

¹Carleton Immersive Media Studios, 1125 Colonel By Dr, Ottawa, Canada; ²Bytown Museum, Ottawa, Canada

This paper presents Mesh2Tile, an open-source pipeline that converts photogrammetric meshes into runtime-optimized 3D Tiles for interactive visualization in game engines. Photogrammetry produces high-polygon meshes that remain difficult to deliver at scale

through interactive platforms. Cloud-based conversion services like Cesium Ion provide a path to the OGC 3D Tiles format but impose cost barriers and raise data sovereignty concerns for confidential heritage projects. Existing open-source converters rely on uniform spatial partitioning, export redundant textures with every tile, and offer limited control over LOD generation. Mesh2Tile leverages Blender's Python API to perform adaptive octree tiling driven by triangle density, per-tile texture baking that eliminates texture redundancy, and parallel processing to generate georeferenced 3D Tiles from OBJ meshes. The pipeline is validated through a case study of the Bytown Museum Commissariat Building on the Rideau Canal UNESCO World Heritage Site. It is processed at three scales from 900 thousand to 90 million triangles. Results demonstrate linear scaling of processing time, up to 62% file size reduction for larger models, and successful runtime streaming in Unreal Engine 5 through the Cesium for Unreal plugin at 120 FPS with comparable tile balance to Cesium Ion's commercial output. The pipeline enables institutions to maintain full control over sensitive heritage data while achieving performance suitable for interactive visualization.

Location determination of dynamic objects using a single CCTV with monocular depth estimation

JiHeon Jung¹, Junhee Youn², Jieun Kim¹, Junho Gong¹, Phillip Kim¹, Sunwoong Paik¹

¹1 Dept. of Future & Smart Construction Research, Korea Institute of Civil Engineering and Building Technology, 10223 Goyang-Si, Gyeonggi-Do, Republic of Korea; ²Corresponding Author : Dept. of Future & Smart Construction Research, Korea Institute of Civil Engineering and Building Technology

This contribution presents a method to determine ground coordinates of pedestrians from a single CCTV frame using monocular depth estimation and orthophoto-based ground control points. Urban crowd monitoring requires pedestrian location information, but many CCTV-based approaches rely on accurate camera calibration or multi-view configurations, which are often unavailable in real deployments. In this study, we exploit relative depth values from a monocular depth estimation model (Depth Anything V2) and ground control points jointly identifiable in both the CCTV frame and an orthophoto in EPSG:5186. For each frame, depth-based distance ratios between the pedestrian and ground control point pairs are used to construct Apollonius circles in the orthophoto plane, and the pedestrian position is estimated by a weighted least-squares adjustment of their intersections. The method is evaluated on 180 frames across two scenes from an urban testbed with camera–target distances within approximately 50 m, across three GCP placement scenarios. For the optimal configuration (Scenario A), a mean RMSE of 1.989 m was achieved, excluding frames in which GCPs were temporarily occluded by moving objects, demonstrating that single-frame CCTV imagery combined with an orthophoto can achieve an accuracy of approximately 2 m without any EOP/IOP information, which is practically useful for urban crowd monitoring and dynamic thematic mapping. The influence of GCP placement geometry and occlusion conditions on estimation accuracy is also analyzed

ML-MIFD: Multi-Level Multimodal Invariant Feature Descriptor

Zening Wang, Haoyu Guo, Yongxiang Yao, Yongjun Zhang, Peihao Wu, Yi Wan

School of Remote Sensing and Information Engineering, 430079, Wuhan, Hubei, China

With the rapid advancement of multi-sensor technology, cross-modal image matching has become a key research focus. However, significant challenges persist, primarily caused by differences in imaging mechanisms that lead to nonlinear radiation variations and feature heterogeneity.Coupled with complex geometric distortions, traditional feature description methods in matching struggle to directly or effectively represent common feature information across modalities, resulting in matching failures. Thus, effectively mitigating noise and radiation distortions to enable robust cross-modal matching remains an open and critical problem, compounded by the intrinsic difficulty of balancing descriptor parameters like patch size and histogram partitioning. To address the aforementioned issues, this paper proposes a novel Multi-Level Multimodal Invariant Feature Descriptor (ML-MIFD), designed to enhance resistance to nonlinear radiometric differences and multi-source noise while maintaining rotation invariance. The proposed algorithm consists of three stages: feature detection, ML-MIFD descriptor construction, and image matching.This paper conducts comparative experiments with various state-of-the-art methods using typical cross-modal image datasets. The results demonstrate that the ML-MIFD method exhibits significant advantages in both registration accuracy and matching stability.

Geomorphological Monitoring of Erosion on Restored Slopes Through the Integration of Drones, GIS, and LiDAR

Mónica López Moncada^1,2,3, Joan-Cristian Padró^1,4, Vicenç Carabassa⁵, Paulo Escandón-Panchana^6,7, Andrés Velastegui-Montoya^2,3

¹Departamento de Geografía, Universitat Autònoma de Barcelona (UAB); ²Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ³Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ⁴Institut Cartogràfic i Geològic de Catalunya (ICGC), Parc de Montjuïc; ⁵CREAF, Universitat Autònoma de Barcelona (UAB); ⁶Departamento de Ingeniería Cartográfica y Topografía, Universidad Politécnica de Madrid (UPM); ⁷Escuela de Ciencias Ambientales, Universidad Espíritu Santo

Mining represents a strategic activity for economic development; however, this activity causes significant impacts on the landscape, soil, and water resources. During the restoration phase, slope erosion represents a challenge for ensuring the geomorphological stability and ecological functionality of the affected areas. This study aims to evaluate the erosion dynamics of restored mining slopes by integrating Geographic Information Systems (GIS) and data obtained from Unmanned Aerial Systems (UAS) for geomorphological monitoring and quantification of soil loss on slopes. The research was carried out at the Lázaro quarry, Tarragona, Spain, using a fixed-wing UAS equipped with a multispectral camera to generate high-resolution orthophotos and Digital Elevation Models (DEMs), and compared with historical LíDAR data. Height Difference Models (HDMs) and volumetric analysis were applied to quantify erosion and deposition processes. Three modelling approaches were compared: ridge-derived DEM (DEMp), filtered DEM (DEMf), and lidar DEM (DEMl), considering their accuracy, spatial detail, and ability to represent erosional microtopography. The findings revealed that the DEMp provides the most consistent estimates of volume loss and most faithfully reproduces pre-erosion morphologies. At the same time, the DEMf tends to smooth relief, while the DEMl provides a lower-resolution overview. These results confirm the effectiveness of integrating UAS data, photogrammetry, and geospatial analysis for monitoring restored slopes, enabling the accurate quantification of eroded volumes and the detailed characterisation of morphological processes. This study contributes to the optimisation of the geomorphological and environmental management of restored mining areas, promoting their long-term stability and sustainability.

Application of SfM Methods for the Photogrammetric Processing of Historical Aerial VHS Videos

Grzegorz Jóźków, Maurycy Hechmann

Wroclaw University of Environmental and Life Sciences, Poland

This submission presents the results and analysis of the SfM application for the processing of historical aerial VHS videos. The test data was collected during the 1997 Central European Flood and poses significant challenges due to the low quality of the data, the manner of the data acquisition (corridor mapping from different altitudes), and the object (a significant part of the images show the water). The SfM processing was executed in commercial software and allowed for successful image block bundle adjustment and creation of subsequent products, such as dense point cloud and orthomosoaics. One of the challenges during processing was the extraction of the approximate position of images and the selection of processing parameters.

Global Block Adjustment for Mosaicked Stereoscopic Satellite Imagery

Michaël Erblang¹, Emelyne Saulnier¹, Guillaume Laurent¹, Nicolas Delaygue², Fabrice Buffe², Alice Latourte², Mathilde Jassaud³, Noémie Bricout³

¹Thales Services Numériques (TSN), 290 Allée du Lac, 31670 Labège, France; ²Centre National d’Etudes Spatiales (CNES), 18 avenue E. Belin, 31400 Toulouse cedex 9, France; ³Institut national de l'information géographique et forestière (IGN), 18 avenue E. Belin, 31400 Toulouse cedex 9, France

Satellite imagery acquired over large areas from multiple viewpoints introduces subtle geometric misalignments that degrade the quality of derived products such as Digital Surface Models (DSMs). This paper presents a global block adjustment workflow designed to correct these errors across overlapping stereo acquisitions from the CO3D constellation, which captures Earth's surface at 50 cm resolution.

The proposed pipeline operates in three stages: individual acquisition refinement using Space Reference Points (SRPs) as Ground Control Points; tie point extraction between overlapping scenes through two-pass image correlation; and a weighted global spatio-triangulation simultaneously optimizing attitude biases, attitude drifts, and per-satellite magnification parameters.

Applied to a large stereo acquisition dataset over the Aorounga crater, Chad, the method demonstrates strong geometric performance. The results highlight that careful parameterization — combining observation weighting, n-tuple point filtering, and per-satellite sensor refinement — is key to producing accurate, geometrically consistent large-scale mosaics from bi-satellite stereo imagery. This paper does not include the in-orbit performances due to confidentiality agreement.

Learning-Based Semantic Segmentation and Context-based Quality Control of Bike-Pack LiDAR data for Tree Mapping in Semi-Urban Environments

Sungwoong Hyung¹, Hazem Hanafy¹, Chunxi Zhao¹, Sangyoon Park¹, Songlin Fei², Ayman Habib¹

¹Lyles School of Civil and Construction Engineering, Purdue University, West Lafayette, IN, 47907, USA; ²Department of Forestry and Natural Resources, Purdue University, West Lafayette, IN, 47907, USA

Accurate tree mapping in semi-urban areas is essential for ecological monitoring and infrastructure maintenance, but is challenged by complex structures and clutter in LiDAR data. This study proposes a learning-based framework using a Superpoint Transformer (SPT) for semantic segmentation. The model is pretrained on the KITTI-360 dataset and then fine-tuned using transfer learning on a high-resolution dataset captured by our in-house Bike-Pack LiDAR system. A key contribution of this work is a context-based quality control process applied after the initial segmentation. This quality control process refines the results by removing building artifacts, correcting misclassifications between vegetation and poles using geometric and intensity analysis, and refining building boundaries. Experiments demonstrate that this QC process significantly improves segmentation accuracy, especially for the critical vegetation and pole classes.

Multitemporal Monitoring of Posidonia Oceanica Banquettes using UAV Photogrammetry

Valeria Longhi¹, Andrea Lingua², Francesca Gallitto², Filiberto Chiabrando³

¹DIST – Interuniversity Department of Regional and Urban Studies and Planning, Politecnico di Torino, Italy; ²DIATI – Department of Environment, Land and Infrastructure Engineering, Politecnico di Torino, Italy; ³DAD – Department of Architecture and Design, Politecnico di Torino, Italy

Posidonia oceanica (PO) meadows represent one of the most valuable coastal ecosystems in the Mediterranean Sea, providing key ecological functions and ecosystem services (Vassallo et al., 2013). Even after detachment, PO leaves and rhizome fragments accumulate along the shoreline forming thick deposits known as banquettes (Rotini et al., 2020). These natural structures play a crucial role in protecting beaches from erosion, buffering wave energy, and contributing to the nutrient cycling of coastal systems (Fonseca and Cahalan, 1992).

Despite their ecological importance, banquette dynamics are not consistently monitored, standardized monitoring procedures are lacking, and their spatial and temporal variability remains poorly understood. Within the framework of the POSEIDON project, funded by the Italian National Recovery and Resilience Plan (PNRR), innovative high-resolution mapping techniques are being developed to monitor PO ecosystems both underwater and on the coast. This contribution presents a methodology based on UAV RGB photogrammetry for the multitemporal analysis of banquette morphodynamics, demonstrating its potential for quantitative assessment of seasonal and interannual changes. UAV photogrammetry has become a widely adopted tool for high-resolution coastal monitoring and topographic mapping, providing centimeter-scale DEMs when combined with RTK positioning and well-distributed ground control points (Zannutta et al., 2020; Vecchi et al., 2021; Yoo and Oh, 2016).

Photogrammetry and 3D Gaussian Splatting for Cultural Heritage. Pro Cons and Main Differences

Xinchen Li, Alessio Martino, Filiberto Chiabrando, Xiang Li

Department of Architecture and Design(DAD), Politecnico di Torino, Italy

This paper presents a comparative analysis of traditional photogrammetric methods and 3D Gaussian Splatting (3DGS) technology in the digitisation of Cultural Heritage (CH). Two representative datasets, differing in scale and image acquisition conditions, were selected to systematically evaluate the performance of both methods in terms of visual quality, geometric accuracy, computational efficiency and stability. The results indicate that 3DGS significantly outperforms traditional photogrammetry methods in terms of rendering quality and real-time visualisation capabilities, generating more realistic and immersive visual effects. However, its geometric accuracy is generally slightly lower than that of traditional methods, a difference that is particularly pronounced in small-scale datasets or under low-resolution input conditions. Among the various implementation methods, Postshot and LichtFeld Studio demonstrated higher stability and robustness, whilst the original GraphDeco method exhibited greater sensitivity to data scale and parameter settings. Photogrammetry offers reliability in high-precision geometric reconstruction, whilst 3DGS demonstrates significant potential for complementing this with a high-fidelity visual experience. The research findings try to provide practical guidance for selecting 3D reconstruction methods across different cultural heritage application scenarios.

Prediction of Understorey Vegetation using Remote Sensing in Fennoscandian Forests

Ritwika Mukhopadhyay, Ruben Valbuena, Inka Bohlin

Dept. of Forest Resource Management, Swedish University of Agriculture (SLU), 90183 Umeå, Sweden

Understorey vegetation (USV) contributes to forest structure, nutrient cycling, species diversity, habitat functions, and disturbance processes in Fennoscandian forests. It also provides non‑wood forest products such as wild berries. Mapping USV is important for understanding ecosystem functioning and its links to overstorey conditions. Although remote sensing (RS) enables large‑scale forest monitoring, its use for USV mapping remains limited because the layer is often obscured by upper‑canopy foliage. This study assesses the accuracy of USV cover prediction (i.e., the ground area covered by USV) using multiple RS data sources, identifies key predictors, and evaluates how canopy cover influences model performance. Field data were collected in 2024 from 487 plots in the Krycklan catchment. Sentinel‑2 summer and autumn imagery provided spectral reflectance, spectral indices, and grey‑level co‑occurrence matrix (GLCM) texture variables. Additional texture variables were derived from canopy height models (CHMs) generated using airborne laser scanning (ALS; 1–2 points/m²) and Pléiades tri‑stereo image matching (0.5 m; 1.5 points/m²). Beta regression and random forest regression (RFR) models were trained on 70% of plots and validated on 30%. Important predictors included seasonal red‑edge differences, greenness‑based indices, CHM texture variables, and ALS‑based canopy cover. Model performances indicated obstruction due to overstorey canopy cover remains for USV cover prediction. Beta regression with Sentinel‑2 data performed slightly better (RMSE = 21.7 m², variance explained = 5%) than RFR. However, best results occurred in low‑canopy plots (≤40%) using RFR with Sentinel‑2 and Pléiades‑derived CHM texture variables (RMSE = 14.6 m², variance explained = 32%).

Sequence-based decoupling Encoder for Well Log Interpretation

Ning Qian, Yiming Xu, Monica Sester

Institute of Cartography and Geoinformatics, Leibniz University Hannover, Germany

Well logging curves play a crucial role in oil and gas exploration and geological engineering, as they provide essential information about subsurface formations and reservoir properties. In recent years, with the growing adoption of deep learning techniques in geoscientific data analysis, well logging data have increasingly been modeled as depth-dependent sequences, enabling the application of sequential neural networks for their analysis. Among these approaches, attention mechanisms have been adopted in log interpretation tasks due to their ability to capture long-range dependencies within sequences. However, directly applying attention mechanisms without considering the intrinsic structure of logging data may introduce model redundancy and increase learning complexity, which can ultimately degrade predictive performance. To address this issue, this study proposes a Sequence-based Decoupling Encoder (SDE). The proposed encoder explicitly disentangles the interactions between logging curves and across depth, enabling the model to learn relationships along different dimensions separately, which allows more effective feature extraction and mapping into a latent space. The decoupling strategy also reduces the learning complexity of the attention mechanism and provides clearer learning objectives for the model. The proposed method is evaluated on the public dataset \textit{FORCE2020} and applied to two common well log interpretation tasks: missing log reconstruction and lithology prediction. We compare SDE against several representative sequential baselines. Experimental results demonstrate that SDE achieves superior predictive performance in both tasks.

Exploring the Potential of the Mandeye Handheld LiDAR System for Ecosystem Characterization

Cosme Hernanz-Gilbert¹, Carlos Cabo³, Álvaro Moreno-Martínez², Mónica Herrero-Huerta¹

¹Desertification Research Centre (CIDE) - CSIC, Spain; ²Image Processing Laboratory (IPL), Universitat de Valencia, Paterna, Valencia, Spain; ³Department of Mining Exploitation, University of Oviedo, Spain

Handheld LiDAR systems are emerging as a promising alternative to traditional terrestrial and airborne laser scanning for environmental research, yet their performance and applicability remain insufficiently explored. The Mandeye LiDAR device, developed between 2022 and 2024, stands out for its lightweight design, portability, integrability with other sensing platforms, and notably low cost. These characteristics make it especially attractive for ecological monitoring, enabling high-resolution structural data collection even in projects with limited resources. Despite this potential, very few studies have evaluated the device’s performance or its capacity to support ecosystem characterization.

This research presents a comprehensive review and experimental assessment of the Mandeye LiDAR system to determine its suitability for environmental applications. Field data are being collected in Mediterranean forest and riparian environments using three acquisition modes, on foot, bicycle, and kayak, to test how platform mobility and scanning geometry influence point cloud quality. The study evaluates point density, coverage, structural accuracy, and noise sensitivity while integrating ground-truth measurements and independent LiDAR references.

Preliminary findings show that the Mandeye performs robustly across diverse environments, with kayak-based acquisitions offering particularly detailed representations of the vegetation-water interface. Walking and cycling configurations provide efficient alternatives for forest structure assessment. Overall, the results demonstrate the value of handheld LiDAR as a flexible, accessible complement to conventional remote sensing methods. The project also aims to establish methodological guidelines for Mandeye deployment, contributing to the broader adoption and standardization of low-cost LiDAR tools in ecosystem monitoring.

VISTA-GS: MVS-Guided virtual view augmentation for sparse-view 3d gaussian splatting

Hongsheng Huang¹, Yaxin Li^1,4, Shengjun Tang², Siqi Du³, Mahmoud Mostafa¹, Mahmoud Adham¹, Wu Chen¹

¹Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong, P.R. China; ²Research Institute for Smart Cities, School of Architecture and Urban Planning, Shenzhen University, Shenzhen, P.R. China; ³College of Urban and Environmental Sciences, Peking University, Beijing, P.R. China; ⁴Micro Dimension Technology Limited, Hong Kong, P.R. China

3D Gaussian Splatting (3DGS) has achieved remarkable success in novel view synthesis with dense input views. However, its performance deteriorates rapidly in sparse-view scenarios, particularly for viewpoints distant from training cameras. This degradation stems from two fundamental limitations: sparse initial point clouds from limited input views and insufficient viewing angle constraints for robust optimization.

To address these challenges, we propose VISTA-GS (Virtual Image Synthesis and Training Augmentation), a novel framework that leverages Multi-View Stereo (MVS) reconstruction for point cloud densification and generates virtual training views through alpha-blending rendering of MVS-reconstructed dense colored point clouds. Unlike existing approaches relying on generative models or learned priors, our method exploits the geometric consistency inherent in MVS point clouds to create physically-grounded virtual views. By rendering dense point clouds from strategically positioned virtual camera viewpoints, we generate additional training images that preserve accurate geometric relationships while providing crucial angular constraints, effectively regularizing 3DGS training without synthesis-induced artifacts.

Our main contributions are twofold. First, we address sparse SfM initialization by employing MVS for dense point cloud generation with adaptive depth-weighted ellipsoid scaling. Second, we introduce a rendering-based virtual view generation strategy that creates geometrically consistent training images around original viewpoints using the same alpha blending principle as 3DGS. This approach enables robust reconstruction from minimal input views (3-12 images), substantially improving novel view synthesis performance while maintaining geometric fidelity that generative approaches often compromise.

An Approach to 3D Digitisation and Segmentation of the Interior and Exterior of a complex Museum Object

Simon Albers, Thomas Luhmann, Till Sieberth

Institute for Applied Photogrammetry and Geoinformatics, Jade University of Applied Sciences, Oldenburg, Germany

The digitisation of cultural heritage objects is an important procedure to conserve, share and analyse artefacts from the past. Nowadays, it is common practice to digitise artefacts using DSLR cameras and Structure from Motion. For most objects, this is a suitable procedure, but in some cases, objects have narrow interiors which cannot be reached with common camera equipment. Our case study is a small kayak model (~ 1 x 0.1 x 0.15 m) from the 19th century with an interior that can only be documented through small openings (0.1 m radius). We developed a method using a modified webcam to safely digitise the interior of the kayak. By comparing three datasets of a test object, we describe advantages and disadvantages of the usage of integrated autofocus and colour balance of the webcam. Furthermore, we extended our approach for segmentation of 3D models to consider the interior and prepare the models for future analysis. There were no major differences between the models of the three datasets, and all of them could reduce the data gaps in the 3D model based on the DSLR images noticeably.

Three-dimensional Reconstruction and Crack Measurement of Cultural Monuments using UAV-based Photogrammetry

Wei-Che Huang¹, Wen-Cheng Liu¹, Yi-Shan Luo¹, Po-Yu Chen², Kuei-Luo Lin³

¹National United University, Taiwan; ²Shin-Mag Industrial Co., Ltd., Taiwan; ³Fullai Construction Co., Ltd., Taiwan

Three-dimensional (3D) modeling for the documentation, preservation, and management of cultural heritage is indispensable. To achieve this goal, a low-cost unmanned aerial vehicle (UAV) combined with the Structure from Motion (SfM) photogrammetric technique was utilized to build a 3D model and conduct surface crack measurements of cultural monuments. The results showed that, under simple conditions, non-specialists can easily generate accurate 3D models from UAV-acquired imagery. In this study, the statistical errors of checkpoints between 3D reconstruction and field measurements, expressed as total RMSE, ranged from 0.103 m to 0.848 m. However, the mean absolute errors of surface crack measurements between tape-based methods and 3D reconstruction ranged from 0.002 m to 0.099 m. Furthermore, UAV-SfM was applied to measure surface crack lengths on an inaccessible cultural monument. The findings demonstrated that employing the UAV-SfM photogrammetric technique for 3D reconstruction of cultural monuments is both feasible and reliable.

Towards transparent geohazard model: XAI for ground deformation susceptibility in Rhenish Coalfields, Germany

Dibakar Kamalini Ritushree^1,2, Marzieh Baes¹, Mahdi Motagh^1,2

¹GFZ Helmholtz Center for Geosciences, Germany; ²LUH Leibniz Universitat Hannover, Germany

Satellite remote sensing has become a vital tool for monitoring environmental change and supporting disaster management, offering consistent and wide-area observations of the Earth’s surface. Combined with the rapid growth of Earth observation data, machine learning (ML) enables the detection of complex spatial patterns and improves the prediction of geohazards. One significant hazard is ground deformation caused by coal mining, which threatens infrastructure, ecosystems and local communities. This study presents an interpretable ML framework that integrates multi-source geospatial datasets with eXplainable Artificial Intelligence (XAI) techniques to map deformation susceptibility in open-pit coal mining regions. Beyond achieving high predictive performance, the approach reveals the key factors controlling ground instability, including proximity to mining operations and faults, groundwater variation and topographic conditions. The results supports enhanced monitoring strategies for reducing disaster risks in mining-affected areas.

Comparative Accuracy Assessment of two Low-Cost Devices for Underwater Structure-from-Motion 3D Reconstruction

Lukas Quirin, Gunnar Lelle-Neumann, Ferdinand Maiwald

Chair of Optical 3D-Metrology, TUD Dresden University of Technology, Germany

Accurate three-dimensional (3D) documentation of underwater environments is essential for evaluating the structural integrity of submerged infrastructure such as dams, pipelines or offshore platforms, as well as for repair operations or monitoring sites affected by potential pollution hazards including underwater chemical or ammunition residues. Automatic 3D surveying plays a key role in fulfilling these tasks remotely with a spectrum of uncrewed systems, such as remotely operated (underwater) vehicles (ROV), autonomous underwater vehicles (AUV) or robots. Conventional underwater surveying methods, including high

resolution imaging sonars and laser-based techniques, often require expensive instrumentation. Advances in photogrammetry and Structure-from-Motion (SfM) techniques enable detailed

3D reconstructions from standard imagery. This study presents a comparative accuracy assessment of two imaging devices for underwater SfM-based 3D reconstruction, giving practical workflow recommendations for low-budget underwater inspection and survey tasks.

UAV Photogrammetry and Laser Pointer Targeting for High-Precision Mapping of Inaccessible Surfaces

Dobromir Filipov¹, Stefan Vlaykov²

¹UACG, Faculty of Geodesy, Sofia; ²ESO PROEKT EOOD, Sofia

Accurate georeferencing is a fundamental requirement in UAV based

photogrammetry, directly influencing the spatial

precision, reliability, and analytical value of the derived 3D

models. However, achieving high

accuracy in areas such as rockslides or steep geological

formations presents considerable challenges, primarily due to

the difficulty or danger associated with placing conventional

Ground Control Points (GCPs) on-site. This

study introduces a novel hybrid methodology that leverages

laser pointer indication and total station surveying to establish

high-precision reference points that can be safely and

effectively integrated into UAV photogrammetric workflows.

The proposed approach aims to improve the absolute and

relative accuracy of photogrammetric models without the need

for physical GCP placement in inaccessible or hazardous areas.

A mixed reality generator for real-world envirinments in real-time

Devrim Akca¹, Çağın Torkut², Gerhard Kemper³, Armin Grün⁴

¹Faculty of Engineering and Natural Sciences, Işık Üniversitesi; ²RedHorizon Technology, Inc.,; ³GGs GmbH; ⁴4DiXplorer AG

By integrating computer vision, photogrammetry, UAV technology, and Extended Reality (XR) solutions, the presented innovative Mixed-Reality (MR) photogrammetry system enables real-time 3D visualization, interaction and measurement of realworld

environments. By eliminating the need for physical presence, the system enhances safety, efficiency and accuracy in tasks like assessing structural integrity, tracking construction progress, and observing environmental changes over time. At the

core of the system is a UAV equipped with a stereo camera rig and onboard processing capabilities. Operated on-site by an operator, the UAV captures high-resolution stereo imagery, which is processed in real time through a centralized Rest API running on cloud infrastructure. Experts located anywhere in the world connect to the system using VR headsets or a webbased application, gaining immersive access to a 3D stereoscopic view with full photogrammetric measurement functionality.

The system supports multi-user collaboration, enabling synchronized analysis and data sharing across different locations. This seamless integration of hardware and software components represents a significant advancement in real-time stereoscopic visualization.

CityZen: LOD2 building reconstruction with point cloud-free model-driven approach

Mehmet Büyükdemircioğlu¹, Ibrahim Sall^1,2, Simone Rigon¹, Fabio Remondino¹

¹3D Optical Metrology (3DOM) Unit, Bruno Kessler Foundation (FBK), Trento, Italy; ²Ecole Nationale des Sciences Geographiques (ENSG), Institut National de l’Information Geographique et Forestiere (IGN), France

Accurate building footprints and 3D models are nowadays essential for a wide range of urban applications, yet the generation of Level of Detail 2 (LOD2) models remains constrained by the availability of dense 3D data such as LiDAR or image matching products. While these sources provide high geometric accuracy, they are costly to acquire and update, creating a gap between data availability and the increasing demand for city-scale 3D modelling. Recent advances in deep learning enable monocular height estimation from aerial imagery, offering a potential alternative to traditional 3D data sources. However, integrated workflows that combine image-based inference with structured 3D reconstruction are still limited. This paper presents CityZen, a point cloud-free workflow for LOD2 building reconstruction from only RGB orthophotos. The proposed approach integrates monocular height estimation (evaluating DSMNet, HTC-DC-Net and TSE-Net), roof type classification and model-driven reconstruction within a unified pipeline. Building footprints are used as geometric constraints, while learned height and semantic cues guide the generation of consistent 3D structures. The proposed framework enables scalable and practical LOD2 city modelling using widely available aerial orthophotos, reducing dependency on costly 3D data acquisition.

Fast acquisition for modelling heritage-related complex scenes based on TLS and spherical photogrammetry

Antonio Tomás Mozas-Calvache, José Luis Pérez-García, José Miguel Gómez-López, Diego Vico-García

University of Jaén, Spain

Documenting complex heritage sites, such as the QH36 Egyptian rock-cut tomb and La Lobera cave (Iberian sanctuary), often faces severe time and logistical constraints (e.g., concurrent activity, limited access). This necessitates a methodology that ensures fast data acquisition while maintaining high geometric and radiometric quality.

This study proposes a data fusion methodology combining Terrestrial Laser Scanning (TLS) and Spherical Photogrammetry (SP). TLS is prioritized for rapid, high-accuracy geometry acquisition, while SP, using a pre-calibrated 360-degree multi-camera, is utilized primarily for detailed texture mapping and supporting geometry in occluded areas.

A key element of this approach is leveraging the TLS point cloud to extract Ground Control Points (GCPs) and Checkpoints (CPs) directly, significantly reducing the need for time-consuming total station surveying and greatly improving field work efficiency.

Results demonstrate that the methodology achieves the core objective:

• Speed: Static capture time is reduced to approximately 5 minutes per station (TLS), less in the case of static spherical photographs, and even less using SP with video.

• Accuracy: Geometric registration errors given by TLS are less than 0.5 cm.

• Efficiency: Texture acquisition is improved at least 6-fold compared to conventional photogrammetry.

This validated approach offers a viable, efficient, and reliable solution for the high-quality 3D documentation of geometrically complex and time-constrained cultural heritage scenes.

Large-Field Binocular Vision Attitude Determination Method for Rocket Recovery

Yuqi Zhang, Xianglei Liu, Runjie Wang, Haibo Shi, Zhao Lu, Haiqian Wu

Beijing University of Civil Engineering and Architecture, China, People's Republic of

High-precision attitude measurement in rocket recovery is critical for reusable launch vehicles (RLVs) and aerospace sustainability, but existing technologies have key flaws. Inertial Measurement Units (IMUs) accumulate drift, misaligning control commands with actual states; high-precision gyroscopes are costly and hard to integrate; Visual-Inertial Fusion (VINS) is light-sensitive, failing in dynamic re-entry—all risking recovery failure.

To address this, a large-field binocular vision method is proposed via four stages. First, camera calibration uses Zhang’s method for intrinsic parameters (left/right reprojection errors: 0.056/0.066 px) and control-point stitching for extrinsics, solving the large-field coverage issue and achieving 33.42 mm 3D positioning error. Next, image preprocessing applies bilateral filtering for denoising, Roberts operator for edge extraction, morphological closing for contour continuity, and multi-threshold Canny fusion to suppress spurious edges, ensuring stable input. Then, total least squares fits the midline, and left/right camera plane intersection extracts the rocket’s spatial central axis, avoiding noise from point-by-point triangulation. Finally, phase correlation resolves roll ambiguity from cylindrical symmetry, and the spatial axis calculates pitch/yaw to build a Z-Y-X Tait-Bryan angle matrix for attitude determination.

Experiments on a 1:20 scale model (1 m long, 0.3 m diameter) used µs-synced high-speed cameras (6 m height, 3 m baseline). Results show roll/pitch/yaw RMSEs of 1.58°/1.54°/1.41°, with 93% mean absolute errors ≤±2°—outperforming ORB+PnP (2.11° roll RMSE), SGBM (2.50°), and Chamfer (3.00°). Ablation experiments confirm key modules’ necessity—removing line support score filtering raises roll RMSE to 1.85°—verifying robustness in dynamic re-entry.

Low-cost stereo vision and deep learning for river water level measurement

Pedro Zamboni¹, Robert Krüger¹, László Bertalan², Xabier Blanch³, Paul Hindorf¹, Anette Eltner¹

¹Dresden University of Technology, Germany; ²University of Debrecen, Hungary; ³Universitat Politécnica de Catalunya, Spain

This study presents a low-cost, non-contact stereo vision system for automated river water level monitoring, addressing the growing need for dense and scalable hydrological observation networks under increasing climate-driven flood risks. The proposed system uses paired consumer-grade cameras combined with deep learning–based image segmentation to estimate water levels without requiring physical reference markers or pre-existing 3D models.

Two processing strategies are evaluated: a standard stereo workflow and an enhanced approach incorporating semantic masking to exclude dynamic regions such as water and sky. Camera pose estimation is assessed using both global and epoch-based optimization methods. Results show that unmasked configurations provide more stable and robust camera pose estimates, while masking improves geometric accuracy but introduces temporal instability.

Water level estimates derived from stereo reconstruction demonstrate strong agreement with reference gauge data, achieving correlation coefficients between 0.70 and 0.77. Both approaches successfully capture overall hydrological trends, including flood dynamics, although accuracy decreases under high water levels and challenging imaging conditions. Masking introduces a systematic offset in absolute values but does not significantly improve correlation performance.

Research on Cloud Control photogrammetry based on Time-series Archived Aerial Photos and Its Application in Urban Governance in Beijing

Xiaokun Zhu¹, Yingchun Tao¹, Huimin Tian¹, Mingce Xu², Yutao Guo¹

¹Beijing Institute of Surveying and Mapping, China, People's Republic of; ²Beijing SmartSpatio Technology, China, People's Republic of

This study applies cloud control photogrammetry to time-series archived aerial photos to support urban governance in Beijing. Addressing challenges such as missing ground control points, heterogeneous coordinate references, and non-digitized aerial triangulation results, the proposed method leverages existing basic geographic products (e.g., DOM, DEM) as dense control sources, enabling automated aerial triangulation and 3D reconstruction without field control points. The workflow includes control source selection and organization, image preprocessing, cloud control point and tie point matching, block adjustment, and time-series product generation. Three experimental applications are presented: (1) reconstruction of river course changes in the Beijing Municipal Administrative Center using KH satellite images (1961–1974) and 1996 DOM, yielding time-series DOM products meeting 1:50,000 scale accuracy; (2) detection of illegal self-built building additions via DSM differencing from ADS80 images (2016–2017), identifying one-to-three-story structures; (3) 3D real-scene modeling of the Grand Canal’s Tonghui River section from 1975 film photos and 2015 control data, revealing 40 years of urban transformation. Results demonstrate that cloud control photogrammetry ensures spatiotemporal consistency and enables quantifiable, multi-temporal 3D analysis for urban change detection, illegal construction monitoring, and cultural heritage preservation.

UrbanVGGT: Scalable Sidewalk Width Estimation from Street View Images

Kaizhen Tan¹, Fan Zhang²

¹Heinz College of Information Systems and Public Policy, Carnegie Mellon University, United States of America; ²Institute of Remote Sensing and Geographical Information System, Peking University, China

Sidewalk width is an important indicator of pedestrian accessibility, comfort, and network quality, yet large-scale width data remain scarce in most cities. Existing approaches typically rely on costly field surveys, high-resolution overhead imagery, or simplified geometric assumptions that limit scalability or introduce systematic error. To address this gap, we present UrbanVGGT, a measurement pipeline for estimating metric sidewalk width from a single street-view image. The method combines semantic segmentation, feed-forward 3D reconstruction, adaptive ground-plane fitting, camera-height-based scale calibration, and directional width measurement on the recovered plane. On a ground-truth benchmark from Washington, D.C., UrbanVGGT achieves a mean absolute error of 0.252 m, with 95.5% of estimates within 0.50 m of the reference width. Ablation experiments show that metric scale calibration is the most critical component, and controlled comparisons with alternative geometry backbones support the effectiveness of the overall design. As a feasibility demonstration, we further apply the pipeline to three cities and generate SV-SideWidth, a prototype sidewalk-width dataset covering 527 OpenStreetMap street segments. The results indicate that street-view imagery can support scalable generation of candidate sidewalk-width attributes, while broader cross-city validation and local ground-truth auditing remain necessary before deployment as authoritative planning data.

Pompeii. From the measurement of small indentations to the calculation of the terminal ballista.

Monil Mihirbhai Thakkar¹, Amir Ardeshiri Lordejani¹, Mario Guagliano¹, Silvia Bertacchi², Sara Gonizzi Barsanti², Adriana Rossi²

¹Department of Mechanical Engineering, Politecnico di Milano, via la Masa 1, 20156, Milan, Italy; ²Department of Engineering, Università degli Studi della Campania Luigi Vanvitelli, Via Roma 29, 81031, Aversa (CE),Italy

During Sulla’s siege of Pompeii in 89 BC, Roman artillery projectiles struck the city’s fortified walls, leaving visible impact craters. The subsequent eruption in AD 79 buried the site, preserving both its architectural layout and the damaged wall surfaces, which were later excavated in the early 20th century. By analysing the visible damage found on the fortified walls of Pompeii, reverse engineering techniques were used to decipher the engineering principles behind Roman military technology. This study simulates the impact of metal projectiles on grey tuff to estimate the impact velocities and the energy required to cause the observed damage, providing insights into the destructive capabilities of Roman weapons. It develops material models and applies finite element analysis, including mesh convergence, velocity calibration, and angular impact studies for both ballista stones and darts to better understand impact mechanics and crater formation.

metal darts on the city walls, along with the simulation of forces and trajectories. Among the objectives is to verify the calculated data against experimental relationships developed in antiquity and applied to the detection of small pyramidal indentations.

BEV-LOC: Real-Time and Lightweight Cross-View Localization via Online BEV Mapping

Jiyong Kwag, Charles Toth, Alper Yilmaz

Ohio State University, United States of America

This abstract presents a deep learning and classical computer vision framework for cross-view geolocalization using 360-degree multi-perspective view (PV) images and an offline global map. Recent studies on cross-view geolocalization typically rely on deep learning models to localize panoramic PV images by matching them with reference satellite imagery. However, such approaches face practical limitations in real-world deployments, due to their dependence on large-scale GPU resources and the need to store extensive satellite image datasets. To address these challenges, we propose BEV-LOC, a lightweight and real-time cross-view geolocalization method. BEV-LOC employs Bird’s Eye View (BEV) encoder that learns to transform 360-degree multi-PV images into a local high-definition (HD) BEV map. The localization is then performed using Intersection Over Union (IoU)-based template matching with an offline global map. Our architecture achieves real-time performance at 30 FPS without the need for high-end GPU hardware and delivers a high positioning accuracy of 1.2 meters.

Remote Pipe Diameter Measurement from a single Image using Laser Scale Projection with a Depth Compensation Model

Alice Bilbáo¹, Leonardo Galvão¹, João Andrade¹, Daniel Regner¹, Moacir Wendhausen¹, Gierri Waltrich¹, Carla Marinho², Tiago Pinto¹

¹Federal University of Santa Catarina, Brazil; ²CENPES/Petrobras, Brazil

Monitoring geometric integrity of risers and pipelines is critical in offshore oil & gas operations, where swell, collapse or torsion often manifest as diametral changes that must be detected safely and efficiently. Historically, this kind of inspection is made by industrial climb, a time-consuming, dangerous and costly operation. Increasing efforts are on remote riser inspection using drones, primarily aimed at qualitative assessment through visual analysis, as well as photogrammetry, which offers accurate inspection but requires many images, image acquisition network design and well-trained drone pilots. To overcome the limitations of a qualitative image inspection and the complexity of photogrammetry, we propose a simple, low-cost method to estimate the pipe diameter from a single image by projecting two laser points of known spacing, building a scale directly in the scene and correcting depth differences between the laser projection plane and the pipe silhouette plane. This work evaluates the proposed method in laboratory conditions for nominal and calibrated focal lengths, distances from 2 m to 10 m and four pipe diameters, demonstrating the improvement of remote pipe diameter measurement by modelling and compensating for this depth difference. The improvement becomes more evident for longer focal lengths, shorter distances, and larger pipe diameters. It has an important effect in minimizing errors, e.g., from 3.5% to less than 0.2% at a 2 m distance for a 165 mm diameter pipe. The next steps include the construction of a lightweight projector to be integrated into a drone camera gimbal.

Evaluating the synergy of hand-crafted and AI-driven feature matching in structure-from-motion 3D reconstruction

Min-Lung Cheng, Yasutaka Kuramoto

SkymatiX Inc., Japan

This study evaluates the effects of hand-crafted and AI-driven feature extraction and matching approaches on 3D scene reconstruction. While hand-crafted methods remain widely adopted in structure-from-motion (SfM), their performance often deteriorates when repetitive or uniform textures occur across multiple images, leading to alignment failures and incomplete reconstructions due to insufficient or erroneous feature correspondences. Recent advances in artificial intelligence have introduced robust pipelines capable of addressing these challenges by improving feature detection and matching in texture-repetitive imagery. In this study, hand-crafted and AI-driven feature extraction and matching techniques are integrated and assessed on challenging datasets to examine their performance in SfM-based 3D reconstruction. Experimental results demonstrate that combining hand-crafted feature points with AI-driven matching significantly enhances the robustness and reconstruction success rate across diverse challenging scenarios. This hybrid approach offers a promising alternative for reliable SfM 3D reconstruction when dealing with images dominated by repetitive or uniform textures.

The Emerging Role of Vision-Language Models in the Automation of Railway Asset Management: A Review and Future Perspective

Ashley Varghese, Mohammadjavad Ghorbanalivakili, Gunho Sohn

York University, Canada

Automated railway inspection is critical for safety, but current deep learning models are limited by a "closed-world" assumption, failing to identify novel or rare assets without costly retraining. This review explores a transformative solution: Vision-Language Models (VLMs). We introduce the concept of "reasoning-powered detection," where a model’s linguistic intelligence is used to guide the identification process.

Multi-Modal LoD2 Building Reconstruction Benchmark for Urban Modeling

Mohammad Moein Sheikholeslami¹, Youssef Korny¹, Andreas Wichmann², Ksenia Bittner³, Gunho Sohn¹

¹York University, Canada; ²Jade University of Applied Sciences, Germany; ³German Aerospace Center (DLR), Weßling, Germany

Accurate 3D building modeling at level of detail 2 (LoD2) is

fundamental for urban analysis, supporting applications such

as realistic city simulations, energy assessment, and infrastructure

planning. While cadastral data is often freely accessible in

many developed countries, existing publicly available 3D building

benchmarks are typically limited either in scale or in the

diversity of input modalities required for developing and evaluating

modern deep learning methods.

We present a new large-scale, open, instance-wise dataset for

LoD2 building modeling from aerial imagery and LiDAR.

Through rigorous processing and validation, it bridges the

gap between raw open geospatial data and structured research

benchmarks. Its modular design supports both single- and

multi-modal reconstruction workflows. The upcoming public

release aims to enable reproducible research in 3D urban modeling,

cross-modal learning, and digital-twin creation, advancing

automated, reliable city-scale 3D reconstruction.

GeoRGMAE: Geospatially Guided Masked Autoencoders for Building Segmentation

Tugba Eraslanoglu¹, Guneet Mutreja², Martin Kada¹, Ksenia Bittner²

¹Technical University of Berlin, Germany; ²German Aerospace Center (DLR)

Accurate building segmentation from high-resolution aerial imagery is essential for various urban applications such as digital twins, geographic information system, and flood risk modelling. However, conventional supervised deep learning approaches require large amounts of pixel-level annotations, which are costly and time-consuming to obtain for large remote sensing datasets. To address this limitation, self-supervised learning has recently emerged as an effective paradigm in order to learn visual representations from unlabeled data. In particular, masked autoencoders (MAE) have demonstrated strong performance by reconstructing masked image patches during pretraining. Nevertheless, conventional MAE frameworks rely on random masking strategies that do not consider the spatial structure and semantic importance of regions in high-resolution remote sensing imagery. In this study, we propose GeoRGMAE, a geospatially guided masked autoencoder for building segmentation. Unlike standard MAE, which rely on random masking, our approach leverages building footprint annotations available in the pretraining dataset to guide the masking process while preserving the original reconstruction objective. We introduce three masking strategies -core, balanced, and density-aware masking- that prioritize semantically relevant building regions under the varying urban densities. The core strategy focuses on building interiors, the balanced strategy distributes masking between buildings and background, and the density-aware adapts masking based on scene-level building density. Experiments on the Roof3D and WHU Building datasets demonstrate consistent, though modest, improvements over standard MAE pretraining, with the most effective masking strategy depending on dataset characteristics. These results indicate that incorporating geospatial priors into masked image modelling can improve representation learning for downstream building segmentation tasks.

Deep Learning-based Roof Detection from UAV Dense Point Cloud for Solar Panels Mapping

Aleksandra Sekrecka, Damian Wierzbicki, Kinga Karwowska, Agnieszka Myrcik

Military University of Technology in Warsaw, Poland, Poland

Photovoltaic panels are becoming increasingly popular, and finding a suitable location for them quickly and automatically is a current and practical problem. In our experiment, we test whether a point cloud from dense multi-image matching can be useful for the automatic detection of the best locations for installing photovoltaic panels. We propose a methodology for processing and analyzing UAV point clouds, where the use of deep learning in combination with the CANUPO algorithm results in high roof recognition efficiency.Two classes were selected: roofs and non-roof objects. This made it possible to filter the detected roofs and remove erroneous objects. The resulting model detected buildings with an accuracy of approximately 80% and an effectiveness of 100% (there were no false detections). the following factors were taken into account in the insolation calculations: roof angles, roof slope exposure, changes in the angle of sunlight throughout the year, and atmospheric transmittance. The roof angles and exposure were determined using a Digital Surface Model (DSM) generated from multi-image UAV data. In our research, we took into account the average angle of incidence of sunlight throughout the year and at quarterly intervals.The use of DSM for roofs and the SVC algorithm combined with CANUPO made it possible to eliminate false detections and significantly increase the effectiveness of location detection. Research conducted for the entire year and quarters enabled the analysis of changes in roof insolation throughout the year, which is crucial when estimating the profitability of installing photovoltaic panels.

Comparison of Different Object Detection Methods for Automatic Facade Enrichment of Existing Building Modells from Arial Images

Johannes Otepka¹, Günter Sükar², Martin Kerschner², Gerald Forkert², Norbert Pfeifer¹

¹TU Wien, Austria; ²UVM Systems GmbH, Wien, Austria

This study investigates the enrichment of existing building models using deep learning-based window detection from oblique aerial imagery acquired by a high-end multi-camera sensor system. While many cities maintain LOD2 building models at Level of Detail 2, higher levels of detail require the integration of facade elements such as windows. Three detection strategies are evaluated using 3D reference building models to assess accuracy and completeness. The test site is located in Vienna and consists of multiple large residential buildings with varying facade characteristics.

The evaluated methods include zero-shot object detection with Grounding DINO combined with Segment Anything Model 2, applied to both oblique images and facade orthophotos, as well as a SAM2-UNeXT network requiring minimal training. Results indicate that zero-shot detection on orthophotos achieves the best performance, with a precision of 0.95 and an F1 score of 0.85. In contrast, the SAM2-UNeXT approach shows lower precision and F1 scores but slightly higher recall.

The investigation shows that detection performance is influenced by facade viewing angles. Steeper viewing angles generally improve detection quality but increase susceptibility to occlusions, particularly in dense urban environments. The article concludes with a detailed outlook on future work, including the extension of the approach to more complex three-dimensional building structures.

Quality Restoration of Point-Cloud-Derived 2D Projections: A Comparative Study of Void-Filling Techniques

Md Rakibul Islam Chowdhury¹, Sang Hyeok Han^1,2, Jong Won Ma³

¹Dept of Building, Civil and Environmental Engineering, Concordia University, Montréal, QC, Canada; ²Centre for Innovation in Construction and Infrastructure Engineering and Management (CICIEM), Gina Cody School of Engineering and Computer Science, Concordia University, Montréal, QC, Canada; ³School of Civil and Environmental Engineering, Yonsei University, Seoul, South Korea

Point-cloud-derived 2D projections enable generating unlimited virtual views for indoor scene analysis and dataset creation. However, projecting irregular 3D samples onto a dense image grid commonly produces void pixels due to sparsity, occlusions, and incomplete scan coverage. These projection-induced artifacts degrade the visual fidelity of rendered images and limit their usefulness in downstream image-based workflows. This study investigates void-filling strategies tailored to point-cloud-generated RGB projections and provides a comparative evaluation of three representative approaches: (i) K-nearest neighbor (KNN) interpolation with KD-Tree accelerated neighbor search, (ii) a rule-based neighborhood method (NNRule) that adapts filling behavior using local variability to preserve edges, and (iii) a mask-normalized Gaussian-weighted propagation method that diffuses valid color information into void regions. Experiments were conducted on multi-view perspective projections generated from Stanford Large-Scale 3D Indoor Spaces Dataset (S3DIS) Area 3, totalling 5,520 images. Restoration quality was assessed using standard pixel-level metrics such as MAE, RMSE, PSNR, and SSIM. Quantitative results show that Gaussian-weighted propagation achieved the best overall performance, followed by NNRule, while KNN performed weakest numerically. Qualitative comparisons further indicate that KNN produces the most visually realistic texture appearance, whereas diffusion-based filling is softened fine details. Finally, the study establishes a practical baseline that enables both academic researchers to advance point-cloud-to-image restoration without relying on paired RGB datasets and industrial practitioners to deploy light weight void-filling pipelines in real-world applications such as digital twins, indoor robotics, facility management, and augmented reality.

Bridging the Gap: Improving handheld Laser Scanning Point Cloud Quality in Forests via RTK-GNSS integrated SLAM

Carolin Rünger, Stefan Binapfl, Sophia Böhme, Ferdinand Maiwald, Anette Eltner

Technical University Dresden, Germany

Accurate forest inventories are essential for sustainable forest management. Handheld personal laser scanning (H-PLS) enables efficient and flexible forest data acquisition. However, ensuring reliable point cloud quality in complex environments remains challenging. While Simultaneous Localization and Mapping (SLAM)-based H-PLS allows rapid data collection, trajectory drift and accumulated registration errors can reduce the accuracy of derived tree parameters and structural metrics. In contrast, Global Navigation Satellite System (GNSS)-based Real-Time Kinematic (RTK) positioning provides centimetre-level absolute accuracy and drift-free trajectories, although its application in forested environments is still emerging. This study evaluates the impact of RTK-GNSS integration on point cloud geometry compared to SLAM-based point clouds without GNSS across two Central European forest plots with contrasting canopy structures. Analyses focused on tree parameter accuracy, structural metrics based on quantitative structural models, point density and noise characteristics. To isolate the effect of GNSS integration, data from the RTK-GNSS enabled H-PLS device were additionally processed without GNSS information, and an open-trajectory scan without loop closure was included for comparison. Results show that RTK-GNSS improves point cloud consistency and especially enhances the estimation of volume- and branch-related metrics. In the dense canopy plot, RTK-GNSS information reduced mean errors in branch number (−6100 to −5369) and crown volume (−492.75 to −357.21 m³). However, overall performance in tree parameter estimation depends on point density. These findings highlight RTK-GNSS H-PLS as a promising approach for flexible and efficient forest data acquisition in inventory applications.

Semantically-Driven Adaptive Registration for Correcting Non-Constant Drift in Multi-Temporal MLS Data

Aimad El Issaoui^1,2, Veikka Taka¹, Harri Kaartinen¹, Antero Kukko^1,2, Juha Hyyppä^1,2

¹Finnish Geospatial Research Institute (FGI), the National Land Survay of Finland; ²Aalto University, School of Engineering, Department of Built Environment

Mobile Laser Scanning (MLS) provides high-accuracy 3D point clouds essential for road infrastructure monitoring. However, multi-temporal MLS analysis is often limited by non-constant, spatially varying trajectory drift caused by GNSS outages and IMU inaccuracies. These misalignments can exceed the magnitude of the changes being monitored, such as pavement deformation, making accurate change detection challenging. This paper presents a fully automatic, semantically driven registration pipeline designed to correct spatially varying drift in directly georeferenced MLS data. The method first applies Principal Component Analysis (PCA) and intensity-based filtering to classify points into stable geometric categories, including flat horizontal surfaces, flat vertical structures, and linear vertical features. A correspondence-based filtering step removes dynamic objects and temporal changes to ensure that registration is driven by stable geometry. The core of the method is an adaptive piecewise registration strategy, where the reference point cloud is divided into sequential 1-meter patches. Each patch is assigned a local rigid transformation estimated using an adaptively expanding registration window guided by the availability of stable vertical features. A final smoothing step ensures spatial continuity between adjacent transformations. The method was evaluated on two MLS datasets collected one year apart along a 3 km road corridor using the FGI Roamer-R4DW system. Validation using 30 independent ground signals showed that the 3D RMSE improved from 3.38 cm to 1.54 cm, with vertical RMSE improving from 2.54 cm to 0.67 cm. The results demonstrate that the proposed approach enables centimeter-level alignment suitable for high-precision multi-temporal road monitoring and change detection applications.

3D Meshing of Challenging Surfaces using Gaussian Splatting

Dario Billi^1,2, Chaimaa Delasse^2,3, Arnadi Murtiyoso², Hélène Macher², Pierre Grussenmeyer², Gabriella Caroti¹, Andrea Piemonte¹

¹Department of Civil and Industrial Engineering, ASTRO Laboratory, University of Pisa, Largo Lucio Lazzarino, 56122 Pisa, Italy; ²Université de Strasbourg, INSA Strasbourg, CNRS, Laboratoire ICube UMR 7357, 67000 Strasbourg, France; ³Ecole des Sciences Géomatiques et de l’Ingénierie Topographique, Institut Agronomique et Vétérinaire Hassan II, Madinat Al Irfane, 6202 Rabat, Morocco

This work addresses the challenge of accurate 3D reconstruction of complex scenes such as vegetation, transparent, or non-Lambertian surfaces, which often cause difficulties for traditional Multi-View Stereo (MVS) methods. This issue is particularly relevant in the field of Cultural Heritage (CH), where many objects and environments exhibit such characteristics. To overcome these limitations, the study proposes the use of the new MILo (Mesh-In-the-Loop Gaussian Splatting) approach (Guédon et al., 2025), comparing its results with conventional MVS techniques and Terrestrial Laser Scanner (TLS) data.

MILo builds upon the 3D Gaussian Splatting (3DGS) technique, introducing a differentiable mesh extraction during optimization of the Gaussian parameters. This enables gradient flow between the volumetric and surface representations, resulting in more accurate and lightweight meshes, suitable for downstream applications such as simulations or animations.

The study uses three datasets: a Tilia tomentosa tree (Strasbourg) for complex natural geometries, the winter garden of the Sarreguemines Museum for reflective surfaces, and woodcarvings from Kasepuhan Palace (Indonesia) for fine ornamental details. Preliminary results on the tree dataset show that MILo significantly improves reconstruction quality, preserving thin structures such as branches and leaves compared to traditional MVS methods.

The final analysis will include both qualitative and quantitative comparisons (RMSE, standard deviation, completeness, mesh complexity) against TLS data, to rigorously assess MILo’s performance across different geometric and material conditions.

Render-to-Real Image-Based Change Detection of Outdoor Infrastructure Using 3D Gaussian Splatting

Satoko Hattori-Nagao, Kazuo Oda, Tomoaki Eguchi, Takanobu Nagao, Satomi Kakuta

Asia Air Survey Co., Ltd., Japan

This study proposes a framework for detecting changes in outdoor civil infrastructure using bi-temporal images and validates its effectiveness through experiments on real-world datasets. The proposed method performs change detection by comparing a 3D Gaussian Splatting (3DGS) model reconstructed from multi-view images acquired before changes occur with a single real image captured from a new observation viewpoint after changes. The processing pipeline consists of: (1) construction of the 3DGS model, (2) generation of an initial rendered image corresponding to the post-change real image, (3) feature matching between the rendered image and the real image followed by camera pose estimation, and (4) change detection. Experiments conducted on a sediment control dam and a bridge dataset demonstrate that the proposed method achieves a maximum Intersection over Union (IoU) of 0.82 for change detection. Furthermore, compared to a baseline method based on bi-temporal real image pairs, the proposed method improves IoU by up to 24 percentage points. The results also indicate that even under limited acquisition conditions after changes, accurate change detection can be achieved when the 3DGS reconstruction quality and pose estimation are sufficiently reliable.

Empirical assessment of geometric accuracy of underwater lidar in tropical shallow waters

Mentari Khoerunnisa Azzahra¹, Fickrie Muhammad², Arnadi Murtiyoso³, Annette Scheider⁴, Harald Sternberg⁴, Gabriella Alodia²

¹Institut Teknologi Bandung, Faculty of Earth Sciences and Technology, Geodesy and Geomatics Engineering Postgraduate Programme, Bandung, Indonesia; ²Institut Teknologi Bandung, Faculty of Earth Sciences and Technology, Hydrography Research Group, Bandung, Indonesia; ³Université de Strasbourg, CNRS, INSA Strasbourg, ICube Laboratory UMR 7357, Photogrammetry and Geomatics Group, Strasbourg, France; ⁴HafenCity University Hamburg, Department of Hydrography and Geodesy, Hamburg, Germany

Light detection and ranging or lidar technology has been widely applied across various spatial domains. To meet the needs for a detailed underwater survey, Fraunhofer IPM developed an underwater lidar, known as ULi. The system has been tested under controlled laboratory conditions. Nevertheless, Fraunhofer IPM claims sub-millimetre range precision in clean water. However, no empirical study has managed to address this aspect, as fieldwork in the Elbe River (Walter et al., 2025) did not manage to obtain suitable data due to its naturally high turbidity. The present study will evaluate the geometrical accuracy of ULi against terrestrial laser scanner (TLS) and photogrammetry. An acoustic Doppler current profiler (ADCP) was chosen as a measurement target on the field experiment due to its rigidity and high reflectivity, with the dimensions of the frame is 75 × 75 × 65 cm. The data sets were georeferenced to the WGS 84/UTM Zone 48S coordinate system using control point targets affixed to the ADCP frame and measured with a total station applying the intersection method. Subsequently, the geometric accuracy assessment was performed through statistical evaluations, including root mean square error analysis and 3D point cloud deviation comparison among ULi, TLS, and photogrammetry data sets. The 3D model derived from the ULi data will be assessed against models derived from TLS and photogrammetry through statistical analyses of length discrepancies and spatial deviations. Additionally, intensity, point density, linearity, planarity, and scattering analyses will be performed to evaluate how well the point cloud represents the geometric characteristics.

Experimental Validation of Human-Readable Coded Targets for Cross-Platform Photogrammetry and 3D Laser Scanning

Miglena Raykovska^1,5, Milen Borisov², Stanislav Harizanov¹, Lyubka Pashova³, Nikolay Petkov¹, Kristen Jones⁴, Pavel Georgiev¹, Georgi Vasilev¹, Ivan Lirkov¹

¹Institute of Information and Communication Technologies, Bulgarian Academy of Sciences; ²Institute of Mathematics and Informatics, Bulgarian Academy of Sciences; ³National Institute of Geophysics, Geodesy and Geography, Bulgarian Academy of Sciences; ⁴Queens University, Canada; ⁵Centre of Excellence in Informatics and Information and Communication Technologies

Coded targets are widely used in close-range photogrammetry and 3D laser scanning for automated referencing and registration. However, most fiducial systems are optimized for specific software environments, limiting interoperability across processing pipelines. This study presents a cross-platform coded target framework for multi-sensor 3D acquisition that combines geometric redundancy, binary encoding, and human-readable elements to enhance robustness and reproducibility. An open-source implementation (PGT-Toolkit) supports marker generation, detection, and standardized coordinate export. Performance was evaluated using a controlled laboratory framework with systematically varied viewing angles, distances, and illumination conditions. Experiments were conducted using DSLR-based photogrammetry and terrestrial laser scanning. Detection rate, centroid repeatability, reprojection error, and cross-platform coordinate consistency were assessed and compared with those of established fiducial systems. Results demonstrate stable detection under oblique viewing geometries and consistent coordinate estimation across both commercial and open-source software environments. Laboratory studies confirm that Human Readable Coded Targets (HRCT) provide reliable, accurate, and cross-platform compatibility for both photogrammetric and 3D laser scanning workflows, which remain to be verified by field studies. The proposed framework contributes a structured methodology for experimental validation of interoperable coded targets in multi-sensor 3D workflows.

Integrating Multi-View Stereo and Depth Foundation Models for Precise 3D Reconstruction of Thin Urban Structures

Hwiyoung Kim¹, Impyeong Lee², Kyoungah Choi³

¹Geospatial Team, InnoPAM, Korea, Republic of (South Korea); ²Dept. of Geoinformatics, University of Seoul, Korea, Republic of (South Korea); ³Geospatially Enabled Society Research Division, Korea Research Institute for Human Settlements, Korea, Republic of (South Korea)

Constructing high-fidelity 3D models for urban Digital Twins is challenging, particularly for thin, texture-less structures like power lines where traditional Multi-View Stereo (MVS) fails due to matching ambiguities. While recent Monocular Depth Foundation Models offer dense estimation, they lack absolute scale and often degrade when applied to large-scale aerial imagery. This paper proposes a hybrid depth estimation pipeline that synergizes the metric accuracy of MVS with the structural coherence of foundation models.

Our method follows a Coarse-to-Fine strategy. First, we generate a scale-aware initial depth map by injecting sparse MVS points into the "Depth Anything" model as geometric priors, compensating for the lack of absolute scale in monocular estimation. Subsequently, a structure-guided refinement stage employs edge-based contour grouping to rectify object boundaries and suppress noise. Experimental results demonstrate that our approach successfully reconstructs power lines as distinct, linear objects with absolute scale, effectively resolving the data voids inherent in MVS and the geometric distortions typical of monocular models. This research provides a robust workflow for enhancing the precision of urban 3D reconstruction.

Estimation of refraction in photogrammetry from airborne data in an alpine environment

Myrta Maria Macelloni, Nives Grasso, Alberto Cina

Politecnico di Torino, Italy

Valpelline is an unspoilt Alpine valley located in the northernmost part of the Aosta Valley, on the border between Italy and Switzerland. It is the region’s longest valley, shaped by glaciers and rivers, with elevations ranging from about 900 m to over 4000 m at peaks such as Mont Gelé (3518 m) and Dent d’Hérens (4171 m).

Since 2020, the glaciers have been monitored by the GlacierLAB group (Politecnico di Torino) and ARPA Valle d’Aosta. Because of the valley’s steep, inaccessible terrain, biannual aerial photogrammetric surveys with a GNSS antenna, a low-accuracy IMU, and a PhaseOne iXM-RS150F camera (151 MP, 50 mm lens).

Due to a lack of synchronization between the camera and GNSS, Ground Control Points (GCPs) are needed for georeferencing. However, their configuration is often insufficient. Camera calibration certificates (2019, 2022) are crucial to correct image distortions; when unavailable, calibration is estimated using Agisoft Metashape and Structure-from-Motion methods, dividing known points into GCPs and Control Points to evaluate residuals.

High-altitude flights require correction for atmospheric refraction, which affects image geometry independently of optical distortion. Tests were carried out to estimate refraction errors (via Saastamoinen formulas) and to separate them from optical effects, enabling more accurate 3D models of Valpelline’s complex alpine environment.

Learning-based Estimation of Surface Normals in Unstructured Airborne LiDAR Point Clouds

Max Hermann^1,2, Martin Weinmann²

¹Fraunhofer IOSB, Karlsruhe, Germany; ²Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany

To produce suitable 3D models for downstream tasks, point clouds are often triangulated to reconstruct a triangle mesh, which first requires estimating normal vectors that define the local surface orientation. Because normals are not directly measured during laser scanning, they are often estimated in postprocessing using two steps: (1) selecting a neighborhood around each point and fitting a local surface function, and (2) orienting the resulting normal to distinguish inside from outside. Larger local neighborhoods often yield more consistent normals by averaging the surface, but can smooth out sharp discontinuities.

For orientation, various methods attempt to estimate the inside versus outside direction. In watertight scans, orientation can be determined by locally triangulating the points and propagating consistent normal orientations along the connected triangles. For surface scans containing holes and occlusions, typical for airborne LiDAR, this is more challenging, and heuristics like Minimum-Spanning-Trees or global flips towards one major coordinate axis are often used.

We propose a learning-based approach to estimate surface normals in unordered point clouds from airborne LiDAR scanning. Across multiple datasets, our approach consistently reduces artifacts and improves the quality of reconstructed triangle meshes compared to baseline methods, while achieving significantly faster runtime

Railway parameter extraction with high-precision UAV-photogrammetry: a feasibility study

Lucas De Burggrave¹, Pierre Prévost¹, Erkki Bartczak¹, Suzanna Cuypers¹, Jens Derdaele², Maarten Bassier¹

¹KU Leuven, Belgium; ²TUC RAIL, Brussels

This study investigates the feasibility of using UAV-based photogrammetry for the accurate extraction of railway geometry parameters such as gauge, alignment, and cant. The research explores whether aerial image-based reconstruction can meet the high precision requirements traditionally achieved through terrestrial survey methods. A series of experimental flights were carried out to evaluate how flight configuration, image quality, and processing strategy influence measurement accuracy and reliability. The results provide insight into the potential and current limitations of UAV photogrammetry for rail infrastructure documentation and quality control. Overall, the study contributes to advancing automated, efficient, and safe methods for railway inspection and geometric parameter extraction.

Sand Engine Beach State Assessment by applying Machine Learning on massive ARGUS Imagery

Alex De Jong, Roderik Lindenbergh, Sander Vos, Daan Hulskemper

Delft University of Technology, Netherlands, The

Dynamic beach locations world-wide are monitored by so-called Argus camera systems. Their automatic image capturing results in large databases of coastal images acquired during different illumination conditions. We present a lightweight and efficient method to automatically extract meaningful sand and supporting classes from ∼ 1 million Argus images of the Sand Engine, The Netherlands, a nature-based solution for beach erosion of 2 by 1 km. The method consists of 2 neural networks. First, a ResNet18 model selects images of sufficient quality. The second network, a shallow multi-layered perceptron is fed by RGB, intensity and texture features and classifies pixels into 6 classes, Water, Foam and Vegetation on one hand, and Aeolian, Wet and Armoured Sand on the other hand. Initial results shows good agreement with human interpretation. Final results will be used to assess the multi-year morpho-dynamic evolution at the hour scale of the Sand Engine.

Pixel-based vegetation mapping at class-level from UAV multispectral imagery: application in an alpine lake ecosystem

Mohammad Elahi¹, Alessandra Spadaro², Francesca Matrone², Andrea Maria Lingua², Chiara Graziani², Vittorio Fra¹

¹Interuniversity Department of Regional and Urban Studies and Planning (DIST), Politecnico di Torino, Corso Duca degli Abruzzi 24, Torino; ²Department of Environment, Land and Infrastructure Engineering (DIATI), Politecnico di Torino, Corso Duca degli Abruzzi 24, Torino

Vegetation mapping in alpine environments is essential for monitoring ecosystem dynamics and climate change impacts, yet remains challenging when using very high-resolution UAV imagery under limited labeled data. This study proposes a data centric, pixel-based classification framework for species-level vegetation mapping using multispectral UAV data acquired in an alpine study area. The approach prioritizes improving data representation rather than increasing model complexity. To address label scarcity, a feature-rich dataset was constructed by integrating spectral information, vegetation indices, and lightweight spatial descriptors to enhance class separability. Classification was performed using XGBoost, which is well suited for multispectral tabular data and robust under imbalanced conditions. The results show consistent classification performance across vegetation types and demonstrate the effectiveness of dataset enrichment under limited supervision, highlighting the importance of feature representation in data-scarce scenarios.

A Lightweight CNN–Mamba Hybrid Architecture for Efficient Crack Segmentation

Masaya Shimasaki, Mitsuteru Sakamoto, Toshiaki Satoh

PASCO Corporation, Japan

Pavement crack segmentation is an important task in road infrastructure inspection. However, the practical deployment of deep learning-based methods remains challenging because many high-performance models require substantial computational resources. This limitation is particularly critical in large-scale Mobile Mapping System (MMS)-based workflows, where large volumes of road surface imagery must be processed efficiently. In this study, a lightweight CNN–Mamba hybrid architecture is proposed for efficient crack segmentation as a deployment-oriented redesign of CT-CrackSeg. The proposed model replaces the original MobileViT-based global modelling modules with EfficientViM-inspired blocks based on hidden-state mixer-based state space duality (HSM-SSD), while preserving the overall encoder–decoder structure. In addition, the boundary enhancement branch is refined by introducing DCNv2-based deformable convolution. Experiments were conducted on the publicly available GAPs384 and CamCrack789 datasets. The results show that the proposed model maintains competitive topology-aware segmentation performance while substantially improving computational efficiency. Compared with CT-CrackSeg, the proposed method improves inference speed from 1.49 to 4.44 FPS on GAPs384 and from 1.31 to 3.92 FPS on CamCrack789. At the same time, peak memory consumption is reduced from 2827 MB to 355 MB, while the clDice score remains comparable, changing from 0.760 to 0.758 on GAPs384 and from 0.921 to 0.922 on CamCrack789. These results indicate that the proposed architecture provides a favourable balance between segmentation quality and deployment efficiency, and is suitable for large-scale pavement inspection and related photogrammetric infrastructure monitoring applications.

A Multi-Sensor and Multi-Temporal Approach to 3D Documentation of Historic Gardens: A Case Study of Villa Burba, Italy

Fangming Li¹, Cristiana Achille¹, Raffaella Laviscio², Luca Perfetti³, Francesco Fassi¹

¹3D Survey Group, ABC Lab, Department of Architecture, Built Environment and Construction Engineering (DABC), Politecnico di Milano, Via Ponzio 31, 20133 Milano, Italy; ²PaRID, ABC Lab, Department of Architecture, Built Environment and Construction Engineering (DABC), Politecnico di Milano, Via Ponzio 31, 20133 Milano, Italy; ³DICATAM, Civil Engineering, Architecture, Territory, Environmental and Mathematics, Università degli Studi di Brescia, Italy

Historic gardens are dynamic Cultural Heritage, shaped by seasonal cycles, vegetation growth, and continual maintenance, and require documentation methods capable of capturing change over time. This study presents a multi-sensor, multi-temporal workflow applied to Villa Burba, a seventeenth-century garden near Milan, Italy. Two surveys conducted in 2023 (leaf-on) and 2025 (leaf-off) combined UAV photogrammetry with mobile laser scanning (MLS) to maximize completeness under contrasting environmental conditions. Both datasets were processed independently, harmonized within WGS84 / UTM Zone 32N, and evaluated through point density analysis, deviation modelling, MLS loop-closure checks, and GCP residual evaluation.

Multi-temporal point clouds were analyzed in QGIS using PDAL-enabled tools. Cloud-to-cloud differencing and canopy height modelling revealed key transformations, including the drying of a water channel, the loss of a historic tree, and spatial shifts in vegetation structure. These digital findings were confirmed through field inspection. The workflow demonstrates a practical approach for monitoring dynamic heritage gardens and supporting long-term conservation and management through accurate, repeatable 3D survey data.

Affine Invariant OpenCV Descriptors and the Effects on Aerial Photgrammetry

Evan Okeeffe², Debra Laefer¹, Eleni Mangina²

¹New York University, United States of America; ²University College Dublin

Robust feature descriptors are necessary for computer vision applications such as image matching, photogrammetric three-dimensional (3D) reconstructions, and simultaneous localisation and mapping (SLAM). While most state-of-the-art feature descriptors are invariant to image transformations (such as translation, rotation, and scale) the majority lack stability in tracking points over large 3D perspective transformations. One successful method to solving these large perspective changes is by simulating affine tilts on the latitude and longitude axes of an image. These simulated tilts create greater invariance to changes in 3D perspective. To demonstrate the widespread efficacy of this approach, this paper applies affine simulation to seven state-of-the-art descriptors in OpenCV and to two of the enhanced OpenCV descriptors in OpenMVG.

Evaluating ORB-SLAM 3 Performance using a Photogrammetry-based Reference Trajectory

Leonardo Galvão, Daniel Regner, Moacir Wendhausen, Tiago Pinto, Armando Albertazzi

Federal University of Santa Catarina, Brazil

The robust evaluation of Visual Simultaneous Localization and Mapping (vSLAM) systems is fundamental to their development and deployment. However, this process is often constrained by the reliance on expensive and complex external infrastructure, such as laser trackers or motion capture systems, to provide accurate ground-truth trajectories. This paper introduces a novel and self-contained methodology for the high-fidelity evaluation of stereo vSLAM and stereo-inertial algorithms. Our approach leverages the very same image sequence used by the SLAM algorithm to generate a dense, globally optimized photogrammetric model. The proposed methodology comprises two fundamental steps, the first step consisted of validating photogrammetry as a ground truth method. For this purpose, the linear displacement measured by photogrammetry was compared with the displacement of a precision guide, which was benchmarked against a laser interferometer as the standard. Once the reference was validated, the second step assessed the performance of ORB-SLAM 3 on a free trajectory within a complex environment, by directly comparing the SLAM result to the trajectory generated by photogrammetry. The accuracy was then quantified using standard metrics, including Absolute Trajectory Error (ATE) and Relative Pose Error (RPE). The results validate our approach as an accessible, low-cost, and reliable alternative for benchmarking vSLAM systems, enabling rigorous performance analysis using only the data from the sensor suite under evaluation.

Deriving Tree Stem Profile and Volume Using a Close-Range Remote Sensing and Machine Learning Approach

Basam Dahy¹, Dag Björnberg^1,2, Shafiullah Soomro¹, Johan E. S. Fransson¹

¹Linnaeus University, Sweden; ²Softwerk AB, Sweden

Accurate estimation of tree volume is essential for precision forestry and sustainable forest management. Traditional forest inventory methods rely on manual measurements of tree height and diameter, which are time-consuming and costly to conduct over large areas, and difficult to perform efficiently in dense forest stands. This study presents a data-driven approach for estimating tree volume from partial tree stem profiles derived from high-resolution datasets. While the study relies on harvester production data (Sweden) and field-measured tree stem profiles (Brazil), the framework is designed to support the estimation of tree volume from close-range remote sensing techniques, such as terrestrial photogrammetry using handheld cameras. Three modelling approaches were evaluated, including two machine learning models (XGBoost and Random Forest) using partial tree stem profile measurements as predictors, and one baseline model (XGBoost) using diameter at breast height and tree height as predictors. The models were developed using two independent datasets: harvester production data of Norway spruce (Picea abies (L.) H. Karst.) from Sweden and field-measured tree stem profiles of Slash pine (Pinus elliottii Engelm.) and Loblolly pine (Pinus taeda L.) plantations from Brazil. The results show that tree volume can be predicted with reasonable accuracy using partial tree stem profiles, although models incorporating tree height achieved the lowest prediction errors. The findings demonstrate that partial tree stem profiles provide valuable structural information for machine learning-based tree volume estimation. This framework supports the future integration of close-range remote sensing techniques into modern forest inventory systems.

Towards Open-Vocabulary ALS Point Clouds Semantic Segmentation: An Empirical Study

Yanghong Lin^1,2, Tianyu Li^1,3, Shudong Zhou¹, Jingru Zhang¹, Li Fang¹, Wei Yao¹

¹Institute of Urban Environment, Chinese Academy of Sciences, China, People's Republic of; ²University of Chinese Academy of Sciences, China, People's Republic of; ³School of Resource and Environmental Sciences, Whuhan University, China, People's Republic of

Semantic segmentation of Airborne Laser Scanning (ALS) point clouds is critical for numerous photogrammetric and remote-sensing applications. While deep learning has become the dominant approach for ALS semantic segmentation, most existing methods rely on predefined label sets and thus lack the ability to recognize arbitrary semantic categories. With recent advances in visual foundation models (VFM), zero-shot visual understanding has achieved notable progress in natural image domains. However, the potential of adapting 2D VFMs to 3D ALS point cloud segmentation remains underexplored.

This contribution develops three VFM-based approaches for zero-shot, open-vocabulary ALS semantic segmentation: Grounding DINO+SAM, CLIP+SAM, and GSNET. Grounding DINO+SAM identifies object regions using text prompts and employs SAM to refine segmentation masks. SAM+CLIP first generates instance masks via SAM and then assigns semantic labels using CLIP text and visual embedding. GSNET integrates a remote-sensing-specific encoder with a CLIP-aligned encoder to alleviate the domain gap between natural and aerial imagery.

Empirical study conducted on the ISPRS Vaihingen dataset demonstrate that all three methods possess certain zero-shot open-vocabulary capabilities. Methods trained solely on natural images perform well on common classes (e.g., roof, tree) but struggle with rare categories such as powerline. GSNET improves performance across most categories, highlighting the importance of domain adaptation; however, rare-class segmentation remains challenging. These findings suggest that substantial domain gap and limited representation of rare classes are key obstacles to applying VFM in remote sensing. Future research should focus on test-time adaptation and unsupervised domain adaptation to enhance VFM generalization for 3D ALS point cloud.

A Workflow for the automatic Extraction of Glacier Contours from 4D Point Clouds

Steffen Isfort¹, Melanie Elias², Hans-Gerd Maas¹

¹TUD - Dresden University of Technology, Germany; ²HTWD - University of Applied Sciences Dresden, Germany

A workflow for the automatic extraction of the outlines of debris-covered glaciers and rock glaciers is presented. As the outlines in these scenarios are not clearly discernible, our approach is based on identifying geomorphological changes in multi-temporal 3D point clouds. We assume that these changes are caused by changes of the glacier. Consequently, areas with significant changes can be used to map the outline of the glacier. Our workflow uses pairs of multi-temporal 3D point clouds, which are captured for example by UAV imagery and TLS. After applying a robust registration algorithm, the difference of both point clouds is calculated. Considering only the areas that show significant changes, the glacier areas are isolated, and the outlines are mapped in a 2D mapping plane.

For evaluation, we test our workflow on two data sets. The Bøverbreen glacier, with only little debris cover, allows for a manual assessment of the glacier margins using an orthophoto mosaic from UAV imagery. A comparison of our calculated glacier margins with the manually assessed ones shows good agreement. The results confirm the basic functionality of our proposed method. However, tests show that the most challenging task is filtering glacial and non-glacial points, which is currently done solely based on the point density. More robust solutions to this problem will be discussed.

Automated detection of box-girder bridge deterioration using cylindrical projection from multi-camera 3D reconstruction and deep learning

OU Ming-Yun¹, Jhan Jyun-Ping¹, Lin Chen-Kuang³, Lin Shih-Syun¹, Tsai Hsin-Chu², Chou Tzu-Liang², Chang Chang-yu²

¹National Taiwan University of Science and Technology, Chinese Taipei; ²China Engineering Consultants, Inc., Chinese Taipei; ³Department of Mechanical and Materials Engineering, Tatung University, Taiwan

As large-scale infrastructure gradually ages, hundreds of existing bridges require regular inspections to ensure structural safety. While many researchers have proposed deterioration detection methods based on computer vision and deep learning—which can detect deterioration at the image level—no effective approach has yet been developed that integrates 3D reconstruction technology to achieve spatial localization and area quantification. To address this, this study proposes a two-part automated inspection workflow for the classification, localization, and measurement of internal deterioration in box-girder bridges. In the first part, the camera system is calibrated using an indoor calibration scene, and images are captured inside the box girder. A 3D model is constructed using Structure from Motion (SfM) algorithms, and a cylindrical projection unfolded map is generated. In the second part, a boundary-aware model—modified from DeepV3+—is used to perform pixel-level deterioration detection and classification on the unfolded map. Experimental results demonstrate that the system can generate scale-corrected cylindrical unfolded maps from 3D models with sub-millimeter scale accuracy (0.105 mm), effectively transforming complex 3D inspection tasks into measurable and analyzable 2D images. The model achieved an overall mean Intersection over Union (mIoU) of 65.11% across four categories of deterioration, representing a 7.54 percentage point improvement over the original DeepV3+. The research results validate the effectiveness of the proposed workflow in enhancing detection efficiency and objectivity for box-girder bridge maintenance.

Methodology and Practice of Hong Kong 3D Digital Map Construction Based on Multi-Source Data Fusion

Li Chen, Jun Li, Yaping Wang, Jing Wang, Weichen Yao

Shaanxi TIRAIN Science & Technology Co., Ltd., People's Republic of China

In response to Hong Kong's smart city development strategy, this paper takes the 3D digital map construction project in Kowloon as a practical case study and systematically presents a construction method -for 3D digital mapping based on multi-source data fusion. Aiming at the technical challenges in high-density urban environments—including dense buildings, complex 3D traffic networks, and severe shadow occlusion—an "air-ground fusion" data acquisition strategy is proposed. By comprehensively adopting multiple approaches such as oblique aerial photography, Vehicle Mobile Mapping System (VMMS), and Portable Mobile Mapping Survey (PMMS), a high-precision and highly realistic urban 3D model has been constructed. The paper focuses on the principles of multi-source data fusion based on feature registration and combined adjustment, as well as the 3D modeling process and the quality control methods for the final results. The project’s technical innovation and practical feasibility have been validated through international benchmarking. The research results have been applied to urban planning, traffic management, environmental studies and other fields, providing a solid data foundation and technical support for Hong Kong's smart city development.

Automatic Reconstruction of High-Accuracy 3D Roof Models from Orthophotos and Digital Surface Models

Yonghe Li¹, Masaya Shimasaki², Mitsuteru Sakamoto², Toshiaki Satoh², Tatsunori Sada¹

¹NIHON University, Chiba, Japan; ²PASCO Corporation, Tokyo, Japan

In recent years, the demand for 3D city model development has grown, as demonstrated by initiatives such as Project PLATEAU in Japan. In the construction of LoD2 building models, which are an essential component of 3D city models, the reconstruction of 3D roof models still heavily depends on manual work. To enhance productivity through automation, this study proposes a novel method for automatically reconstructing high-accuracy 3D roof models using orthophotos and Digital Surface Models (DSMs) derived from aerial imagery. In the proposed method, a deep-learning-based model is first applied to orthophotos and DSMs to extract 2D rooflines. Then, the extracted 2D rooflines are refined and polygonised to assemble 2D roof models. Finally, planar fitting was performed on the point cloud generated from the DSM within each 2D roof plane to reconstruct 3D roof models. In this process, the horizontal alignment of rooflines and the continuity between adjacent roof planes were preserved. In the experiments, 3D roof models manually digitized by stereoscopic measurement were used as the ground truth, and the automatically reconstructed 3D roof models were evaluated by comparison with this reference. As a result, the recall values for 2D and 3D roof planes were 0.686 and 0.430, respectively, and increased to 0.723 and 0.455 for roof planes larger than 4 m².

LiDAR-aided neural Scene Representation using low-cost Sensors

Mohamed Negm, Ahmed Elamin, Ahmed El-Rabbany

Toronto Metropolitan University, Canada

Neural scene representations are increasingly explored as alternatives to classical SfM and MVS in civil and architectural mapping, yet their ability to satisfy survey-grade geometric tolerances remains contested. This contribution examines how LiDAR guidance may stabilize NeRF and 3D Gaussian Splatting reconstructions of building façades obtained from low-cost cameras.

Research on Adaptive Feature Band Extraction Technology Based on Fractional Order Differentiation and Machine Learning

Fang Liu, Fei Liu, Xian Guo, Yikang Ren

Beijing university of civil engineering and architecture, China, People's Republic of

The Dunhuang murals, a significant component of China's cultural heritage, are severely threatened by salt-induced deterioration. To address the limitations of traditional invasive detection methods, this study explores a non-destructive approach using hyperspectral remote sensing to monitor mural salinity. Focusing on phosphate content, a key salt damage indicator, we propose a multi-level optimization framework that integrates Fractional Order Differentiation (FOD) for spectral enhancement and various feature selection strategies (including LASSO, SiPLS, SPA, CARS, and Random Frog) to improve prediction accuracy. Partial Least Squares Regression (PLSR) models were constructed using optimized spectral features. Results demonstrate that FOD effectively amplifies subtle spectral responses related to salinity. The model combining 1.9-order FOD spectra with LASSO feature selection achieved the highest performance, with a cross-validated R² of 0.908—a 15.96% improvement over the best model using FOD-transformed spectra alone. This study confirms that integrating FOD with advanced feature selection significantly enhances the precision and reliability of hyperspectral inversion models for mural salt damage, providing a powerful, non-destructive tool for cultural heritage conservation.

Assessing the sensibility of intervisibility on the quality of 3D geometry

Darshan Venkatarayappa, Bruno Vallet, Teng Wu

Univ Gustave Eiffel, G´eodata Paris, IGN, LASTIG, F-77454 Marne-la-Vall´ee, France

This work explores a new evaluation framework for 3D Model Quality Assessment using 3D intervisibility, a critical concept in 3D spatial analysis. In this work we will consider a high-quality LiDAR ground-truth 3D model and lower quality (dense matching and decimated) versions of it. Then we run the same intervisibility analysis on all of them and compare the results. This will allow us to evaluate the impact of geometric quality on intervisibility analysis This analysis is useful for anyone using 3D data for simulations, as it indicates what data quality they actually need to purchase or produce for their specific use case. Ultimately, the goal of this

work is to see how much the quality of the 3D model affects intervisibility results.

Neural Radiance Fields with Physically Based Reflectance for Satellite Images

Lulin Zhang¹, Ewelina Rupnik², Tri Dung Nguyen¹, Stephane Jacquemoud¹, Yann Klinger¹

¹Universite de Paris, Institut de Physique du Globe de Paris, CNRS; ²Univ. Gustave Eiffel, IGN-ENSG, LaSTIG

Recent adaptations of Neural Radiance Fields (NeRF) to remote sensing have shown strong potential for high-fidelity surface reconstruction from multi-view satellite imagery. NeRF represents a scene using multilayer perceptrons and optimizes a volumetric rendering objective to infer geometry and appearance. However, its performance declines sharply with the limited number of satellite viewpoints, and remote sensing imagery violates the simple reflection assumptions of natural scenes. Surface reflectance depends on material properties and illumination geometry, requiring explicit Bidirectional Reflectance Distribution Function (BRDF) modeling. In this work, a physically based NeRF formulation is proposed using the Hapke radiative transfer model, which efficiently describes surface–radiance interactions with a small set of parameters. This physically grounded approach is compared experimentally with empirical BRDF models, demonstrating its potential to enhance the physical realism and interpretability of NeRF reconstructions for Earth observation applications.

Mobile multi-camera system performance for photogrammetric road surface 3D measurements - assessment the effect of driving speed

Matti Tapio Vaaja¹, Markus Sarlin¹, Eino Waldén¹, Petri Rönnholm¹, Hannu Hyyppä¹, Juha Hyyppä², Mikko Vastaranta³, Matti Kurkela¹

¹Department of Built Environment, Aalto University, Finland; ²Department of Photogrammetry and Remote Sensing, Finnish Geospatial Research Institute, National Land Survey of Finland, FI-02150 Espoo, Finland; ³School of Forest Sciences, University of Eastern Finland, Joensuu, 80101, Finland

In this study, we built a mobile multi-camera system and investigated its use for photogrammetric 3D measurement of road surface geometry. More specifically, we tested the effect of driving speed on the quality of the 3D point cloud geometry on road surface. Our conclusion was that, with a five-camera system at speeds of 3-20 km/h, we achieved 3D distance errors of less than 0.5 mm when the data was compared to reference data measured from road surface samples. The results show that the method has great potential for producing sub-millimetre resolution and precision data on road surface damages, road roughness, and other road parameters. The

purpose is to use the system to collect reference data for verifying data from operational mobile laser scanning systems. The system can also be installed on other platforms and applications.

Digital Analysis of Rock Art in Santa Olaya Canyon: Integrating Cultural Landscape and UAV Technologies for Conservation

Fabiola D. Yépez-Rincón^1,3, Glenda N. Requena Lara², Carlos C. Aguilar Treviño³, Jacinto Treviño-Carreón², Ma. Eugenia Calvillo-Villacaña⁴, Pablo A. Cerda-Luque⁴, Juan F. Morales-Pacheco⁵

¹Faculty of Civil Engineering, Universidad Autónoma de Nuevo León, San Nicolás de los Garza, Nuevo León, México; ²Faculty of Engineering and Sciences, Universidad Autónoma de Tamaulipas, Ciudad Victoria, Tamaulipas, México; ³Teebcon Servicios, Ingenierías y Proyectos, SA de CV, Monterrey, Nuevo León, México; ⁴Faculty of Architecture, Design and Urbanism, Universidad Autónoma de Tamaulipas, Tampico, Tamaulipas, México; 5Faculty of Law and Social Sciences Victoria, Universidad Autónoma de Tamaulipas, Ciudad Victoria, Tamaulipas, México;; ⁵Faculty of Law and Social Sciences Victoria, Universidad Autónoma de Tamaulipas, Ciudad Victoria, Tamaulipas, México.

This research work presents the digital documentation of rock art found on a rock face in the Santa Olaya Canyon, in the municipality of Burgos, Tamaulipas. Unlike rock art found in caves, these open-air expressions are actively integrated with the natural and cultural landscape, functioning as symbolic markers of the territory. Through controlled flights with a DJI Mavic Air 2 drone and 3D photoreconstruction techniques, a difficult-to-access vertical surface of a rock face with rock paintings was recorded with high precision. The methodology employed responds to the need for conservation and study of these sites, which lack institutional protection mechanisms from the INAH (National Institute of Archaeology and History) or, as in this case, conservation and cultural research studies. It also contextualizes the value of rock art in Tamaulipas, particularly in the San Carlos and San Nicolás mountain ranges, where some of the most significant collections in northeastern Mexico are found. The application of non-invasive digital technologies is positioned as an effective tool for the documentation, analysis, and dissemination of archaeological heritage, especially in remote and limited-access regions. The generated orthomosaic and point clouds provide the opportunity to create a digital legacy of the area.

LiDAR Point Cloud Classification by 3D Sparse CNN for large-scale Mobile Laser Scanning

Nan Li¹, Florian Pöppl², Andreas Ullrich², Harald Teufelsbauer²

¹RIEGL Research & Defense GmbH; ²RIEGL Laser Measurement Systems GmbH

This work presents a deep learning-based framework for semantic classification of Mobile Laser Scanning (MLS) point clouds using a 3D Sparse Convolutional Neural Network (SparseCNN). The proposed approach addresses challenges specific to MLS data, such as varying point density, high data volume, and diverse urban or highway environments. A two-stage, coarse-to-fine classification pipeline is designed to ensure both scalability and high resolution: the first stage performs scene-wide semantic labeling, while the second refines ground-surface features such as road markings, sidewalks, and curbstones at finer spatial resolution.

To enhance robustness, the model is trained with tailored data augmentations including geometric transformations, density dropout, artificial noise injection, and local patch swapping. In addition to geometric input, radiometric features such as reflectance and echo information are incorporated to improve object differentiation, especially for materials like traffic signs and painted road surfaces.

Two sets of models are trained for different acquisition wavelengths (905 nm and 1550 nm), to account for the impact of laser wavelength on reflectance responses. Classification results on urban and highway scenes demonstrate the effectiveness of the method across a variety of environments and sensor platforms.

MUSF-SSA: Multi-scale Umbrella Feature with Spatial Self-Attention Model for Semantic Segmentation of Point Clouds

Linfu Xie, Rutao Zhang, Tianyi Xu, Weixi Wang, Xiaoming Li, Shengjun Tang, Renzhong Guo

Shenzhen University, People's Republic of China

Semantic segmentation of point clouds, a fundamental task in 3D scene understanding, faces two persistent challenges. First, it is difficult to efficiently extract discriminative features for complex and irregular surfaces; existing methods struggle with the trade-off between simple features, which are insufficient, and complex features, which are computationally expensive. Second, many deep learning models ignore the inherent spatial correlation of point cloud features during the training process, limiting segmentation accuracy. Optimizing the feature representation for complex surfaces while fully leveraging feature correlation is key to advancing segmentation performance.

To tackle these challenges, we propose the Multi-Scale Umbrella Feature model with Spatial Self-Attention (MUSF-SSA). This model introduces a novel Multi-Scale Umbrella Feature (MUSF) to efficiently represent irregular surfaces and integrates a spatial self-attention (SSA) mechanism in its backbone to explicitly learn the spatial correlation between features.

Through these improvements, while maintaining a low parameter count (1.088M), our model achieves 68.6% mIoU, 76.5% mAcc, and 90.4% OA on the S3DIS Area-5 test, a typical indoor point cloud dataset. Compared to the similar method RepSurf-U, this represents a gain of +3.6% mIoU, +4.0% mAcc, and +2.6% OA.

Evaluating the Efficiency of Machine Learning Algorithms in Identifying Geothermal Energy Potential Areas in Akita and Iwate Provinces, Japan

Majid Kiavarz, Mohammadreza Jelokhani Niaraki, Avin Meysami, Yasaman Ghorbani, Najmeh Neysani Samany

University of Tehran

The growing demand for clean and renewable energy sources has intensified the need to identify and exploit geothermal resources as a key solution for sustainable energy development. However, geothermal exploration faces significant challenges including geological complexity, high drilling costs, economic risks, and spatial data limitations. This study evaluates the efficiency of advanced machine learning algorithms, specifically Random Forest and Generative Adversarial Networks (GANs), in identifying geothermal energy potential areas in Akita and Iwate provinces, Japan. Using a limited dataset of 152 geothermal well locations, seven key parameters were analysed: volcanic activity, fault and fracture density, hot springs, surface thermal indices, fumaroles, mud volcanoes, and surface alteration evidence. Data were collected from geological and remote sensing sources and pre-processed for modelling. Results demonstrate that both algorithms effectively identify high-potential areas despite data scarcity. Random Forest achieved 94.08% accuracy in well identification with a C/S(C) index of 10.93, demonstrating robust performance and spatial correlation. The Generative Adversarial Network showed superior performance with 96.71% accuracy and a C/S(C) index of 4.36, indicating exceptional capability in identifying geothermal potential areas and detecting complex spatial patterns. These findings confirm that hybrid approaches combining machine learning and deep learning, particularly GANs, possess high capability for accurate geothermal prospectivity mapping and can effectively overcome limitations posed by data scarcity, providing valuable tools for exploration prioritization and investment decision-making

Theoretical Comparison of Façade Texture Resolution for 3D Building Models Generated from Nadir and Oblique Aerial Imagery

Masato Ishikawa, Tomoaki Inazawa, Yoshihiko Nakanishi, Futa Kawamata, Masahito Takada, Takuya Danjo

Kokusai Kogyo Co., Ltd., Japan

Building models are one of the key features in 3D city models. To realistically represent building exteriors, texture images are often applied to these models. Such textures are important not only for visual appearance but also for practical applications, such as automated generation of higher-Level-of-Detail (LoD) models and various urban simulations. In large-scale urban modeling projects, façade textures are typically obtained through aerial photogrammetry conducted by manned aircraft, primarily due to operational efficiency. In many such surveys, image acquisition is mainly based on nadir-oriented cameras. However, nadir-only imaging inherently limits façade resolution due to viewing geometry. In this study, we compare the façade resolution attainable from nadir and oblique cameras to examine the effectiveness of multi-directional camera systems in producing high-resolution façade textures. A theoretical approach is adopted to estimate the attainable façade resolution under given imaging conditions. A comparative analysis using the camera parameters of UCE M3 (nadir-only) and CM-2 (multi-directional) indicates several advantages of oblique cameras for façade texture generation: (1) significant improvement in the lowest façade resolution compared to nadir photography, (2) more consistent façade resolution across the entire survey area, and (3) limited sensitivity of façade resolution to increased camera station interval. These findings suggest that incorporating oblique cameras into an aerial survey system can contribute to stabilizing and enhancing attainable façade resolution compared to nadir only configurations.

Calibrating large-FOV stereo videogrammetric system using drone and epipolar geometry

Haibo Shi, Xianglei Liu, Runjie Wang

Beijing University of Civil Engineering and Architecture, China

Videogrammetry is widely used in fields such as structural health monitoring, surveillance, and aerospace, where accurate 3D measurements rely on precise calibration of stereo camera systems. Traditional planar target–based calibration provides high accuracy but becomes impractical for large-FOV setups due to the need for large, high-precision targets placed at long working distances. Control-field calibration, which uses spatially distributed artificial targets measured by total stations or GPS-RTK, similarly faces limitations in environments lacking accessible mounting locations. Other existing methods—such as rigid stereo-target calibration, close-range light-spot targets, and active phase targets—offer partial improvements but remain constrained by fabrication complexity, optimization instability, or limited depth-direction accuracy.

To address these challenges, this work proposes a flexible calibration method for large-FOV stereo videogrammetric systems using UAV trajectory imaging and epipolar geometry. A UAV carrying a rigid circular target flies through the measurement volume, while two synchronized cameras record its motion. Target centers are extracted using Circular-MarkNet, intrinsic parameters are obtained using an active-phase target, and scale-free extrinsic parameters are initialized from essential matrix estimation. The metric scale is introduced through static GPS measurements, and all parameters are refined via nonlinear optimization. Validation against a conventional circular-target control field shows that the proposed approach achieves comparable calibration accuracy within a 70–50–10 m volume while avoiding the need for large calibration targets.

A Hybrid Approach using Gaussian Splatting and Parametric Models based on 3D Renders for Real-Time Visualisation

Etienne Sommer, Arnadi Murtiyoso, Mathieu Koehl, Pierre Grussenmeyer

INSA Strasbourg, France

The valorisation and dissemination of built heritage to the public is a crucial objective, complementing conservation efforts. However, traditional 3D models, such as dense meshes, often present limitations for this purpose, proving too heavy and complex for easy sharing and real-time visualisation.

This paper presents a hybrid approach that addresses this challenge by leveraging 3D Gaussian Splatting (3DGS) for the real-time visualisation of complex parametric models. This method is particularly effective for visualising 4D reconstructions representing historical phases of edifices that may no longer exist.

The methodology employs synthetic images generated from the parametric model using 3D rendering software. To ensure compatibility with procedural textures, path-tracing is used , but photorealistic effects such as cast shadows and reflections are deliberately removed. These optimised 3D renders are then processed through a conventional photogrammetric pipeline to generate the necessary camera orientations and sparse point cloud for 3DGS training.

The resulting 3DGS representation enables real-time rendering. This technique successfully converts a model composed of multiple, distinct parametric components into a single, unified object. This approach also demonstrates a strong capability for reconstructing contextual elements, such as vegetation, which are often poorly handled by traditional meshing techniques. The method effectively transforms a complex, software-specific model into a lightweight representation ideal for applications where visualisation speed is essential.

Improving Head Pose Estimation in Radiation Therapy through photogrammetric Techniques for Machine Learning Applications

Cyrill Milkau¹, Sebastian Preußel², Sarah Guy³, Danilo Schneider¹

¹Faculty of Spatial Information, HTW Dresden – University of Applied Sciences, Germany; ²Institute of Photogrammetry and Remote Sensing, Dresden University of Technology, Germany; ³Department of Radiotherapy and Radiation Oncology, Dresden University of Technology, Germany

This study investigates the integration of photogrammetry and machine learning to enhance head pose estimation in radiation therapy. The primary objective is to improve the accuracy of patient positioning, which could reduce the reliance on immobilization masks, thereby enhancing patient comfort. The methodology involves the use of markers and cameras to track head movements, combined with machine learning algorithms to refine pose estimation. By merging deterministic photogrammetric techniques with advanced machine learning models, this approach aims to achieve more precise and reliable head pose estimation. The potential outcomes of this research could lead to more effective and comfortable radiation therapy treatments for patients with head-and-neck cancers.

A Comparative Study of Deep Learning and Unsupervised Segmentation Methods for Individual Tree Delineation from LiDAR point clouds

Jinhong Wang¹, Wei Yao^2,3, Tiangang Yin¹

¹Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University; ²Institute of Urban Environment, Chinese Academy of Sciences, China, People's Republic of; ³School of Engineering and Design, Technical University of Munich, Munich, 80333, Germany

This study aims to conduct a comparative analysis of individual

tree segmentation (ITS) methods for forest LiDAR point clouds.

Traditional ITS approaches have been predominantly based on

unsupervised segmentation algorithms using geometric features.

In recent years, research has progressively shifted toward super-

vised deep learning (DL) techniques. However, the perform-

ance of existing methods across diverse forest types has not yet

been systematically assessed.

On solving exterior orientation of an image with particle swarm optimization

Petri Rönnholm, Matti Kurkela, Matti T. Vaaja, Hannu Hyyppä

Department of Built Environment, Aalto University, Finland

Solving the exterior orientation of images is a fundamental component in photogrammetric mapping and 3D restitution processes. Additionally, it is essential in photogrammetric tasks such as visual odometry, camera-based visual simultaneous localization and mapping, camera calibration, camera-based 3D tracking of movement, and change detection. The aim of this research was to evaluate whether particle swarm optimization is suitable for finding the exterior orientation parameters of a single image using image resection. In addition, we developed a robustified particle swarm optimization by adding an iteratively changing stochastic model to the optimization criteria by attaching a weight matrix with residual vectors. The method was compared to the solution from the least squares method using both simulated ideal and noisy data. Solving the exterior orientation parameters reliably with particle swarm optimization was possible after fine-tuning the algorithm's options. The non-robustified version of particle swarm optimization provided identical results to the non-robustified least squares method. However, in the case of the robustified particle swarm optimization, only 60% of attempts resulted in the same outcome as the corresponding robustified least squares method, with sub-millimeter accuracy. In 40% of cases, the results achieved millimeter accuracy. The sub-millimeter accuracy was achieved in every case with sequential robustified particle swarm optimization, where the algorithm was rerun using stricter bounds for unknown parameters if the evaluation criteria were too large. The implementation of particle swarm optimization is easier than that of the nonlinear least squares method. However, the computation time for particle swarm optimization was significantly longer.

Incremental Semantics-Aided Meshing from LiDAR-Inertial Odometry and RGB Direct Label Transfer

Muhammad Affan, Ville Lehtola, George Vosselman

University of Twente

Geometric high-fidelity mesh reconstruction from LiDAR-inertial scans remains challenging in large, complex indoor environments– such as cultural buildings– where point cloud sparsity, geometric drift, and fixed fusion parameters produce holes, over smoothing, and spurious surfaces at structural boundaries. We propose a modular, incremental RGB + LiDAR pipeline that generates incremental semantics-aided high-quality meshes from indoor scans through scan frame-based direct label transfer. A vision foundation model labels each incoming RGB frame; labels are incrementally projected and fused onto a LiDAR-inertial odometry map; and an incremental semantics-aware Truncated Signed Distance Function (TSDF) fusion step produces the final mesh via marching cubes. This frame-level fusion strategy preserves the geometric fidelity of LiDAR while leveraging rich visual semantics to resolve geometric ambiguities at reconstruction boundaries caused by LiDAR point-cloud sparsity and geometric drift. We demonstrate that semantic guidance improves geometric reconstruction quality; quantitative evaluation is therefore performed using geometric metrics on the Oxford Spires dataset, while results from the NTU VIRAL dataset are analyzed qualitatively. The proposed method outperforms state-of-the-art geometric baselines ImMesh and Voxblox, demonstrating the benefit of semantics-aided fusion for geometric mesh quality. The resulting semantically labelled meshes are of value when reconstructing Universal Scene Description (USD) assets, offering a path from indoor LiDAR scanning to XR and digital modeling.

Evaluation of systematic and random errors in occupancy grid maps

Yuguang Liu¹, Marko Radanovic¹, Krista A. Ehinger², Kourosh Khoshelham¹

¹Department of Infrastructure Engineering, The University of Melbourne, Australia; ²School of Computing and Information Systems, The University of Melbourne, Australia

Map evaluation for occupancy grid mapping (OGM) is critical in the field of high-definition mapping of the road environment for autonomous vehicles. Existing methods cannot adequately evaluate the systematic and random errors that might be present in OGM. This article introduces two evaluation metrics for OGM under LiDAR position uncertainty: Mean Signed Distance (MSD) and Mean Absolute Deviation (MAD). MSD quantifies systematic displacement of occupied cells, while MAD measures random error exhibited as boundary thickening. Unlike classification-based, probabilistic, and geometric metrics, MSD and MAD directly isolate displacement and thickening effects in OGM. We validate both metrics in a controlled synthetic environment and on a real indoor LiDAR dataset, showing better performance than conventional metrics.

Deep learning-based building detection using high-resolution RGBI orthophotos and DSMs

Mohamed Fawzy^1,2, Attila Juhasz¹, Arpad Barsi¹

¹Department of Photogrammetry and Geoinformatics, Faculty of Civil Engineering, Budapest University of Technology and Economics, Műegyetem rkp. 3, H-1111 Budapest, Hungary, {mohamed.fawzy, juhasz.attila, barsi.arpad}@emk.bme.hu; ²Civil Engineering Department, Faculty of Engineering, Qena University, 83523 Qena, Egypt, mohamedfawzy@eng.svu.edu.eg

Deep learning techniques have demonstrated a promising efficacy for building feature extraction, presenting practical strategies to lessen the labour-intensive work of map updating, change detection, and urban growth monitoring. To address the labour-consuming challenges, a U-Net-based convolutional neural network model is developed to generate building maps automatically using high-resolution RGBI orthophoto and DSM data. The approach shows the effectiveness of the U-Net-based semantic segmentation for urban scene analysis. The presented procedures collect, preprocess, and combine orthophoto with DSM in order to train, apply, and assess the U-Net model for building extraction in urban environments using two input scenarios: (1) solely RGBI orthophoto and (2) RGBI orthophoto integrated with DSM. Four standard metrics: completeness, correctness, quality, and overall accuracy are applied to evaluate the model outputs, comparing the single orthophoto input to the combined orthophoto with DSM for building detection. The significant impact of the DSM and RGBI pairing is demonstrated by the heightened reliability of the data integration strategy when estimating buildings within nearby similar objects like roads and impervious surfaces. However, a few challenges related to the model's generalisation are noticed across complex urban contexts, including tree occlusions, unreferenced building extensions, and height irregularities surrounding structures. The findings highlight the potential of multimodal data fusion in urban investigations and reveal how it can improve the mapping of built-up assets. Final results argue that DSM incorporation significantly enhances building classification performance using deep learning frameworks for geospatial applications, particularly in complex urban environments where single data and traditional image-based segmentation methods face limitations.

Simulation of Stationary and Mobile Laser Scanning with VRscan3D

Denys Gorkovchuk¹, Julia Horkovchuk¹, Maria Chizhova², Darius Popovas³, Thomas Luhmann³

¹Kyiv National University of Construction and Architecture; ²Otto-Friedrich Universität Bamberg; ³Institute for Applied Photogrammetry and Geoinformatics

The VRscan3D project introduces a virtual simulation environment for stationary and mobile laser scanning designed to enhance education, research, and AI-based point cloud analysis. Developed using Unreal Engine, the simulator replicates the physical behavior of real terrestrial laser scanners, allowing users to perform realistic scanning operations within immersive 3D environments. The system reproduces manufacturer-specific parameters such as range noise, beam divergence, and intensity, generating synthetic point clouds that closely approximate real data.

VRscan3D enables users to plan and execute virtual scanning campaigns, analyze data quality, and understand the influence of scanning geometry, surface materials, and user behavior. Recent developments include dynamic scene simulation with moving objects, integration of user-imported environments, and support for mobile scanning trajectories—handheld, vehicle-mounted, or UAV-based—reflecting natural oscillations and movement patterns.

In addition to training and education, VRscan3D serves as a generator of synthetic point clouds with known ground truth, facilitating the development and validation of AI algorithms for object detection, segmentation, and classification. Comparative studies between simulated and real scans demonstrate high similarity in terms of accuracy, resolution, and completeness.

By bridging real-world surveying practice and virtual learning, VRscan3D offers a cost-effective, accessible platform for universities and professionals lacking physical equipment or facing mobility restrictions. It represents a new step toward open, immersive, and intelligent learning environments in geospatial education and research.

Symmetry-aware Texture Refinement for 3D Building Models via Massing Decomposition and Generative AI

Fan Xue¹, Yijie Wu², Maosu Li³

¹The University of Hong Kong, Hong Kong S.A.R. (China); ²The Hong Kong Polytechnic University, Hong Kong S.A.R. (China); ³The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, China

Three-dimensional (3D) building models with accurate geometry and realistic textures remain essential for city information modeling and digital twin applications. However, photogrammetric reconstructions consistently suffer from severe texture defects caused by occlusions, shadows, distortions, and projection errors. Existing approaches either rely on rigorous photometric optimization that demands topological correctness and multi-view imagery, or employ flexible AI-driven generation that leverages semantics but often lacks geometric constraints.

This paper presents a novel hybrid framework that exploits architectural regularities—specifically massing decomposition and partial symmetries—to guide high-fidelity texture refinement. We first decompose building meshes into mass-aligned convex volumes using MorphCut. Textures are then reprojected onto these volumes, followed by Building Section Skeletons to pair symmetric facades and establish precise geometric correspondences. Finally, generative AI is applied using symmetry-aware constraints to achieve contextually accurate inpainting and correction.

Pilot studies on three Hong Kong buildings demonstrate robust decomposition, faithful texture transfer, and effective defect mitigation, while revealing current limitations of unconstrained generative models in preserving floor counts and structural regularity. The proposed symmetry-guided pipeline notably advances the reliable and semantically coherent reconstruction of textures for complex urban buildings.

AI-Driven 3D reconstruction and quality assessment for Cultural Heritage: first results from the HERITALISE project

Filiberto Chiabrando¹, Andrea Maria Lingua², Alessio Martino¹, Francesca Matrone², Alessandra Spadaro²

¹Laboratory of Geomatics for Cultural Heritage (LabG4CH), Department of Architecture and Design (DAD), Politecnico di Torino, Viale Pier Andrea Mattioli, 39, Torino (TO), Italy; ²Geomatics Lab, Department of Environment, Land and Infrastructure Engineering (DIATI), Politecnico di Torino, Corso Duca degli Abruzzi, 24, Torino (TO), Italy

The accurate digital documentation of Cultural Heritage (CH) assets demands workflows capable of integrating heterogeneous, multiscale datasets while preserving both geometric fidelity and radiometric completeness. This paper presents the first results of the AI-based processing pipeline developed within the HERITALISE project (Horizon Europe, 2025–2028), applied to three multiscale case studies at the Reggia di Venaria Reale (Turin, Italy): an outdoor-indoor UAV photogrammetric survey, a kinematic SLAM acquisition of a contemporary sculpture garden, and a close-range dataset of an 18th-century decorative artefact. 3D Gaussian Splatting (3DGS) is evaluated as a novel view synthesis method across all three scenarios, demonstrating strong photorealistic rendering capabilities, particularly for complex material properties and geometrically challenging interiors, whilst highlighting current limitations for metric surveying applications. A two-stage crack detection workflow, combining tile-based text-prompted segmentation with SAM3 and multiview ray-based reprojection onto the reconstructed mesh, is validated on UAV imagery, achieving an 84.9% ray–mesh intersection rate. Finally, a standardised evaluation framework is proposed, encompassing adaptive, scale-dependent geometric and radiometric metrics organised into reference-based and no-reference assessment scenarios, aggregated into a transparent synthetic quality score with three adaptive quality classes. The proposed methodology contributes toward a reproducible, sensor-agnostic standard for the assessment of AI-generated CH documentation products.

Haul Road Extraction in Open-Pit Mines via Dual-Encoder RGB–DSM Transformer Fusion

Loghman Moradi, Kamran Esmaeili

University of Toronto, Canada

Haul roads are essential to open-pit mines, acting like the mine’s circulatory system. Keeping accurate, up-to-date maps of these roads is critical for maintenance, safety, and efficient material handling, yet automating this task is challenging. Traditional deep learning models that rely only on RGB images often fail in mining environments, where road surfaces resemble bare earth, dusty terrain, or shadowed areas. To address this, we propose a dual-encoder transformer that combines UAV-captured RGB images with DSM data using stage-wise cross-attention, leveraging both visual and topographic information. Two SegFormer encoders process each data type separately, creating detailed feature representations that are fused at each stage. This allows the model to learn specialized information while sharing knowledge between modalities. A lightweight All-MLP decoder produces the final segmentation map. We tested our method on a high-resolution dataset of 12,000 tiles from the Mildred Lake open-pit mine in Fort McMurray, Canada. Our model achieves 80.8% mIoU, 88.7% F1-score, and 73.7% road accuracy, outperforming an RGB-only baseline by 3.3%, 2.4%, and 7.8 points, respectively. Ablation studies demonstrate that including DSM data consistently improves recall and road detection, especially in areas where RGB information alone is ambiguous or terrain is complex.

Benchmarking Local Registration Algorithms on Multi Temporal and Multi Spatial Point Clouds

Tommaso Mainiero, Jad Ghantous, Nives Grasso, Vincenzo Di Pietra

Department of Environment, Land and Infrastructure Engineering , Politecnico di Torino, Italy

This study presents a systematic benchmarking framework to evaluate the performance of local point cloud registration algorithms and their impact on geomorphological change detection. Three widely used methods—Iterative Closest Point (ICP), Point-to-Plane ICP, and Generalized ICP (GICP)—were tested across two alpine case studies in Italy (Rio Cucco catchment and Belvedere Glacier), considering different surface types and initial alignment conditions.

Three local registration methods—Iterative Closest Point (ICP), Generalized ICP (GICP), and Point-to-Plane ICP—were tested under varying initial alignment and terrain conditions using standardized voxelized patches (0.3 m). Performance was evaluated through median distance, cloud-to-cloud mean distance, and computation time metrics.

Results highlight the strong influence of surface morphology on algorithmic stability: rocky areas ensure reliable convergence, while dense vegetation introduces ambiguity and reduced accuracy. GICP provided the best compromise between robustness and efficiency.

The study further highlights that integrating robust outlier rejection significantly improves statistical consistency and reduces LoD95. The proposed approach provides a reproducible framework for optimizing co-registration strategies and improving the accuracy of geomorphological monitoring in high-relief environments.

Human Trajectory Prediction on UAV Images: A Comparative Study

Rafael D. M. da Hora¹, Daniel R. Santos¹, Maurício C. M. Paulo¹, Felipe Ferrari¹, Raul Q. Feitosa², Paulo F. F. Rosa¹

¹Military Institute of Engineering, Brazil; ²Pontifical Catholic University, Brazil

Video human trajectory prediction is a fundamental research task for many applications in civil and defense. Compared to trajectory prediction based on a single frame, human trajectory prediction in videos, especially in the context of unmanned airborne vehicles (UAVs) platforms, is a challenge due to the time series prediction analyses required. As frames in a video streaming are highly correlated, trajectory detection in UAV images is affected by particular factors such as oblique camera views and the platform motion. This study aims to identify the most robust and accurate deep learning model in the context of UAVs videos by comparing three distinct categories: classical machine learning, established deep learning architectures, and computationally efficient models based on Multi-layer Perceptrons (MLPs). We propose an analysis based on only bounding box center coordinates instead of image scenes. The results show that a simple linear architecture provided the best performance, highlighting the importance of these mechanisms in predicting human motion from trajectory data alone.

Multi-technique approach for 3D documentation of rock walls in narrow gorges

Antonio Tomás Mozas-Calvache, José Luis Pérez-García, José Miguel Gómez-López, Diego Vico-García, Jorge Delgado-García

University of Jaén, Spain

This study presents a robust multi-technique methodology for generating complete, high-accuracy 3D documentation of highly constrained natural heritage sites, addressing the limitations of single-technique geomatic approaches. The research focuses on two challenging gorge environments in Southern Spain: Los Cañones de Río Frío and El Caminito del Rey. Both sites feature extreme vertical walls (up to 300 meters and narrow passages that complicate GNSS-RTK positioning and render individual UAV, TLS, or terrestrial photogrammetry techniques unfeasible due to occlusions and safety/logistical constraints. The proposed workflow centers on data fusion, leveraging LiDAR data for core geometry and photogrammetry for texture and gap-filling. Data acquisition integrated multiple sensors, including UAV LiDAR/Photogrammetry, Terrestrial Laser Scanning (TLS), Mobile Mapping Systems (MMS), and Spherical Photogrammetry (SP). A key methodological innovation involves deriving second-order Ground Control Points (GCPs) from UAV photogrammetry to georeference other data in areas with poor satellite coverage, significantly reducing fieldwork while maintaining accuracy. The highly precise TLS point cloud was used as the geometric base for the final model. The resulting products—including high-density point clouds and 2 cm orthoimages and 3D models—demonstrate comprehensive coverage and high accuracy (about 4 cm for georeferenced data), enabling 2.5D rockfall simulation and establishing a foundation for a Digital Twin of both gorges.

Augmented and Mixed Reality Scene Alignment Through 3D-to-3D Learning-Based Cross-Source Point Cloud Registration

Juan Sebastian Sardi Barzallo¹, Volker Coors²

¹Stuttgart,Technical University of Applied Sciences; ²Stuttgart,Technical University of Applied Sciences

With the fast development of reality capture technology and the increasing availability and accessibility to devices capable of capturing 3D point clouds, a wide range of applications where cross-source Point Cloud Data (PCD) data interact appears to be more frequent. Augmented and Mixed Reality (AR/MR) technologies are pivotal for the integration between digital and physical environments by overlaying Digital Twin (DT) models into real contexts, and show themselves as capable of producing real-time 3D point cloud data. Nevertheless, the integration of AR/MR real-time 3D point cloud data with others such as LiDAR data still an open field for research specially at fundamental tasks such as scene alignment and camera localization. Conventional vision-based methods are vulnerable to environmental variations making achieving suitable camera localization and scene alignment challenging. Conventional vision-based methods are vulnerable to environmental variations, making achieving suitable camera localization and scene alignment challenging. This work proposes an exclusively 3D-o-3D-based methodology for AR/MR scene align alignment and camera localization addressing the challenges of cross- source point cloud registration in large size disparity scenarios. By combining cross-source point cloud registration via Voxel Representation and Hierarchical Correspondence Filtering (VRHCF) learning-based method TEASER++ algorithm, our approach effectively manages asymmetric heterogeneous point cloud data, achieving promising registration results especially in extensive indoor settings. The qualitative results suggest improvements over existing studies, despite outlier challenges in outdoor environments that warrant further research. This study highlights the potential and the essential need for advanced methodologies to enable seamless interactions between digital and physical worlds.

Semantic-Guided High-Fidelity Indoor Scene Reconstruction Based on 3D Gaussian Splatting

Mingyue Dong¹, Xianwei Zheng¹, Jiansi Yang¹, Linwei Yue², Jianya Gong¹

¹Wuhan University; ²China University of Geosciences

Indoor 3D scene reconstruction is essential for digital twins and intelligent spatial applications but remains challenging due to severe occlusions, weak textures, and complex geometric structures. This paper presents a semantic-guided high-fidelity indoor reconstruction framework based on 3D Gaussian Splatting (3DGS), which achieves high-precision geometry and photorealistic rendering through semantic-aware optimization. First, a high-quality geometric prior generation scheme is developed by integrating a 2D depth prediction network to enhance noisy depth data captured by mobile devices. The refined depth maps are processed by computing spatial gradients to derive surface normals in world coordinates, providing geometric supervision for the position and orientation of Gaussian ellipsoids. A projection-error-based filtering mechanism ensures consistency across multiple views. Second, a semantic-guided differentiated reconstruction framework is introduced. Using a pretrained segmentation model (SAM), the method distinguishes between large weak-texture areas and fine-detail regions. Normal regularization improves surface smoothness in planar regions, while detail-aware weighting strengthens local geometric fidelity. Additionally, a multi-view semantic consistency strategy jointly optimizes color and geometry across viewpoints, enhancing global coherence and reducing overfitting. Experiments on ScanNet++ and Mushroom datasets demonstrate that the proposed method surpasses state-of-the-art baselines in rendering quality and geometric accuracy. It effectively reconstructs continuous surfaces and detailed structures, showing strong potential for applications in virtual reality, digital twins, and real-time indoor modeling.

Enhanced DUSt3R for Underwater 3D Reconstruction in Shallow Water Environments

Tsuyoshi Shimano, Takashi Fuse

The University of Tokyo, Japan

Shallow-water environments present significant challenges for underwater photogrammetry due to light caustics and the combined effects of absorption and scattering caused by water turbidty. These optical disturbances degrade image quality, disrupt feature matching, and ultimately reduce the reliability of 3D reconstruction using traditional SfM (Structure from Motion) pipeline. In this study, we focus on these two dominant factors and investigate a 3D reconstruction framework inspired by recent feed-forward architectures such as DUSt3R (Dense and Unconstrained Stereo 3D Reconstruction). To support this approach, we develop a synthetic data generation pipeline capable of simulating shallow-water visual conditions. Preliminary experiments indicate a possible trend for integrating physics-aware image formation with DUSt3R-type feed-forward reconstruction. However, several limitations remain: the current model does not yet achieve stable accuracy, real-world underwater validation has not been conducted, and computation costs remain high due to complex training procedures. Future work will focus on refining the network architecture, exploring DUSt3R-derived multi-view and high-fidelity extensions, accelerating computation, and validating the pipeline in real

shallow-water environments. Additionally, integrating advanced rendering techniques may further improvethe refinement of 3D reconstruction.

Evaluating SfM Techniques for DEM Production from VHR Satellite Imagery in Urban Contexts

Gabriele Lo Grasso, Valentina A. Girelli, Emanuele Mandanici

Alma Mater Studiorum - University of Bologna, Italy

Digital Surface Models (DSMs) provide the fundamental elevation data required for generating 3D city models, which support a wide range of analyses such as solar potential estimation, urban heat island assessment, and infrastructure monitoring.

Advances in very high-resolution satellite stereo imaging, airborne LiDAR, and aerial photogrammetry have made it possible to generate DSMs at fine spatial resolution using different acquisition geometries and multi-view reconstruction techniques. However, these data sources differ substantially in terms of spatial resolution, viewing geometry, and surface visibility, leading to variations in elevation accuracy and morphological completeness. Airborne LiDAR surveys can provide highly detailed and accurate three-dimensional point clouds compared to aerial photogrammetry, but are associated with high acquisition and processing costs, as well as logistical constraints.

This study presents a comparative analysis of the DSMs derived from WV-3 panchromatic stereo imagery and oblique aerial photographs processed with the Structure-from-Motion (SfM) approach, focusing on the capability of SfM to reconstruct the complex urban morphology. The study area, a district of the city of Bologna, is characterized by a heterogeneous urban texture including compact mid-rise residential blocks, industrial facilities, vegetated zones, and open spaces, making it an ideal test site for comparing elevation models derived from different sensors and acquisition geometries.

Canopy Entropy Sensitivity Analysis for Scalable Canopy Structural Complexity Estimation

Bin Wang, Yuqi Lei, Zheng Xu, Wen Xiao

China University of Geosciences（Wuhan）, China, People's Republic of

Canopy Entropy (CE) quantifies 3-D forest heterogeneity from LiDAR, but its reliability depends on point density and kernel bandwidth. Using 11 sub-sampled airborne datasets (12–240 pts m⁻²) and bandwidths 0.1–2 m over a 20 ha Jiangxi plot, we show CE is stable (CV < 0.6 %) above 72 pts m⁻², whereas below 50 pts m⁻² it falsely inflates (> +5 %). CE grows logarithmically with bandwidth, saturating beyond 1 m; 0.2 m is optimal at landscape scale. Maintain ≥ 50 pts m⁻² and h ≈ 0.2 m for unbiased canopy-complexity mapping. An Investigation of the Application of GCE for Comparing Cross-Scale Structural Complexity Using Simulated Datasets.

High-Precision Point Cloud Registration Method Based on Planar and Linear Features

Chenxin Yang, Kazuha Kumazawa, Saki Komoriya, Hiroshi Masuda

The University of Electro-Communications, Japan

Accurate registration of point clouds obtained from different viewpoints is essential for constructing consistent and reliable 3D models. Terrestrial laser scanner (TLS) data are typically represented in local coordinate systems centered at individual scanner positions, requiring transformation into a common reference frame. However, achieving high-accuracy registration for large-scale datasets remains challenging. Even small rotational errors in rigid transformations can result in significant positional deviations over long distances. Conventional registration methods, such as the Iterative Closest Point (ICP) algorithm, perform well in dense regions but often produce misalignments in sparse or geometrically uniform areas.

This study presents a high-precision point cloud registration approach that integrates global geometric features—such as planes and lines—with local point-based constraints. Plane and line features are extracted using RANSAC-based detection and incorporated into an enhanced ICP framework, improving both stability and convergence in large-scale environments.

Experimental evaluations using real TLS datasets acquired from an industrial factory demonstrate that the proposed hybrid ICP method significantly outperforms conventional approaches. The integration of global geometric features effectively reduces local misalignments and improves registration accuracy, particularly in regions with uneven point density or limited structural variation.

RTK-Guided Gaussian Splatting Pipeline for Georeferenced Urban 3D Reconstruction

Cheolhwan Kim¹, Wonjun Choi¹, Youngmok Kwon¹, Jungho Lee², Minhyeok Lee², Hong-Gyoo Sohn¹

¹Dept. of Civil and Environmental Engineering, Yonsei University, Seoul, Republic of Korea; ²Dept. of Electrical and Electronic Engineering, Yonsei University, Seoul, Republic of Korea

Automated 3D reconstruction technologies utilizing multi-source spatial data have gained significant attention in recent years. While conventional approaches rely on registration-based multi-sensor integration, recent Gaussian Splatting techniques have shown strong potential for large-scale modeling using only monocular imagery. However, existing 3DGS frameworks operate in relative coordinate systems and lack alignment with absolute geospatial references, limiting their applicability for real-world mapping.

To address these challenges, we propose a georeferenced Gaussian Splatting framework that integrates RTK-GPS camera position measurements directly into the training process. Initial camera parameters and sparse point clouds are estimated using an image-based SfM pipeline and subsequently aligned to a global coordinate frame through a similarity transformation based on RTK-GPS measurements acquired alongside the imagery. During coarse GS training, per-camera translation and rotation corrections are jointly optimized to compensate for geometric errors introduced during global frame alignment. The translation updates are guided toward RTK-GPS-measured positions, while a reprojection constraint based on SfM sparse 3D observations preserves the multi-view geometric consistency established by bundle adjustment.

The proposed method generates 3DGS outputs aligned with an absolute coordinate system with only marginal degradation in rendering metrics such as PSNR, SSIM, and LPIPS. Mesh conversion and surface-distance comparison with laser scanning data further validate the reliability of the reconstructed geometry. This work demonstrates the feasibility of real-world georeferenced modeling using Gaussian Splatting-based scene representation.

Shape Reconstruction from Large Scale Point Clouds Using Planar Adjacency Relations

Yusuke Nagasawa, Hiroshi Masuda

The university of Electro Communication, Japan

Digital twins of production facilities, represented as 3D virtual environments generated from point cloud data, are increasingly demanded for efficient facility management. Although terrestrial laser scanners (TLS) enable high-density 3D acquisition of such environments, the resulting point clouds are extremely large in data size. In practical applications, lightweight mesh models are therefore required as a substitute for raw point cloud data. However, TLS measurements often contain occlusions and missing regions, making it challenging to reconstruct complete mesh models directly from incomplete point clouds. Many objects installed in production facilities, such as equipment platforms, fences, columns, and ladders, consist mainly of planar surfaces. Efficient plane detection methods have been developed for large-scale point clouds (Masuda, 2015; Takeda, 2024). For objects composed of planes, 3D models can be reconstructed from the detected planes. However, industrial point clouds are extremely large, including many densely sampled planar regions. Furthermore, many existing methods focus on standard components with fixed shapes, such as pipe structures, and are not applicable to objects with more flexible geometries. To overcome these limitations, this study first converts the detected planar regions into simplified mesh representations to reduce data volume. We then construct a planar adjacency graph that preserves spatial relationships and geometric attributes between planes. Finally, we reconstruct the target structure by identifying and assembling appropriate subsets of planes.

In-situ LiDAR-assisted backpack camera system calibration for forest mapping

Raja Manish, Songlin Fei, Ayman Habib

Purdue University, United States of America

Backpack mapping systems equipped with LiDAR sensors and RGB cameras, and an optional GNSS/INS direct georeferencing unit, are increasingly used in forest inventory applications. A key prerequisite to deriving accurate mapping products from these platforms is system calibration to establish the mounting parameters relating the LiDAR and camera sensors to the IMU body frame of the GNSS/INS unit. Conventional system calibration procedures entail specific trajectory and target deployment at the calibration site, followed by a labor-intensive identification of targets in imagery and LiDAR point cloud. Given the significance of multi-modal data alignment for forest inventory, this study explores an alternative approach for camera–LiDAR system calibration.

Bundle Adjustment for Satellite Attitude Jitter

Shun Zhou, Hongbo Pan

Central South University, China, People's Republic of

To address the limitations of existing RFM bias-compensation methods, which difficult to handle complex attitude jitter and lack fully automated processing, this study introduces an innovative Bundle Adjustment (BA) approach that incorporates adaptively determined spline smoothing parameters. The method constrains the smoothing term of the spline using prior matching accuracy and enables the adaptive estimation of the smoothing parameter within the BA process. Because the procedure requires no manual intervention and the adaptive smoothing term retains reasonable physical interpretation, the proposed approach is broadly applicable to the correction of attitude jitter in linear pushbroom satellite systems.

A Comparative Study of MVS and NeRF Approaches for Dense 3D Reconstruction of Mediterranean Coral

Paolo Rossi¹, Riccardo Roncella¹, Cristina Castagnetti²

¹University of Parma, Department of Engineering and Architecture, 43124, Parma, Italy; ²University of Modena and Reggio Emilia, Department of Engineering, 41125, Modena, Italy

This work investigates the potential of optimizing underwater image acquisition while preserving reconstruction quality. A comparative evaluation of Multi-View Stereo (MVS) and Neural Radiance Fields (NeRF) is conducted, focusing on their performance in terms of completeness and robustness under conditions of reduced image availability. The study concentrates on underwater scenes involving Mediterranean coral species, where traditional photogrammetric methods often encounter difficulties due to occlusions and low-texture surfaces. The analysis is based on datasets acquired under controlled conditions, allowing for a direct comparison of the dense reconstruction capabilities of both approaches. The impact of decreasing the number of input images on reconstruction completeness and model accuracy is assessed, with results benchmarked against a reference dataset obtained using a triangulation laser scanner.

A progressive framework for 3D scene understanding from multi-view satellite imagery

Xuejun Huang¹, Yi Wan^1,2, Xinyi Liu^1,2, Yongxiang Yao¹, Dong Wei¹, Yongjun Zhang^1,2

¹School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, 430079, Hubei, China; ²Technology Innovation Center for Collaborative Applications of Natural Resources Data in GBA, Ministry of Natural Resources, Guangzhou, 510075, Guangdong, China

3D scene understanding is critical for applications like smart city management and urban planning. However, existing methods often treat 2D semantic understanding and 3D reconstruction as independent tasks, limiting the ability to create a unified 3D semantic representation. This separation hinders the accuracy, interpretability, and scalability of large-scale 3D scene understanding.

In this work, we propose a progressive, three-stage pipeline that seamlessly connects multi-view semantic understanding, self-supervised 3D reconstruction, and end-to-end semantic-level scene understanding. The approach gradually integrates semantic and geometric cues—first establishing reliable semantic priors, then recovering scene geometry without height supervision, and ultimately combining both into a unified 3D representation for more accurate scene understanding.

Beyond geometry: Reflectance-calibrated 3d Gaussians using LiDAR and imagery for photometrically robust Reconstruction

Yaoyu Li¹, Dedong Zhang^2,3

¹Hinton STAI Institute, East China Normal University, Minhang, Shanghai 200241, China; ²Department of Geography and Environmental Management, University of Waterloo, 200 University Avenue West, Waterloo, Ontario N2L 3G1, Canada; ³TianfuJiangxi Laboratory, Chengdu, Sichuan, 641419, China

This paper introduces LIG-3DGS, a novel framework for robust 3D reconstruction and novel view synthesis under conditions where standard image-based methods struggle. The core of our approach lies in the deep integration of LiDAR geometry and intensity information with a 3D Gaussian Splatting (3DGS) representation. Our qualitative and quantitative experiments demonstrate that LIG-3DGS significantly outperforms standard 3DGS and geometry-only baseline methods under challenging photometric conditions. By bridging the geometric precision of active sensing with the high-fidelity rendering of neural approaches, this work opens a promising pathway toward all-weather, high-fidelity 3D scene understanding.

Non-destructive extraction of vertical leaf base and inclination angles distribution in field maize

Lei Lei^1,2, Zhenhong Li^1,2, Guijun Yang^1,2,3, Hao Yang³

¹Key Laboratory of Loess, Xi’an 710054, China; ²College of Geological Engineering and Geomatics, Chang'an University, Xi’an 710054, China; ³Information Technology Research Centre, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China

Distributions of leaf base and inclination angles are important crop phenotypic traits, influencing light interception and productivity. LiDAR provides unprecedented detail of the 3D structure of the crop canopy. Recent research mainly focuses on the leaf base and inclination angles of maize at the individual level or at lower planting density. It is difficult to extract the distributions of leaf base and inclination angles of maize in the field due to the interlocked and overlapped nature of leaves. In this study, we have proposed a high-throughput method to extract the distributions of leaf base and inclination angles of maize in the field. Following the separation of the leaf and stem of maize, hollow cylinders with different thicknesses were used to extract the local leaf points from the separated leaf points based on each stem fitted line, and the DBSCAN algorithm and singular value decomposition were used to calculate the leaf base and inclination angles. The distributions of leaf base and inclination angles of maize in the field with different cultivars, planting densities, and growth stages were extracted and analyzed, and these performed well against the validation data. The high-throughput extraction of these distributions in maize fields holds significant importance for studying the optimal maize cultivar in conjunction with radiative transfer models.

Extraction of CCTV Surveillance Coverage Based on UAV Mesh and CCTV Image

Wonjun Choi¹, Youngmok Kwon², Cheolhwan Kim³, Hong-Gyoo Sohn⁴

¹Dept. of Civil and Environmental Engineering, Yonsei University, Seoul, Republic of Korea; ²Dept. of Civil and Environmental Engineering, Yonsei University, Seoul, Republic of Korea; ³Dept. of Civil and Environmental Engineering, Yonsei University, Seoul, Republic of Korea; ⁴Dept. of Civil and Environmental Engineering, Yonsei University, Seoul, Republic of Korea

This study presents a geometric framework for recovering missing CCTV camera parameters and deriving reliable three-dimensional viewshed coverage by matching UAV-based 3D mesh models with real CCTV imagery. Most CCTV metadata only contains approximate latitude and longitude, while essential calibration parameters such as azimuth, tilt angle, focal length, and field of view are unavailable. Without these parameters, visibility analysis in urban environments becomes inaccurate due to unaccounted building occlusions. To address this, a coarse-to-fine pipeline is proposed. In the coarse stage, camera tilt is estimated from the CCTV image using a monocular surface normal estimation model, and camera yaw is determined by matching cylindrical panoramic renderings of the mesh against the CCTV image using a dense feature matching network. In the fine stage, perspective projection images are rendered at 1 m height intervals using the estimated orientation, and each candidate is matched against the CCTV image to identify the optimal camera height. The rendering process simultaneously records world coordinates for every visible pixel, enabling direct extraction of 3D-2D ground control point correspondences from the best-matched candidate. Outlier correspondences are removed through Fundamental Matrix RANSAC, and spatially distributed representative points are selected via agglomerative clustering. Camera parameters are then estimated using an improved Perspective Projection Model with rotation matrix orthogonality constraints and weighted least squares adjustment. The recovered parameters are used to generate three-dimensional viewshed polygons. The method was tested on 41 CCTV cameras on a university campus and validated using reprojection error and ground-truth camera positions.

Volume estimation and accuracy assessment of unauthorised material deposits using airborne photogrammetry and laser scanning for environmental inspection

Markéta Potůčková¹, Alex Šrollerů¹, Martin Marko², Eva Štefanová¹

¹Charles University, Faculty of Science, Department of Applied Geoinformatics and Cartography, Albertov 6, Prague 2, Czechia; ²Czech Environmental Inspectorate, Na Břehu 267/1a, Prague 9, Czechia

Determining the volume of unauthorised stockpiles or material deposits is a common task for environmental inspection authorities. Although UAV photogrammetry and laser scanning are widely adopted in many fields today, their use within environmental inspectorate practice may still be limited in some countries. In addition, archives of aerial imagery and laser scanning data maintained by national mapping agencies offer valuable resources for retrospective analyses of terrain changes caused by unauthorised material deposits; however, their potential has not yet been fully realised.

The objectives of this study are to: (1) present a comparative analysis of UAV photogrammetry and laser scanning in relation to terrestrial GNSS measurements for determining the volume of larger stockpiles and apply a model for volume accuracy assessment; and (2) demonstrate both the potential and limitations of using archived aerial imagery and laser scanning data for retrospective terrain-change analysis, with a focus on estimating the thickness and volume of deposits and their accuracy.

Both objectives stem from the current need for environmental inspectorate. Volume estimation can be highly sensitive because of associated penalties; therefore, understanding the accuracy and limitations of the applied methods is crucial. When time constraints are not an issue and dense vegetation poses a challenge (including grass cover that cannot be penetrated by laser signals), terrestrial GNSS or traditional surveying remain the most reliable options. Nevertheless, airborne photogrammetry and laser scanning offer undeniable advantages in terms of operability and retrospective analysis.

Improved ICP Algorithm Constrained by Intensity Gradient for Urban Airborne Array InSAR Point Cloud Registration

Lijun Lu¹, Fangfang Ji², Shucheng Yang¹, Chunquan Cheng¹, Hanchao Zhang¹

¹State Key Laboratory of Spatial Datum, Chinese Academy of Surveying and Mapping; ²Capital Normal University,China, People's Republic of

Airborne array InSAR achieves high-precision three-dimensional reconstruction through multi-baseline interferometric height measurement, holding significant application value in urban spatial structure monitoring and surface deformation analysis. However, the acquired urban InSAR point clouds are often affected by multiple factors, including platform attitude errors, system calibration inaccuracies, and multi-angle imaging geometric discrepancies, leading to noticeable spatial biases among different datasets. To achieve geometric consistency across multi-baseline data, high-accuracy point cloud registration has become a crucial step in InSAR data fusion processing. Therefore, the research proposed an improved ICP Algorithm Constrained by Intensity Gradient for Urban Airborne Array InSAR Point Cloud Registration. The improved ICP algorithm constrained by intensity gradients, which integrates geometric and electromagnetic scattering features. Experimental results demonstrate that the proposed method exhibits superior robustness and registration performance in complex urban scattering environments, providing effective technical support for 3D reconstruction of SAR point clouds.

AiDroneTree: A Novel AI Deep Learning Based Network for Individual Tree Detection Using UAV-Derived Point Cloud in Dense Urban and Forest Landscapes

Sina Jarahizadeh, Bahram Salehi

State University of New York College of Environmental Science and Forestry, Department of Environmental Resources Engineering, 1 Forestry Dr., Syracuse, NY 13210 USA

Individual Tree Detection (ITD) is a primary step for estimating tree attributes such as spatial distribution, geometry, and species used in forest management, urban planning, and carbon accounting. While traditional field-based inventories are accurate, they are costly, labour-intensive, and limited in coverage. High-resolution UAV LiDAR offers a scalable alternative, and Deep learning (DL)-based object detection methods further enable automated ITD at large scales. In contrast to RGB imagery, UAV LiDAR can be transformed into multi-band representations that capture rich structural and textural information, which enhances ITD performance. However, previous methods still confront challenges presented by complex forest conditions, including overlapping crowns, and computational inefficiency when processing high-resolution, multi-band data. We propose AiDroneTree: a novel one-stage DL object-detection framework for multi-band rasterized UAV LiDAR, empowering more accurate and efficient tree detection in dense and heterogeneous forests to address this issue. The AiDroneTree architecture detects and segments the individual trees by combining a custom-built backbone and head optimized for detecting small trees in complex canopy environments with integration of Convolutional Blocks with Concatenate (CBC), LeakyReLU activations, and tunable layers throughout to detect bounding boxes and confidence scores for each tree. The results have been evaluated against YOLO on datasets captured from various environments with different tree shapes, sizes, and densities. The quantitative and qualitative results show that AiDroneTree outperforms YOLO in various forest conditions and achieves 91% accuracy, 93% precision, and 92% recall and F1-score.

Integrated MBES-based Assessment of Dam Tailrace Structure and Geomorphology

Sehoon Oh, Jiwan Hong, Geonu Park, Daegeon Woo, Joon Heo

Yonsei University, Korea, Republic of (South Korea)

The dam tailrace is a critical zone for dam safety, as high-energy spillway flows can deteriorate concrete slabs and drive scour along the downstream riverbed. However, this zone is difficult to access, and structural and geomorphic conditions are often assessed independently, limiting integrated understanding of their coupled behavior. Multibeam echo sounding (MBES) helps close this gap by providing high-resolution underwater topography and enabling simultaneous mapping of engineered concrete surfaces and erodible beds within a single survey. When deployed on unmanned surface vehicle (USV) platforms, MBES allows safe and efficient bathymetric mapping in narrow or high-energy downstream channels, supporting more complete characterization of tailrace conditions.

In this study, a USV-mounted MBES was used to acquire high-density underwater measurements across the tailrace of Daecheong Dam, capturing both the concrete stilling basin and the downstream alluvial bed. The resulting point cloud was segmented into two functional zones: (1) the concrete slab zone, where planar-deviation metrics quantified slab misalignment, elevation offsets, and localized deformations; and (2) the downstream zone, where terrain-based depression analysis delineated scour features and characterized their depth, extent, and morphology. By relating structural anomalies observed along the slab surface to the spatial distribution and severity of downstream scour, we perform a coupled slab–scour assessment that links block-level distress to localized erosion patterns near the apron-end transition.

This integrated approach demonstrates how MBES, combined with geospatial analysis, can support comprehensive underwater inspection and contribute to improved operational monitoring and hazard mitigation for large hydraulic structures.

High-detail 3D surveying and digital restoration of historical xylographic stamps: The Ulisse Aldrovandi case

Anna Forte, Maria Alessandra Tini, Valentina Girelli, Gabriele Bitelli, Luca Vittuari

University of Bologna, Dept. of Civil, Chemical, Environmental and Materials Engineering DICAM, Bologna, Italy

This contribution presents a digital workflow for the virtual restoration and functional recovery of a historic xylographic matrix created by the 16th-century naturalist Ulisse Aldrovandi and preserved at Palazzo Poggi, University of Bologna. Although not physically broken, the pearwood block had undergone subtle yet significant geometric deformation over the centuries, preventing it from producing a complete and accurate print.

The project employed high-resolution structured-light scanning to generate a detailed 3D model of the engraved surface, capturing its geometry with sub-millimetric accuracy. From the resulting 31-million-polygon mesh, approximately 7000 points corresponding to the peaks of the engravings were manually extracted and interpolated to model the deformation. A corrective digital transformation was then applied directly to the mesh vertices, restoring the planarity originally required for printing without altering the object itself.

This case study demonstrates the potential of integrating high-resolution 3D surveying and digital modelling to address subtle geometric deterioration in historical artefacts. The method offers a fully non-invasive and reversible approach that can be extended to other wooden matrices or similarly sensitive cultural heritage objects. Future work includes testing additional surveying techniques and evaluating the reproducibility of the proposed workflow across a wider set of materials and conditions.

Multi-class deterioration detection using data-centric approach from UAV-based bridge inspection applications

Ya-Li Lin¹, Jiann-Yeou Rau¹, Chao-Hung Lin¹, Wei-Shen Lai², Chih-Chao Hu²

¹National Cheng Kung University, Chinese Taipei; ²Institute of Transportation, Ministry of Transportation and Communications, Chinese Taipei

Modern AI applications increasingly rely on visual data for perception and decision-making, yet their reliability is fundamentally constrained by data quality and representativeness. Bridge inspection exemplifies this challenge: UAV imagery of bridge surfaces often exhibits complex textures, overlapping deterioration types, and severe class imbalance, limiting the performance of conventional deep models. To address these issues, this study proposes a data-centric approach within an integrated UAV-based bridge inspection framework. High-resolution UAV images are processed through photogrammetric calibration using Structure-from-Motion (SfM) and bundle adjustment, while a Swin-Unet segmentation model is trained with a data-centric sampling strategy that evaluates image patches through coverage, boundary, texture, and edge-entropy indicators to select representative samples. Experiments demonstrate that the proposed method achieves substantial improvements in mean IoU and F1-score compared with random cropping. The resulting multi-class deterioration maps are spatially integrated with 3D bridge models, forming a foundation for digital-twin-based inspection and confirming the effectiveness of data-centric optimization in enhancing the robustness of AI-driven infrastructure assessment.

DamViT: Vision Transformer–Based Robust Segmentation and 3D Mapping of Concrete Dam Damage from UAV Imagery

Jiwan Hong, Sehoon Oh, Joon Heo

Yonsei University, Korea, Republic of (South Korea)

Concrete dams require regular inspection because surface cracking and spalling can threaten durability and safety, yet UAV images of dam faces are often affected by low-light, blur, over-exposure, and stain-like discoloration that confuse automated crack segmentation. This contribution presents DamViT, a Vision Transformer–based framework for robust pixel-wise segmentation and 3D mapping of damage on concrete dams. UAV RGB images are annotated into three classes (background, crack, spalling) and used to train a SegFormer-based network equipped with two lightweight components: a degradation-aware module that estimates a per-pixel degradation map and guides feature extraction under low-quality imaging, and a stain-aware training strategy that explicitly balances stain-rich non-damage patches with damaged regions to reduce false positives on surface stains. The resulting three-class masks are back-projected onto a photogrammetrically reconstructed 3D dam mesh using camera poses and intrinsics, enabling computation of crack length, spalling area, and their spatial distribution in the structural coordinate system. The proposed pipeline links UAV imaging, robust segmentation, and quantitative 3D damage mapping to support dam safety management.

An end-to-end pipeline for 3D building modeling, texturing, and semantic integration from uav data

HyunSoo Kim¹, DinhMinh Bui¹, Ji Sang Park², Jun Su Kim³, Changjae Kim¹

¹Dept. of Civil and Enviromental Engineering, college of Engineering, MyongJi University, Republic of Korea; ²Principal Researcher, Mobility and Navigation Research Section, Electronics and Telecommunication Research Institute , Daejeon, Republic of Korea; ³AI Technology Team, Geostory Co., Republic of Korea

This study proposes an end-to-end automated pipeline for the generation, texturing, and semantic enhancement of 3D building models using UAV-based multi-source data, including imagery, image-derived point clouds, and orthophotos. The pipeline consists of three sequential stages: automatic 3D modeling, post-processing and texturing, and semantic integration. In the first stage, building candidates are automatically extracted from UAV-derived point clouds and orthophotos to generate geometric 3D models. The second stage refines the geometry through manual correction and applies texture mapping using UAV imagery and camera orientation parameters to enhance visual realism. In the third stage, façade images derived from building textures are processed through learning-based operators to detect semantic components such as windows. The detected 2D semantic information is converted into 3D coordinates and integrated into the textured 3D models, forming CityGML-like hierarchical structures within a .json framework. The resulting models contain both geometric and semantic information, offering high compatibility with CityGML and CityJSON standards. The proposed workflow demonstrates the potential for efficient, data-driven, and automated urban model generation that supports digital twin construction and spatial database updating. Future work will focus on incorporating LiDAR-based point clouds to further improve automation and semantic accuracy within the CityGML 3.0 framework.

Comparison of Crack Detection Performance According to Caustic Noise Removal Methods in Shallow-Water ROV Imagery

Daegeon Woo, Jiwan Hong, Changjoon Oh, Geonu Park, Sehoon Oh, Joon Heo

Yonsei University, Korea, Republic of (South Korea)

This contribution investigates how caustic noise—bright, wave-induced light patterns—affects crack detection performance in shallow-water ROV imagery acquired at Daecheong Dam. Although many studies address underwater challenges such as turbidity, color attenuation, and motion blur, the optical distortions caused by caustic flicker have received little attention, despite being one of the most dominant artifacts in the 0–3 m depth range. Using real ROV video frames, we generated paired datasets with and without caustic-removal preprocessing and evaluated their impact on two lightweight CNN-based crack detection models (YOLOv5 and a transfer-learning AlexNet variant). Four filtering strategies were tested, including physics-based temporal median and motion-compensated averaging, as well as learning-based DeepCaustics and an FFT-residual method adapted from RecGS. Experimental results show that caustic-removal preprocessing consistently reduces false positives and improves crack visibility under diverse lighting conditions. The findings demonstrate that caustic noise is a critical but often overlooked source of detection instability in shallow-water inspections. The study emphasizes the importance of integrating simple, unsupervised caustic-mitigation steps into ROV-based monitoring pipelines to enhance the reliability of underwater infrastructure assessment.

Efficient Boundary Refinement for Classification of MMS Point Clouds

Makoto Nakano¹, Keita Hiraoka¹, Genki Takahashi², Hiroshi Masuda¹

¹The University of Electro-Communications, Japan; ²Kokusai Kogyo Co., Ltd., Japan

Mobile Mapping Systems (MMS) provide dense point clouds essential for 3D mapping and infrastructure management, where semantic labeling is required to segment points into meaningful objects. Previous studies have shown that multiscale geometric features effectively capture local context for this task. Building on our previous work using multiscale features with efficient two-stage neighborhood search, we applied Contrastive Boundary Learning (CBL) to enhance classification accuracy near object boundaries. While CBL significantly improved boundary recognition, it also increased computational cost compared to Random Forest–based segmentation, limiting its practicality for large-scale datasets. In this study, we analyze the trade-off between segmentation accuracy and inference time in CBL-based boundary refinement. We further explore strategies to reduce computation while maintaining sufficient accuracy, aiming to achieve an optimal balance for practical MMS point cloud processing.

Reconstruction and Evolution Simulation of Ancient Road Networks in the Yuncheng Region Based on Multi-Modal Data Fusion

Jingjue Jia¹, Mingyi Du¹, Qiang Chen¹, Zhenhua Gao²

¹Beijing University of Civil Engineering and Architecture, China, People's Republic of; ²Shanxi Provincial Research Institute of Archaeology,China, People's Republic of

Ancient transport networks are central to studies of historical geography, regional socio-economic systems, and human mobility patterns. Traditional network reconstruction has relied primarily on the Least-Cost Path (LCP) model; however, the LCP’s “single-optimal” assumption is overly simplistic and cannot capture common historical realities such as the coexistence of multiple routes. Although probabilistic approaches such as Circuit Theory (CT) and behaviorally explicit methods such as Agent-Based Modeling (ABM) have been developed, a systematic, integrated framework that combines these approaches remains underdeveloped. Using the Yuncheng area of Shanxi Province as a case study, this paper systematically compares and integrates three distinct network models by constructing LCP, CT, and ABM networks and quantitatively comparing their differences in path morphology and predictive logic. The resulting multimodal, integrated probabilistic road network synthesizes the strengths of the three approaches and provides precise, high-confidence target areas for archaeological survey.

Assessing Stream Morphology Using High Resolution and Thermal UAV Imagery

Anastasia Umstott, Yanli Zhang, Carmen Montana Schalk

Stephen F Austin State University, United States of America

To protect and promote fish resources, fish habitat needs to be assessed and establish a “standard” for good or poor habitat for specific fish species. For this study, High resolution UAV images, including thermal image, are collected with an Anzu Raptor T for selected streams in East Texas. Orthomosaic and classification analysis were performed to make accurate map to represent open water, channel substrate and riparian vegetation. This approach provides a rapid means to assess streams. Future efforts will target finer geomorphic unit classifications (e.g., pool, riffle, run) across multiple river systems. This information can be critical for freshwater habitat management and restoration.

Road marking condition assessment from drone imagery via detector-guided segmentation and gaussian mixture damage modeling

Dinh Minh Bui, JuBin Lee, HyunSoo Kim, SoMin Han, ChangJae Kim

Department of Civil and Environmental Engineering, College of Engineering, Myongji University.

Road marking condition assessment is essential for transportation safety and road asset management, yet conventional inspection methods remain labor-intensive and inefficient. This study proposes an automated workflow for assessing road-marking conditions from drone imagery by combining object detection with a detector-guided segmentation strategy. First, road-marking regions are localized through a lightweight detector optimized for aerial viewpoints. The detected regions are then refined using a segmentation module that produces pixel-accurate masks, enabling reliable extraction of surface-level deterioration such as fading, cracking, and structural discontinuities.

The proposed approach was evaluated on drone datasets collected under varying flight altitudes and illumination conditions. Experimental results indicate that detector-guided segmentation significantly improves robustness to background clutter and enhances segmentation accuracy compared to single-stage models. The method also supports quantitative condition scoring, making it suitable for integration into municipal inspection workflows.

This contribution demonstrates the potential of combining detection and segmentation for large-scale, drone-based road-marking assessment, offering a practical solution for automated infrastructure monitoring.

Quantitative Analysis of LiDAR Accuracy for Mapping Applications

Ahmed Elaksher¹, Tarig Ali², Abdullatif Alharthy³

¹NMSU, United States of America; ²American University of Sharjah, UAE; ³Ministry of National Guard, KSA

Airborne laser scanning (LiDAR) technology has demonstrated exceptional capability in rapidly capturing dense point clouds and accurately representing complex surface features. It has been successfully applied across numerous geospatial and engineering disciplines with highly promising outcomes. The accuracy of any derived product inherently depends on the quality of the original LiDAR data and the processing methods employed. Therefore, evaluating data quality is an essential prerequisite for reliable analysis and application.

This study presents a quantitative assessment of LiDAR system performance, focusing on the intrinsic accuracy of the laser measurements themselves—an aspect often underexplored in existing literature. The evaluation was conducted through detailed field surveying using GPS triangulation and leveling techniques. Results reveal both planimetric and vertical accuracy characteristics, with a total elevation discrepancy of approximately 0.12 m and a horizontal RMSE near 0.50 m. The identified discrepancies exhibit two distinct components: a short-period random variation associated with the LiDAR ranging system, and a lower-frequency component influenced by biases in the geopositioning subsystem.

Image-assisted aerial LiDAR completion with morphology-guided gaussian splatting

Siyuan Zou¹, Yongjun Zhang², Zhiwei Li¹, Hongbo Pan¹, Xinyi Liu², Haojun Tang¹, Hai Kan³

¹School of Geoscience and Info-Physics, Central South University; ²School of Remote Sensing and Information Engineering, Wuhan University; ³School of Resource and Environmental Sciences, Wuhan University

Airborne LiDAR offers high geometric accuracy and efficient wide-area coverage, and has been widely used in applications such as urban 3D reconstruction, forestry inventory, topographic mapping, and powerline extraction . However, due to near-nadir acquisition geometry and occlusions, vertical structures such as building façades are often under-sampled, resulting in large voids in the point cloud . Traditional geometric hole-filling methods, including Moving Least Squares, Poisson surface reconstruction, and mesh repair, are effective for small gaps, but they often suffer from over-smoothing, structural distortion, and topological discontinuities when applied to large-scale missing regions.

Meanwhile, multi-view imagery can recover continuous surfaces through dense matching or Gaussian Splatting, but the reconstruction quality still depends heavily on the completeness of the initial geometry. When the initial triangulated points or geometric priors are incomplete, façade regions remain prone to fragmentation and noise This paper proposes an image-assisted LiDAR completion framework that models LiDAR completion as continuous surface reconstruction with explicit Gaussians. Through anisotropic Gaussian initialization and tangent-plane-guided densification, the method preserves façade geometry and improves the completeness and accuracy of LiDAR-image fusion reconstruction.

Date: Wednesday, 08-July-2026

8:30am - 10:00am

WG II/2C: Point Cloud Generation and Processing
Location: 713A

8:30am - 8:45am

Differentiable deep consistency for point cloud registration

Tian Zhang, Sagi Filin

Technion - Israel Institute of Technology, Israel

Point cloud registration is a key facilitator for scan alignment in mapping, autonomous driving, and robotic applications. Current pipelines increasingly adopt neural-based paradigms, where most research focuses on learning view-consistent descriptors for correspondence matching. Due to outliers, matching is typically followed by a geometric verification phase that assesses correspondences by enforcing distance or angular consistency to support transformation estimation. Although effective, this verification stage scales quadratically, creating a computational bottleneck that hampers efficient registration. More importantly, since matching and verification are usually optimized separately, the verification stage cannot guide the learned descriptors or foster their geometric awareness. To address both limitations, we introduce a novel end-to-end neural registration framework that unifies correspondence learning and verification within a single differentiable formulation. Specifically, we propose a new consistency-driven cross-attention module that dynamically correlates cross-scan neighborhoods to suppress inconsistent matches and reinforce inter-scan feature coherence. In doing so, it produces robust and discriminative descriptors without incurring the quadratic cost of explicit pairwise verification. Our formulation is readily applicable, and we demonstrate its seamless integration into the GeoTransformer and RoITr state-of-the-art architectures without additional supervision or post-processing. Results show that our method excels in challenging low-overlap scenarios, where competing methods often yield few correct correspondences or fail entirely. It consistently achieves superior inlier ratios and the lowest registration errors on 3DMatch, 3DLoMatch, and KITTI, improving registration recall by up to 2.6%. Beyond accuracy, it converges faster during training and achieves the quickest inference among state-of-the-art methods.

8:45am - 9:00am

Cross-source Point Cloud Registration in the Bird’s-eye Domain: Aligning Street-level LiDAR with High-resolution Aerial Orthoimagery

Seunghwan Hong¹, Junghun Suh¹, Seokhyun Moon², Mohammad Gholami Farkoushi³, Hong-gyoo Sohn³

¹Kakao Mobility, Republic of Korea; ²University of Seoul, Republic of Korea; ³Yonsei University, Republic of Korea

Combining terrestrial Mobile Mapping System (MMS) point clouds with aerial photogrammetric data offers a practical route to comprehensive 3D urban models that integrate street-level geometric detail with wide-area coverage. However, direct 3D-to-3D registration between these data sources often fails because of large differences in viewpoint, point density, scale, and scene composition. This study presents an orthoimage-based registration framework that reformulates cross-source alignment in the Bird's-Eye-View (BEV) domain. After removing transient objects and extracting ground-level points from the MMS cloud, the data are rasterised into a synthetic orthoimage aligned in resolution and projection with a geo-referenced Unmanned Aerial Vehicle (UAV) orthoimage. A learned dense matcher establishes image correspondences, which are geometrically verified and lifted to 3D for coarse alignment, followed by tile-wise point-to-plane Iterative Closest Point (ICP) refinement and global trajectory regularisation via robust factor-graph optimisation. The aligned MMS and UAV point clouds are then integrated through reliability-driven voxel-level fusion. Experiments on a 3.7km urban corridor in Seoul demonstrate that the proposed framework achieves a 3D root-mean-square error of 6.19cm, indicating that BEV-domain orthoimage matching combined with local 3D refinement and trajectory regularisation provides a viable approach for large-scale MMS-UAV registration in dense urban environments.

9:00am - 9:15am

Automated Alignment Enhancement of Backpack Image-LiDAR Data in a Forest Environment

Jidong Liu, Sangyoon Park, Songling Fei, Ayman Habib

Purdue University, United States of America

In recent years, backpack mobile mapping systems (MMS) have shown great promise for under-canopy forest mapping. These systems integrate cameras, LiDAR sensors, and Global Navigation Satellite System/Inertial Navigation System (GNSS/INS) units to provide multi-modal geospatial data essential for modern forest applications that require both geometric and spectral information. However, transportation logistics and improper handling can degrade the system calibration. Moreover, canopy-induced GNSS signal outages will cause trajectory errors. The resulting misalignments between the image-LiDAR data necessitate the application of image–LiDAR registration. Such algorithms can be broadly classified as 2D-3D, 3D-3D, or 2D-2D, depending on the domain in which image-LiDAR features are identified. Due to the inherent modality differences, 2D–3D methods often struggle with feature matching. These methods typically require manual feature selection (Habib et al., 2005) or the availability of prominent features in urban environments (Liao et al., 2023). In contrast, 3D-3D methods rely on generating 3D image point clouds, which imposes strict requirements on image overlap (Yang et al., 2015). Although 2D–2D approaches are less demanding on image data (Hu et al., 2023), none have been applied in under-canopy forests, where establishing multi-modal correspondences remains challenging. To overcome these limitations, this study introduces a post-processing framework for automated image–LiDAR alignment enhancement for backpack MMS in forest environments. This method utilizes a 2D–2D image–LiDAR registration approach based on semantic tree-trunk features.

9:15am - 9:30am

A Marker-based Method for precise 3D Registration between CT-Data and photogrammetric Datasets

Sebastian Preussel¹, Cyrill Milkau², Hans-Gerd Maas¹

¹TU Dresden, Germany; ²HTW Dresden, Germany

In order to enable photogrammetric tracking of objects from a computed tomography (CT) dataset with a multi-camera system, a transformation between the CT data space and a photogrammetric reference frame is required, typically based on control points. To achieve a robust and precise registration between CT and photogrammetric datasets, this work proposes a marker-based approach. The main goal is to use a marker model that allows straightforward segmentation and control point estimation in CT voxel space, while also supporting reliable and precise control point estimation in the photogrammetric images. As a proof-of-concept, spherical markers were investigated, since they allow centre estimation in both domains. In the CT data, marker centres were determined by intensity-based thresholding followed by sphere fitting, while in the photogrammetric data they were estimated by intensity-based thresholding, edge detection, circle fitting, and multi-image spatial intersection. Two different marker models were tested. The results show that the proposed method is feasible and yields sub-millimetre standard deviations of unit weight for both marker types. However, since a sufficient stochastic model is not yet available, the reported accuracy measures may be optimistic and should therefore be interpreted with caution. Future work will address these limitations, in particular uncertainty modelling as well as remaining lighting and contrast issues.

9:30am - 9:45am

Advances in Historical Aerial Image Analysis: Boosting SfM Pipelines with Learned Models

Lucas Kugler¹, Jan Dirk Wegner¹, Camilo Rada^2,3, Livia Piermattei¹

¹University of Zurich, Switzerland; ²University of Magallanes; ³University of British Columbia

Scanned aerial images acquired with film cameras (hereafter referred to as historical images) over the past century is a unique source for deriving Digital Elevation Models (DEMs) and orthoimage to reconstruct past Earth’s surface and quantify long-term changes from glacier to landscape and urban development.

The Historical Structure-from-Motion (HSfM) pipeline (Knuth et al., 2023) currently represents the state of the art to fully automatically generate these historical DEMs. However, struggles with inconsistent image quality, distortions, distinct geometries and above all is based on the commercial software Metahape. Therefore, we aim to: (1) develop a fully open-source solution in COLMAPs environment, (2) integrate learned models in different SfM-steps to better handle the complex properties that come with historical imagery, and (3) compare our output against HSfM.

Our work is based on 180 historical aerial images acquired above the challenging terrain of Gran Campo Nevado Glacier. The results show that our photogrammetric workflow leads to a 0.26 px smaller mean reprojection error as well as roughly 9-times more tie-points for the sparse point cloud compared to the HSfM. The mean DEM difference with a reference DEM on stable terrain and the 95%-quantile DEM difference are also smaller in our experiments (0.71m vs. 10.10 m and 73.62 m vs. 99.03 m).

Further tests of our workflow include employing alternative models for feature extraction, matching, and dense reconstruction as well as evaluating multitemporal approaches (as adopted in Knuth et al., 2023) to enable a more representative comparison.

9:45am - 10:00am

Trinocular Multi-Object 3D Reconstruction in Camera-Simulating virtual Environments for Knee Arthroplasty

Arne Schierbaum¹, Tobias Neiss-Theuerkauff², Thomas Luhmann¹, Frank Wallhoff², Till Sieberth¹

¹Jade University of Applied Sciences, Institute for Applied Photogrammetry and Geoinformatics, Oldenburg, Germany; ²Jade University of Applied Sciences, Institute for Technical Assistive Systems, Oldenburg, Germany

In knee arthroplasty, computer-assisted navigation enhances the accuracy of prosthesis placement. However, current methods rely on invasively drilled locators to track the knee position during surgery, prolonging the healing process. For this reason, research is focused on markerless approaches capable of determining knee orientation and transferring preoperative planning into the surgical environment. This work presents a trinocular multi-object 3D reconstruction system designed for intraoperative acquisition of the knee surface, providing a foundation for marker less navigation. Due to the scarcity of real surgical data with ground truth, a synthetic dataset was created using Blender to simulate optical image acquisition of a virtual knee model under controlled camera and lighting conditions. The dataset enables a systematic evaluation of how camera motion and viewpoint affect pose estimation and 3D reconstruction accuracy. The results demonstrate that moderate camera deflection between 15° and 25° achieve the best balance between accurate camera pose estimation and surface reconstruction quality. The work confirms the potential of trinocular SLAM for robust bone surface tracking while also identifying the limitations of synthetic data, such as the absence of real-world visual variability. These results form the basis for future work on 3D reconstruction during dynamic knee movements and their tracking, as well as on the integration of markerless optical navigation systems into surgery.

8:30am - 10:00am

WG III/1J: Remote Sensing Data Processing and Understanding
Location: 713B

8:30am - 8:45am

Regional Fire Dynamics in the Atlantic Forest Biome: Differences from the National Scenario

Felippe Lima, Carlos Eduardo Biasi, Rafael Wagenmacher, Elisama Oliveira, Antonio da Silva, Renan Reis, Jullyene Lemes, Reinaldo Ribeiro, Luiz Neto, Laurizio Alves, Bianca Pansonato

Censipam, Brazil

This study statistically analyzes fire events in the Atlantic Forest, seeking to understand their particularities in relation to the national scenario. The biome, historically pressured by deforestation, fragmentation, and anthropogenic activities, also suffers from agricultural, livestock, and accidental fires, which increase its vulnerability. The research used data from Censipam's Fire Panel, obtained by MODIS and VIIRS orbital sensors, considering records from 2020 onwards and specific sections for the Atlantic Forest. Variables such as area, severity, persistence, speed of expansion, number of outbreaks, Fire Radiative Power (FRP), and detections were analyzed. The results indicate that, compared to the national pattern, fires in the Atlantic Forest are less intense and shorter in duration, a phenomenon associated with higher humidity, landscape fragmentation, and management conditions. It is concluded that the dynamics of fire in the biome differ significantly from the national average, reinforcing the importance of regional monitoring and firefighting strategies aimed at preserving its ecological integrity.

8:45am - 9:00am

A Spatiotemporal Evaluation Framework for MODIS-Derived Fire Events

Bahareh Kalantar¹, Husam Al-Najjar², Naonori Uedae¹

¹RIKEN Center for Advanced Intelligence Project, Japan; ²Faculty of Engineering and IT, University of Technology Sydney (UTS)

The MODIS burned area product is widely used to extract ignition locations and delineate individual fires for wildfire probabilistic loss modeling. However, limited studies have systematically evaluated the accuracy of these derived fire events through detailed spatial and temporal comparisons with reference datasets. This study addresses this gap by developing a robust framework to assess the accuracy of MODIS-derived individual fires across the United States. In this study, the MODIS Collection 6 MCD64 burned area product was used to extract ignition locations and individual fire events using the Fire Events Delineation (FIRED) algorithm. A comprehensive evaluation framework was then implemented to assess the delineated fire events against the Monitoring Trends in Burn Severity (MTBS) reference dataset, accounting for both spatial overlap and temporal consistency. The results show that the proposed approach achieved an average Intersection over Union (IoU) score of 0.54, an F-score of 0.701, an overall accuracy of 0.77, a precision of 0.90, and a recall of 0.57. These metrics represent averages across the period 2001–2020. Collectively, the results highlight the strengths and limitations of the event detection system and provide a quantitative assessment of its performance. This comprehensive evaluation offers valuable insights into the reliability of MODIS-derived individual fire events and improves understanding of their suitability for wildfire probabilistic loss modeling and related applications.

9:00am - 9:15am

CFMap: A Deep Convolutional Neural Network for Predicting Wildfire Risk Maps

Lucas Lopes, Rafik Ghali, Moulay Akhloufi

Perception, Robotics and Intelligent Machines (PRIME), Université de Moncton, Canada

Wildfires cause economic, social, and environmental consequences, as they affect ecosystems, public safety, biodiversity and natural resources. They pose challenges to various world regions, particularly Mediterranean areas such as Spain. Numerous fire prediction and detection systems were introduced to detect and predict fires as well as prevent their risks and damage. Statistical methods and classical machine learning models were often employed to estimate and predict fire risk, showing their efficiency in generating fire risk maps. However, they fail to accurately capture complex temporal and spatial characteristics related to fire ignition. To address this challenge, a novel Convolutional Neural Network (CNN) model, namely CFMap, was introduced for predicting and generating detailed wildfire risk maps covering Spain regions. Comprehensive analyses were performed using data between 2008 and 2024, including fire history, geographical location information, land usage features, human activity indices, topography data, meteorological features, and vegetation indices from Spain regions, collected from the IberFire dataset. CFMap showed a superior performance with an accuracy of 0.8028 ± 0.0440, an AUC (Area Under the Curve) of 0.9354 ± 0.0088, and an F1-score of 0.7787 ± 0.0623, outperforming classical machine learning methods (XGBoost, LightGBM, and RandomForest) and deep learning models including ResNet and a simple CNN. These results demonstrate its reliability in predicting fire events and generating monthly fire risk maps for different Spain regions. Consequently, it helps to identify high fire risk zones, improve fire management strategies, and efficiently deploy firefighting resources, thereby reducing the potential risk and impact of fires.

9:15am - 9:30am

Graph-Attention Network for Spatially-Aware Post-Hurricane Building Damage Assessment from UAV Imagery

Fuad Hasan¹, Ali Lesani¹, Chul Min Yeum¹, Rodrigo Costa²

¹Computer Vision for Smart Structures (CViSS) Lab, Waterloo, Canada; ²University of Waterloo, Canada

In the immediate aftermath of a hurricane, the rapid, accurate assessment of building damage is paramount for effective emergency response and the allocation of resources. Traditional methods of damage assessment, which rely on ground-based surveys, are often slow, hazardous, and subjective. While the advent of remote sensing (RS), through Unmanned Aerial Vehicles (UAVs) and the application of Convolutional Neural Networks (CNNs), has significantly advanced the automation of this process, these models operate on a pixel-level or object-level basis, failing to capture the inherent spatial relationships and contextual information within a disaster zone. Damage patterns are not spatially random; they exhibit strong spatial autocorrelation, a principle encapsulated by Tobler's First Law of Geography. This paper introduces a novel approach that leverages Graph Attention Networks (GATs) to explicitly model spatial dependencies when evaluating building damage. By representing damaged buildings and their surroundings as nodes and edges in a graph, our model can learn and weigh the influence of neighboring structures and the local environment when assessing their damage level. This spatially-aware methodology moves beyond simple image classification to a more holistic scene understanding. We evaluate the method on DoriaNET, a geo-referenced UAV dataset collected after Hurricane Dorian (2019) that provides masked building patches, GPS centroids, structural metadata, and ordinal FEMA/HAZUS-style damage labels. By incorporating spatial context via a graph-based framework, our GAT model achieves superior performance in building damage classification compared to state-of-the-art CNN-based approaches, producing more coherent and accurate damage maps better suited to real-world disaster management scenarios.

9:30am - 9:45am

Imaging wind field from videos: an innovative tool for urban scale measurements.

Annia Benchadi, Jairo Acuña Paz y Miño

Université de Lille, France

This work presents an innovative image-based method for measuring wind speed and direction in urban environment using video

footage. Wind dynamics are traditionally investigated at multiple spatial scales, including pollutant dispersion at the canopy level

(Allwine, 2000), architectural design and outdoor comfort at the building scale (Allard, 2012; Holst, 2011) and the convection heat

transfer coefficient ℎ [Wm-²K-1] used to define the boundary conditions of numerical simulations (Oke, 2017). In 1997, Gary Settles

showed that image measurement could provide non-invasive and high-resolution measurements of fluid motion. This paper presents a

method for extracting anemometric data from images at the urban scale. We process freely accessible videos from the internet in which

air masses are identified at the canopy level. Motion extraction technique is used to isolate elements of the video that are in motion.

This information is fed into an optical flow algorithm that estimates an apparent velocity in [pixels/frame]. To convert the data to

[km/h], the view’s perspective is considered to ensure the conversion is accurate across the entire image. Distance mapping is

performed by projecting the image onto a 3D model of the scene, and the camera's recording parameters are estimated by simulating

the illumination of the scene. The anemometric data obtained are evaluated in relation to meteorological data recorded at a nearby

weather station. Innovative and simple to implement, this approach provides estimates of wind speeds and directions that are both

reliable and directly usable for architectural design and climate studies.

9:45am - 10:00am

Predictive Modeling of Urban Heat Islands in Indian Cities: A Case Study of Jaipur city, Rajasthan, India

Aryan Yadav, Kakoli Saha, Marguerite Madden

Indian Institute of Technology, Hyderabad

Rapid urbanization and the loss of vegetative cover in Indian cities have raised serious concerns about environmental sustainability and public health. This study focuses on analyzing and forecasting Urban Heat Island (UHI) patterns in Jaipur, India, by examining both Surface UHI (SUHI) and Atmospheric UHI (AUHI). Using Google Earth Engine, the research integrates diverse spatio-temporal datasets—including Landsat-derived indices (such as LULC, NDVI, NDWI, NDBI, NDMI, albedo, and emissivity), geospatial features (building density, sky view factor, and population density), and meteorological data (air temperature, humidity, wind speed, and solar radiation) from 2000 to 2024—to train a Random Forest Regression model. The model demonstrated strong performance (R² = 0.806; RMSE = 0.059), surpassing linear and generalized additive models by effectively capturing complex, non-linear relationships. It also helped identify high-risk areas like Transport Nagar and Budhsinghpura. Projections for 2030 and 2035 indicate increasing heat stress, particularly in Jaipur’s expanding urban periphery. This GIS-integrated machine learning framework presents a replicable approach for UHI prediction in other fast-growing Indian cities.

8:30am - 10:00am

WG I/5: Microwave and InSAR Technology for Earth Observation
Location: 714A

8:30am - 8:45am

Advanced InSAR Technology for Artificial Slope Monitoring: Addressing Vegetation Decorrelation and Atmospheric Delays

Yinjie Cao, Lei Zhang, Hongyu Liang

College of Surveying and Geo-Informatics, Tongji University, Shanghai, 200092, China

Southwestern China’s complex terrain and climate make landslides frequent, especially along highways where numerous high, steep artificial slopes are formed during construction. These slopes often deform or fail within 1–2 rainy seasons due to intricate geology, severely affecting construction and infrastructure safety. An automated, real-time monitoring and early-warning system is therefore urgently needed.

Conventional techniques (leveling, GPS, crack meters) are limited by small coverage, low efficiency, high cost, and inability to detect regional or hidden deformations. Spaceborne InSAR offers wide-area, high-precision, all-weather monitoring but faces severe decorrelation noise from dense vegetation and atmospheric delay errors in mountainous regions.

This study developed advanced InSAR methods for artificial slopes along the Huali Highway (G4216) in Yunnan Province. Using TCPInSAR and >240 Sentinel-1 images (2015–2025), we retrieved surface deformation throughout pre-construction, construction, and post-construction phases. To overcome local challenges, two novel correction approaches were proposed: (1) a noise-reduction method based on spatial correlation estimation of deformation signals, effectively suppressing vegetation-induced decorrelation; and (2) an atmospheric correction technique using Singular Spectrum Analysis (SSA), significantly reducing delays caused by complex weather.

Results show the improved InSAR system successfully detected multiple deformation zones along the corridor and provided reliable early warnings for safety management. By addressing key technical bottlenecks, this work validates the practicality and effectiveness of advanced InSAR for automated slope stability monitoring in geologically and environmentally complex regions, offering valuable reference for similar large-scale infrastructure projects.

8:45am - 9:00am

Meteorological Influence on L-Band Forest Backscatter: Evidence from the BorealScat-2 Radar Tower

Shivam Rawat¹, Albert R. Monteith², Lars M.H. Ulander², Hjalmar Laudon³, Jose Gutierrez Lopez³, Henrik J. Persson¹

¹Department of Forest Resource Management, Swedish University Of Agricultural Sciences, Umeå, Sweden; ²Department of Space, Earth and Environment, Chalmers University of Technology, Gothenburg, Sweden; ³Department of Forest Ecology and Management, Swedish University Of Agricultural Sciences, Umeå, Sweden

Meteorological control of L-band forest backscatter from the BorealScat-2 radar tower

How strongly do weather conditions imprint on L-band radar signals from forests at sub-daily time scales? This question is investigated using the BorealScat-2 tower experiment in the Svartberget Experimental Forest (northern Sweden). The system acquires fully polarimetric, tomographic radar data at P, UHF and L band every 30 minutes, providing height-resolved backscatter profiles from the ground, through the trunk zone, into the upper canopy. Within the shared footprint, an ICOS flux mast delivers continuous measurements of CO₂, water vapour and energy fluxes, together with radiation, vapour pressure deficit (VPD), temperature, wind and precipitation. Sap-flow sensors, dendrometers and soil water probes further characterise water storage and transport in trees and soils, offering an unusually detailed description of forest water dynamics.

The study will focus on L-band backscatter during late spring and summer, quantifying how diurnal amplitude, phase and vertical centre-of-mass in different height zones and polarisations relate to VPD, temperature, radiation and rainfall. It will specifically assess the relative roles of atmospheric demand, canopy wetness and soil water status in driving sub-daily L-band variability, and examine differences between co- and cross-polarised channels and between structural layers. Overall, the study aims to provide process-based insight into how specific meteorological drivers control sub-daily L-band radar variability in boreal forests, supporting the interpretation and modelling of future vegetation radar missions.

9:00am - 9:15am

Cross-validation of the DEM obtained using LuTan-1 SAR satellites: A case study in Guyuan County, China

Xinming Tang¹, Yao Lu¹, Tao Li¹, Xiaoqing Zhou¹, Jing Lu¹, Xiang Zhang¹, Xuefei Zhang¹, Xiaofeng Qiu², Jiyi Chen¹

¹Land Satellite Remote Sensing Application Center, Ministry of Natural Resources, Beijing 100048, China; ²Beijing SatImage Information Technology Co.,Ltd., Beijing 100048, China

The digital elevation model (DEM) based on synthetic aperture radar interferometry (SAR, InSAR) technology have become an important data source for large-scale topographic mapping, but their characteristics vary with satellite systems and methodologies. In this paper, we conduct the cross-validation for the first time to compare the LuTan-1 raw DEM (LT-1 RDEM) and GaoFen-7 (GF-7) satellite laser altimetry data. Besides, we compared the penetration capabilities of SAR satellites including C-band SRTM and X-band TanDEM-X. The optically derived ZiYuan-3 (ZY-3) DEM was also included for multi-source cross-validation. Taking Guyuan County, Hebei Province, China (including four landform types: plains, tablelands, hills, and mountains) as the study area, we introduced GF-7 laser altimetry points (LAPs) as the verification benchmark to cross-validate the elevation accuracy of LT-1 RDEM, SRTM, TanDEM-X DEM (TanDEM), and ZY-3 DEM. The results indicate that: (1) Topographic relief has a significant impact on accuracy, and the RMSE of the DEMs in the study area generally increases sequentially with the intensification of topographic relief; (2) Benefiting from the 10-meter spatial resolution, LT-1 RDEM performs best in detail representation; (3) In terms of mean height error, LT-1 RDEM exhibits a general negative bias, confirming the stronger penetration capability of the L-band, and its elevation values may be closer to the true ground surface; (4) The RMSE of LT-1 RDEM in the study area is 1.958m, slightly larger than TanDEM’s 1.65m, but in fact, the accuracy of TanDEM as a digital surface model (DSM) may be systematically overestimated by laser altimetry data.

9:15am - 9:30am

Operational Deformation Monitoring of the Hong Kong–Zhuhai–Macao Bridge with Multi-Orbit LuTan-1 SAR Satellites

Jing Lu¹, Xinming Tang¹, Tao Li¹, Xiang Zhang¹, Xuefei Zhang¹, Tan Li², Xiaofeng Qiu², Yao Lu¹

¹Land Satellite Remote Sensing Application Center, MNR, China, China, People's Republic of; ²Beijing SatImage Information Technology Co.,Ltd., Beijing 100040, China

This study evaluates the operational capability of the Chinese LuTan-1 (LT-1) L-band SAR constellation for monitoring the Hong Kong–Zhuhai–Macao Bridge (HZMB), a representative sea-crossing bridge under a complex subtropical marine climate. Leveraging the advantages of L-band SAR—including strong resistance to decorrelation and a spatial resolution of up to 3 meters—we applied the Small Baseline Subset (SBAS) technique to 47 ascending and descending orbital images. To the best of our knowledge, this represents one of the first comprehensive deformation studies of the HZMB using the LT-1 constellation. A key aspect of our methodology is the cross-validation between multi-orbit datasets, which confirmed both the reliability of the measurements and the complementary distribution of coherent points due to SAR imaging geometry. The results indicate overall structural stability of the HZMB, with the maximum deformation localized at the Jianghai Navigation Bridge, showing a Line-of-Sight (LOS) displacement rate of –4.3 mm/yr. In contrast, the two artificial islands exhibited minor deformation, with LOS rates not exceeding –3.0 mm/yr. These findings validate LT-1 as a powerful and reliable tool for the operational health monitoring of large-scale coastal infrastructure.

9:30am - 9:45am

LuTan-1 InSAR Products Assessments for Geohazards and Geoinformation Monitoring

Tao Li¹, Xinming Tang², Xiang Zhang¹, Jing Lu¹, Xin Tian², Xuefei Zhang¹

¹Land Satellite Remote Sensing Application Center, MNR, China, People's Republic of; ²Southeast University

LuTan-1 (LT-1) satellites have been launched for about 4 years. About 771,282 images have been distributed to the users of China till 29th October, 2025. Main application purpose of LT-1 is geohazard monitoring and geoinformation production. Interferometric capability is the primary consideration for LT-1. In this paper, we assessed the interferometric applications in the natural resource monitoring industry. First, we overviewed the status of LT-1, the main interferometric products were introduced as S2A, S2B, S3A, S4A, S5A, S5B and S5C. They are geometrically calibrated single look complex (SLC) image, interferometrically calibrated SLC, differential interferometric synthetic aperture radar (SAR, InSAR, DInSAR) products, stacking, MTInSAR, digital orthorectified image, and digital surface model, respectively. S2A are generated after geometric calibration, the geometric accuracy is about 1.53 after calibration. The baseline is then calibrated for helix bistatic formation data and generate S2B whose accuracy is better than 0.96 m. S3A, S4A and S5A are all used for deformation monitoring, the accuracy values of them are 2.7 mm, 8.6 mm/yr, and 3.7 mm. Geometric accuracy of S5B is 12.5 m, and the height accuracy of S5C is better than 4.7 m. More than 330 geohazards were detected in Guangdong province. The geohazards recognition rate in the field working stage increase from 28% to 47.24%. Even a prediction has been made to avoid disaster for a family and saved 3 people. The application effectiveness has been validated through those years.

9:45am - 10:00am

Improved deformation monitoring technology considering the penetration variation of L-band SAR signals

Xiang Zhang¹, Xinming Tang¹, Hui Zhao², Tao Li¹

¹Land Satellite Remote Sensing Application Center, MNR, China, People's Republic of; ²National Geomatics Center of China, Beijing, China

The influence of soil moisture change on interference phase information is fully taken into account for accurate deformation monitoring in this research. Especially the effects have more prominent contribution to L-band SAR data. In order to obtain high-precision surface deformation information over agricultural area, the interference phase component caused by soil moisture change should be considered, and the optimal processing of interference phase information is achieved. The reliable interference phase information characterizing the surface deformation details is obtained, thus the natural surface deformation information with high precision can be achieved. Firstly, the penetration depth of different band SAR for agricultural soil was analyzed and simulated. And the sensitivity between penetration depth variation and different band SAR signals were discussed. The fact of soil moisture changes for interferometric phase contribution is confirmed, which provided the foundation for reliable deformation montoring considering the soil moisture variation effects, especially for L-band SAR data. The periodic irrigation for the wheat fields will induce soil moisture variation, which may result in the penetration depth change for radar electromagnetic wave. Therefore, the phase component was derived by the variation of soil moisture over the wheat fields. Multi-temporal Lutan-1 SAR data were acquired over ShanDong agricultural plain in China. The obvious ‘deformation details’ induced by the soil moisture change were acquired over the agricultural area, which demonstrated the effect of soil moisture variation for interference phase. Therefore, the accurate deformation details over agricultural area can be obtained by the combination of soil moisture information.

8:30am - 10:00am

WG III/8C: Remote Sensing for Agricultural and Natural Ecosystems
Location: 714B

8:30am - 8:45am

Random Temporal Masking and Neural ODE Optimization for Crop Type Mapping with Inconsistent Remote Sensing Time Series Data

Wei Cui¹, Tao He¹, Yu Duan¹, Ziang Wang¹, Kaimin Sun¹, Ya Wen²

¹WUHAN UNIVERSITY,wuhan, China; ²North Automatic Control Technology Institute. Taiyuan,China

Multi-temporal remote sensing is crucial for crop monitoring, but existing mapping methods struggle with incomplete time series due to data missingness. Current models often assume consistent data, leading to performance degradation when faced with irregular or missing observations. To address this, we propose an enhanced approach combining random temporal masking with neural Ordinary Differential Equation (ODE) optimization, designed to be embedded into existing models.

Our method first employs a random temporal masking strategy during training, forcing the model to learn effective temporal dependencies from sparse, incomplete sequences, thereby boosting its adaptability to diverse missing data scenarios. Second, a time-smoothing regularization term, based on neural ODE, guides the model to learn a continuous, smooth feature trajectory from discrete observations, effectively mitigating temporal inconsistencies and abrupt fluctuations caused by missing data. We also incorporate sine-cosine positional encoding with slight perturbations for precise time representation.

We integrated our approach into the state-of-the-art TSViT model and evaluated it on the PASTIS dataset. Experiments show that while the original TSViT’s accuracy (OA and mIoU) sharply declines with increasing missing frames, our enhanced model maintains significantly better performance. At 80% missing data, our method improves OA by approximately 8% and mIoU by about 12% compared to the baseline. Qualitative results further demonstrate our model’s ability to preserve coherent, smooth spatiotemporal predictions, enhancing robustness and generalization in real-world applications.

8:45am - 9:00am

Mind the Gap: Bridging Prior Shift in Realistic Few-Shot Crop-Type Classification

Joana Reuss¹, Ekaterina Gikalo¹, Marco Körner^1,2,3

¹Technical University of Munich (TUM), TUM School of Engineering and Design, Department of Aerospace and Geodesy, Chair of Remote Sensing Technology, Germany; ²Technical University of Munich (TUM), Munich Data Science Institute (MDSI), Germany; ³ELLIS Unit Jena, University of Jena, Germany

Real-world agricultural distributions often suffer from severe class imbalance, typically following a long-tailed distribution. Labeled datasets for crop-type classification are inherently scarce and remain costly to obtain. When working with such limited data, training sets are frequently constructed to be artificially balanced—in particular in the case of few-shot learning—failing to reflect real-world conditions. This mismatch induces a shift between training and test label distributions, degrading real-world generalization. To address this, we propose Dirichlet Prior Augmentation (DirPA), a novel method that simulates an unknown label distribution skew of the target domain proactively during model training. Specifically, we model the real-world distribution as Dirichlet-distributed random variables, effectively performing a prior augmentation during few-shot learning. Our experiments show that DirPA successfully shifts the decision boundary and stabilizes the training process by acting as a dynamic feature regularizer.

9:00am - 9:15am

Integrating hyperspectral and phenological features for cereals mapping in a mediterranean region, Morocco

Mohamed Bourriz^1,2,3, Ahmed Laamrani^1,4,5, Nadir El Bouanani^1,2, Walid Elbarz², Hamd Ait Abdelali², François Bourzeix², Abdelghani Chehbouni¹

¹CRSA, Mohammed VI Polytechnic University (UM6P), Campus Ben Guerir 43150, Morocco; ²A-Lab, UM6P, Campus Rabat 11103, Morocco; ³Friedrich Schiller University Jena, Department of Geography, Jena 07743, Germany; ⁴Department of Geography, Environment & Geomatics, University of Guelph, Guelph, ON N1G 2W1, Canada; ⁵Institut de recherche sur les forêts (IRF), Université du Québec en Abitibi-Temiscamingue (UQAT), 445 boul. de l’Universite´, Rouyn-Noranda, Québec J9X 5E4, Canada

Accurate discrimination of cereal crops in heterogeneous agroecosystems requires methods that integrate both spectral and temporal information. This study proposes a compact spectral–temporal framework that combines Optimal Hyperspectral Narrowbands (OHNB) selected from EnMAP imagery using a Spectral Attention Module (SAM) with a Dynamic Time Warping (DTW)-derived phenological distance computed from Sentinel-2 EVI time series. The analysis was conducted in the Saïss region of Morocco, one of the country’s major cereal-producing areas.

SAM identified 29 physiologically meaningful narrowbands spanning the visible, red-edge, near-infrared, and shortwave-infrared regions (429–2438 nm), capturing key pigment, structural, and moisture-related vegetation properties. EVI time series were preprocessed through 10-day median compositing, linear interpolation, and Savitzky–Golay smoothing to generate stable phenological profiles. DTW quantified the temporal similarity of each field’s EVI trajectory to a cereal reference curve, producing a phenology-driven distance feature.

Three classifiers—Random Forest, SVM, and TabPFN—were evaluated under a nested standard and spatial cross-validation strategy. Using only hyperspectral bands, SVM and TabPFN achieved the highest accuracies (ROC-AUC = 0.95–0.93). Incorporating the DTW feature consistently improved performance under spatial CV, especially for RF (ROC-AUC increase: 0.89→0.91), and reduced the performance gap between validation schemes.

Overall, the fusion of SAM-selected hyperspectral bands with DTW-based phenological information enhanced spatial robustness and improved discrimination between cereal and non-cereal fields. The proposed approach offers an efficient and transferable solution for operational crop mapping in semi-arid agricultural landscapes.

9:15am - 9:30am

Applying a U-Net Convolutional Neural Network for Mapping Banana Crops in the Atlantic Forest Region of Brazil Using CBERS-4A High Spatial Resolution Imagery

Hugo do Nascimento Bendini¹, Mateus de Souza Miranda², Thales Sehn Körting³

¹Department of Fisheries Resources and Aquaculture (DERPA), Faculty of Agrarian Sciences (FCAVR), State University of Sao Paulo (UNESP), Registro, Brazil; ²Artificial Intelligence Laboratory for Aerospace and Environmental Applications, Applied Computing, National Institute for Space Research, Brazil; ³Remote Sensing Postgraduate Program (PGSER), Earth Sciences General Coordination (CGCT), Brazil’s National Institute for Space Research (INPE)

Mapping banana crops in heterogeneous tropical landscapes remains challenging due to spectral similarity with surrounding vegetation, fragmented smallholder systems, and complex land-use mosaics. This study applies a deep learning approach, using a U-Net model, on high spatial resolution CBERS-4A imagery to map banana crops in Brazil’s Ribeira Valley, a subtropical region with high rainfall and heterogeneous land cover. Reference data were created through manual interpretation of satellite imagery supported by field knowledge. Representative image tiles were selected and divided into smaller patches for model training, validation, and testing. The U-Net model was trained with standard optimization techniques and evaluated using common semantic segmentation metrics. On the validation set, it achieved strong performance (accuracy 0.91, F1-score 0.84, AUC-ROC 0.96, AUC-PR 0.92). Performance was maintained or improved on the independent test set (accuracy 0.91, F1-score 0.86, AUC-ROC 0.97, AUC-PR 0.93), indicating good generalization. with high agreement between predicted and reference data. Most errors occurred at boundaries between crops and natural vegetation. Additional validation using official agricultural statistics confirmed consistency at the municipal scale. The approach demonstrates that high-resolution imagery combined with deep learning can effectively map banana crops in the region and offers a promising tool for agricultural monitoring and land-use planning in complex environments. The code, trained models, and data are publicly available at https://github.com/hnbendini/banana-unet-mapping.

9:30am - 9:45am

Observing the Phenological Characteristics of Winter Food Crops with Spectral Indices

Hsuan-Yi Li¹, James A. Lawrence¹, Philippa J. Mason², Richard C. Ghail³

¹Department of Civil and Environmental Engineering, Skempton Building, Imperial College London, South Kensington, London SW7 2AZ, UK; ²Department of Earth Science & Engineering, Imperial College London, Prince Consort Road, London SW7 2AZ, UK; ³Department of Earth Sciences, Queens Building 245, Royal Holloway, University of London Egham, Surrey TW20 0EX, UK

This study is based on the best crop classification result generated by the proposed unsupervised Machine Learning (ML) method in Li et al., 2025a, using the spectral indices calculated by the formula with spectral bands from Sentinel-2 image products, Normalized Difference Vegetation Index (NDVI), Soil Adjusted Vegetation Index (SAVI), Enhanced Vegetation Index (EVI) and Normalized Difference Moisture Index (NDMI). The patterns and characteristics of these spectral indices, across arable fields with different crop types following the winter growing seasons, have not yet been analyzed in detail. This research aims to provide a comprehensive study of each input spectral index and its impact on the crop classification model. Each spectral index is analyzed across a series of crop fields, using Sentinel-2 images, carefully selected to follow the patterns of winter crop phenology, and the results of unsupervised classification for each crop type in Norfolk, UK are successfully generated and analyzed. The different growing rates between winter barley and wheat have been classified found on a monthly basis using Sentinel-2 RGB images and thus the images during the harvest time, May and June, can support crop classifications. Wild grasses or other plants on the fields led to some crop misclassification from November to March in the Sentinel-2 RGB images. Similarity between winter barley and wheat and the different sowing time among the same type of crop also led to misclassification. In future these misclassifications could be avoided through better understanding of the relation between spectral indices and crop planting cycles.

9:45am - 10:00am

Automated Monitoring of Crop Pests Using Low-cost RGB Sensors and Edge AI

Emma Dubrûle^1,2, Mickaël Germain^1,2, Marc Bélisle^1,2, Yacine Bouroubi^1,2

¹Université de Sherbrooke, Canada; ²Réseau québécois de recherche en agriculture

Current pest monitoring relies on labor-intensive manual scouting, often leading to preventive insecticide use, highlighting the need for automated surveillance. This study presents low-cost RGB camera sensors integrated with edge artificial intelligence (AI) for real-time aphid detection, enabling timely and targeted interventions. Using field images, we trained the YOLO11-n model and evaluated its performance under commercial farming conditions, achieving an average precision of 85 % for apterous aphids. The complex structure of lettuce, with overlapping leaves and shaded areas, limits detection accuracy, particularly for nymphal stages. Nevertheless, these results pave the way for affordable precision agriculture solutions to sustainably improve pest management.

8:30am - 10:00am

ICWG III/IVa-C: Disaster Management
Location: 715A

8:30am - 8:45am

Residual-aware multi-sensor 3-D coseismic displacement decomposition: the 2025 Mw 7.7 Myanmar earthquake

Sen Lyu^1,2, Mahdi Motagh^1,2

¹GFZ Helmholtz Centre for Geosciences, Potsdam, Germany; ²Leibniz University Hannover, Institute of Photogrammetry and Geoinformation, Hannover, Germany

We present a residual-aware, multi-sensor 3-D coseismic displacement decomposition applied to the 2025 Mw 7.7 Myanmar earthquake. The workflow combines multi-track Sentinel-1 SAR pixel offsets (range and azimuth) with Sentinel-2 optical pixel offsets, using only the north–south component where the signal clearly exceeds the optical noise level.

The key innovation is to handle sensor- and mosaicking-related residuals within a robust inversion framework rather than as ad hoc preprocessing. Strip-wise and inter-track trends are removed by MAD–Tukey IRLS plane fitting that suppresses long-wavelength orbital and viewing-geometry errors while preserving sharp near-fault steps in overlap zones. A residual-aware weighted least-squares inversion is then performed per pixel to recover east–west, north–south and vertical displacements and their fault-parallel projection. The resulting fields provide spatially continuous, cross-sensor-consistent constraints on fault-parallel slip along this exceptionally long rupture.

8:45am - 9:00am

Spatiotemporal Analysis And Forecasting Of Ground Deformation Using PS-InSAR

Aditya Kumar Thakur¹, Avinash Singh², Rahul Dev Garg¹, Kamal Jain¹

¹Department of Civil Engineering, Indian Institute of Technology Roorkee, Haridwar, India; ²Department of Mining Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, India

In Kolkata, potential land subsidence occurred primarily due to excessive groundwater extraction, which has been one of the major environmental crises, along with rapid urbanization and soft soil characteristics. This study investigates Kolkata's land surface deformation patterns from 2017 to 2023 using Persistent Scatterer Interferometric Synthetic Aperture Radar (PS-InSAR) technology. The study comprehensively analyzes deformation scenarios from 2017 to 2022; additionally, a detailed examination of the 2023 deformation scenario reveals continued trends and localized changes in subsidence patterns. The result shows that the mean ground velocity between 2017 and 2022 varies between -2.8 and -5.5 mm/year, and the area under the subsidence zone shows an increasing trend. Predictive models for 2024 and 2025 are developed based on historical data, providing forecasts of future subsidence trends. The prediction indicates that in 2024, the area under the high deformation class will be relatively higher compared with 2025. Spatial association analyses explore correlations between subsidence patterns of different years in Kolkata. The findings of this study may facilitate the assessment of the possible effects of ground-level movement on resource management, safety, and economics in this densely populated city.

9:00am - 9:15am

Integrating Unsupervised Change Detection and Deep Learning Segmentation for Automated Landslide Mapping

Gazali Agboola, Eden Wasehun, Leila Hashemi Beni

College of Science and Technology, North Carolina A&T State University, United States of America

Rapid and accurate detection of landslides after extreme climate events, such as heavy rainfalls or hurricanes, is essential for hazard response and mitigation. Traditional mapping methods rely on manual interpretation or labelled datasets, limiting scalability. This paper presents an integrated workflow combining unsupervised autoencoder-based + KMeans change detection and deep learning semantic segmentation to improve landslide identification in Western North Carolina following Hurricane Helene (September 2024). The approach leverages Planetscope RGB-NIR imagery at 3 m spatial resolution and North Carolina Department of Environmental Quality post-event landslide inventory points. The unsupervised autoencoder extracts latent features and highlights change zones, while segmentation models such as UNet learn spatial–contextual patterns from semi-automated labels. Results demonstrate high detection accuracy with segmentation models achieving strong overlap with ground-truth inventories and minimal false positives with an F1-score of 92%. This hybrid pipeline bridges rapid unsupervised detection and precise pixel-level segmentation, enabling scalable, near-real-time landslide mapping.

9:15am - 9:30am

A Segmentation-Based Multimodal Framework for Operational Landslide Mapping Using Post-Event SAR

Jeewantinie Kapilaratne, Shin-Ichi Kaneta

Asia Air Survey Co. Ltd., Japan

Rapid and reliable landslide mapping is critical for post-disaster response, yet Synthetic Aperture Radar (SAR)-based detection remains challenging due to speckle noise, geometric distortions, and complex terrain. This study develops an operational post-event landslide extraction framework using a UNet segmentation architecture with multimodal geospatial data fusion. High-resolution COSMO- SkyMed SAR imagery is combined with terrain representations derived from Digital Elevation Models (DEM), Red Relief Image Maps (RRIM), and rainfall indices to evaluate the contribution of complementary geospatial information to segmentation performance.

Experiments were conducted across three major landslide-triggering events in Japan (Kyushu, Hokkaido, and Kumamoto), comparing SAR-only and multimodal configurations. Results demonstrate that integrating terrain information and rainfall data improves landslide detection performance compared with SAR-only inputs. RRIM consistently outperforms DEM as a topographic descriptor, particularly in steep or heterogeneous terrain, while rainfall information provides moderate gains in recall. Boundary-based metrics further indicate improved geometric fidelity of mapped landslides when multimodal inputs are incorporated. The framework requires only a single post-event SAR acquisition supplemented with publicly available ancillary datasets, enabling rapid and scalable generation of landslide inventories without reliance on pre-disaster imagery. These findings establish a reproducible baseline for SAR-driven landslide segmentation and highlight the potential of multimodal geospatial data fusion for operational disaster response and hazard monitoring.

9:30am - 9:45am

Tracking Snow Avalanches: Integrating Field Observations and Satellite-Derived Indicators

Suvrat Kaushik¹, Fatima Karbou¹, Léo Viallon- Galinier¹, Adrien Mauss²

¹Météo-France, CNRM, Centre d’Études de la Neige (CEN), Grenoble, France; ²Météo-France, Centre de Météorologie Spatiale (CMS), Lannion, France

In this study, we integrated information from the French avalanche database, high-resolution digital elevation models (DEMs), and Sentinel-1 SAR images to model avalanche extents for events occurring across three distinct time periods in three French massifs. The modelled avalanche extents were compared with manually delineated polygons mapped over SAR RGB composites generated using the principle applied in colour-based change detection algorithms. The comparison revealed a strong correspondence between the two independent approaches, with IoU values ranging from 0.42 to 0.47 and F1 scores between 0.58 and 0.63 across the different massifs. We further analyzed the distribution of SAR backscatter values in pre- and post-event images across different zones of the avalanche paths. The results indicated that a fixed 3 dB threshold would most likely be insufficient to capture the complete avalanche extent, as certain zones exhibited backscatter increases of less than 3 dB in post-event SAR imagery. As a result, a multi-threshold approach based on different avalanche zones is recommended. Finally, we assessed the potential of Sentinel-2 optical imagery to detect surface changes and characterize the physical behaviour of avalanche-affected paths following intense avalanche

events. However, the results were inconsistent, exhibiting the expected trends in one study area but nearly opposite patterns in the other, indicating that the integration of optical data for automated avalanche mapping may not always be reliable.

8:30am - 10:00am

WG II/3E: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

8:30am - 8:45am

Technical Scheme for 3D Digital Map Production Based on the SSW Vehicle-mounted LiDAR Mobile Mapping System (VMMS)

You Jia, Eryan Zhang, Junyong Hu, Yang Jia, Weigang Liu

Shaanxi TIRAIN Science & Technology Co., Ltd., People's Republic of China

To meet the growing demand for 3D digital map applications and to better understand the multi-level spatial structure of cities, some cities have implemented citywide 3D digital map programs. In 3D digital map production, vehicle-mounted mobile surveying is a key component. Drawing with a practical project, this paper proposes a technical scheme for road data acquisition and processing based on the SSW VMMS (Vehicle-mounted Mobile Mapping System). Through integrated processing steps, including combined navigation solution, point cloud correction, image coordinate calculation, image deblurring, point cloud coloring, point cloud denoising, and Orbit GT data preparation, the rapid production of colored point cloud data with georeferenced coordinates, 360° panoramic image data, and individual image data is achieved. A technical scheme suitable for 3D digital map production along urban roads was developed and validated. The results produced by this scheme have passed inspection and acceptance, and were released to the public free of charge as the first batch of visualized 3D map data on the Common Spatial Data Infrastructure Portal (portal.csdi.gov.hk), receiving widespread attention and positive recognition from various sectors of society. This scheme not only promotes the broader application of the SSW VMMS but also provides effective reference for similar urban vehicle-mounted mobile mapping projects.

8:45am - 9:00am

Road Network Vectorization With Geometric Enforcement

Zhenyu Zhu^1,2, Elena Di Bernardino², Florent Lafarge¹

¹Inria, France; ²Université Cote d'Azur, France

We present an automatic algorithm for graph-based road network extraction from remote sensing images. While existing works mostly focus on improving accuracy, we address the problem of the geometric quality of the output graphs. The state-of-the-art methods largely overlook this aspect by generating graphs without strong geometric guarantees, regularity preservation and low-complexity, which, ultimately, reduces their impact in many application scenarios. Our algorithm relies upon foundation models that analyze road networks with pixel-based representations, as well as geometric algorithms and data structures in charge of connecting geometric primitives into planar graphs. This hybrid strategy allows us to strongly enforce the geometric quality of the output graphs while bringing a high level of generalization. We show the potential of our algorithm and its advantages over existing methods on two datasets commonly-used in the field using both the conventional accuracy metrics and new metrics introduced to measure the geometric quality of the output graphs.

9:00am - 9:15am

A practical workflow for road slopes monitoring using handled mobile mapping systems

José Miguel Gómez López, José Luis Pérez-Garía, Antonio Tomás Mozas-Calvache, Jorge Delgado-García, Diego Vico-Gacía

Universidad de Jaén, Spain

High-resolution monitoring of road infrastructure is essential for the early detection of geomorphological instabilities such as landslides and erosion. This study evaluates the performance of handled MMS under different vehicle-mounted configurations: a 2-meter survey pole versus a suction-cup mount, and varying acquisition speeds (10 and 20 km/h). Furthermore, a GNSS-denied scenario was simulated to test the robustness of SLAM-based processing. Initial results revealed significant geometric discrepancies (double-points artifacts and drift), particularly in the SLAM-only and high-speed datasets. To address this, an automated segment-based refinement workflow was developed using a ICP algorithm. The refinement successfully reduced the standard deviation to the level of the point cloud´s mean point spacing (5 cm). Comparative multitemporal analysis against UAV-LiDAR reference data confirms that the proposed refinement renders even SLAM-processed data viable for detecting centimetric terrain displacements. The findings demonstrate that while suction-cup mounting at 10 km/h is optimal, algorithmic refinement allows for reliable road slopes monitoring and change detection across all tested configurations

9:15am - 9:30am

Assessing positional accuracy of photogrammetric multi-camera systems for mapping underground utility pipelines

Luca Perfetti¹, Ahmad Elalailyi², Federica Marotta³, Francesco Fassi², Giorgio Paolo Maria Vassena¹

¹Università degli Studi di Brescia, dept. of Civil Eng., Architecture, Territory, Environment and Mathematics (DICATAM), Italy; ²Politecnico di Milano, dept. of Architecture, Built environment and Construction engineering (ABC), Italy; ³Consorzio di Bonifica di Piacenza, Italy

Underground utilities such as water pipelines and sewers are critical for urban systems, yet their management is challenging due to limited accessibility and uncertain positional data. Current inspection practices rely on robotic crawlers equipped with CCTV cameras or man-entry inspections, enabling visual documentation of structural conditions but lacking accurate georeferencing of internal points. Advanced solutions relying on panoramic imaging and IMUs offer partial 3D measurements and trajectory estimation, though accuracy remains limited by drift and environmental variability.

This study investigates the feasibility of multi-camera photogrammetry for mapping pipelines and confined underground environments and improving positional accuracy. Preliminary experiments were conducted using the Atom-Ant3D system on two test sets: (i) five pipelines of varying materials (concrete, PVC, fiberglass) and diameters (60–110 cm); and (ii) a 1.3 km water-distribution tunnel (~2 m diameter) prepared with 28 fixed targets measured via total station for accuracy evaluation. Data were acquired using robotic and handheld configurations and processed through two workflows: Structure-from-Motion (SfM) and multi-view V-SLAM.

Accuracy assessment focused on the tunnel test, comparing unconstrained and constrained trajectories against a reference solution. Results provide insights into the potential of photogrammetric approaches for precise pipeline reconstruction and georeferencing, supporting improved subsurface utility management and planning.

9:30am - 9:45am

Beyond Centers: Bounding-Box Voxel Projection for Multi-View 3D Detection and Tracking

Rasho Ali, Max Mehltretter, Christian Heipke

Leibniz university hannover, Germany

3D multi-view, multi-object tracking (3D MV-MOT) makes use of multiple cameras to reduce the number of missed detections and to mitigate occlusions. Most current 3D MV-MOT methods suffer from information loss when associating 3D locations with 2D image features via a 3D-to-2D projection, as they use a discrete grid in 3D and sample image features only at the projected centers of each grid cell. Thus, all other feature information is lost. An additional information loss commonly arises during cross-view aggregation when applying max or average pooling: these methods either overemphasize a single view or treat conflicting views, that depict different entities, e.g., due to occlusions, equally.

In this work, we introduce two novel modules for 3D MV-MOT, employed to pedestrian tracking, that target these limitations:

(i) VoxROI aggregates all image features that fall within the bounding box around a voxel's projection into each respective image, instead of only sampling features at the projected voxel center.

(ii) SimFuse aggregates per-view voxel features into one coherent feature representation per voxel, using similarity weights computed from re-identification (Re-ID) features.

Subsequently, they are used to measure cross-view identity similarity. Views with higher Re-ID feature similarity receive larger weights, while inconsistent views are suppressed.

Experimental results on the WildTrack dataset confirm our method's effectiveness for multi-view pedestrian detection and tracking, reaching, and in particular in cross-view scenarios improving, the general state-of-the-art. The approach maintains strong performance across different camera configurations, demonstrating its generalization capability when training and testing on different camera setups.

9:45am - 10:00am

Fine-Grained Urban Low-Altitude Airspace Gridding with Dynamic Event Response and Vertical Air-Route Corridors

Leyang Zhao, Hang Mei, Xing Lin, Weixi Wang, Xiaoming Li, Sichang Wang, Lei Han, Yang Zhao, Ding Ma

Research Institute for Smart Cities, School of Architecture and Urban Planning, Shenzhen University,

With the rapid growth of urban low-altitude applications, traditional airspace management approaches based on simple altitude limits and static no-fly zones can no longer meet the demands of high-density and highly dynamic operations. To address this issue, this study proposes a fine-grained gridding method for urban low-altitude airspace with dynamic event response and vertical flight corridor constraints. First, a unified three-dimensional grid model is constructed on the basis of an urban 3D digital twin platform, and the grid scale and update cycle are determined by jointly considering clearance requirements and safety separation. Second, a method for injecting static and dynamic attributes is established to achieve the unified representation and continuous updating of terrain, buildings, no-fly and restricted zones, wind fields, temporary restrictions, as well as occupancy and release information within the grid. Third, fixed-geometry and dynamically open vertical flight corridors are designed to support controlled cross-layer flight transitions and reduce the risk of vertical conflict propagation. An experimental system is developed using a typical high-density urban area in Yuehai Subdistrict, Nanshan District, Shenzhen, as the case study. The results show that the proposed method can achieve stable spatial discretization, accurate attribute loading and updating, and clear organization of cross-layer flight. The proposed method provides a unified technical framework for low-altitude airspace representation, state management, and operational governance in complex urban environments.

8:30am - 10:00am

IvS6A: Canadian Remote Sensing for Urban Applications
Location: 716A

8:30am - 8:45am

Urban Growth, NO₂ Pollution, and Economic Development Across Global Megacities

Thilo Erbertseder, Lorenza Gilardi, Thomas Esch, Hannes Taubenböck

Earth Observation Center, German Aerospace Center (DLR)

Megacities—defined as Functional Urban Areas (FUAs) of more than 10 million inhabitants—are global hotspots of population growth, economic activity, and environmental pressure. Their development trajectories shape regional and global emission patterns, yet a comprehensive understanding of how urban expansion, air pollution, and economic development interact over time has remained limited. While prior research has examined either urban growth or atmospheric pollution trends, an integrated analysis linking both dimensions within a socio-economic framework is still lacking. This study addresses this gap by leveraging long-term Earth Observation (EO) datasets to systematically analyze settlement growth and tropospheric nitrogen dioxide (NO₂) pollution across 38 megacities between 1996 and 2015. Using the World Bank income classification, we evaluate whether observed environmental and urbanization patterns align with the Environmental Kuznets Curve (EKC)—a hypothesis that posits a non-linear (inverted U-shaped) relationship between environmental degradation and economic development.

8:45am - 9:00am

Mapping Environmental Equity: Urban Green Spaces and the 3-30-300 Rule in Canada

Kaushik Roy¹, Saeid Homayouni¹, Ying Zhang²

¹INRS, Quebec City, Canada; ²Natural Resources Canada

Urban green space accessibility represents a critical dimension of sustainable planning and public health outcomes. This research quantifies compliance with the "3-30-300" framework - requiring residents to view three trees from home, neighborhoods to maintain 30% canopy coverage, and proximity to public green space within 300 meters across Montreal Island and Quebec City. While this policy has gained substantial theoretical traction, empirical implementation assessment in Canadian urban contexts remains limited. Employing high-resolution remote sensing imagery, deep learning-based land cover classification, and LiDAR-derived canopy data, we conducted comprehensive spatial analysis of both municipalities. Road network data from OpenStreetMap enabled walkability assessment. Integrated compliance metrics (I330300) revealed stark disparities: Montreal achieved 20.93% compliance, while Quebec City registered merely 2.69%. These findings underscore substantial green space accessibility deficits across both municipalities, with particular concentration in peripheral neighborhoods. Spatial statistical analysis identified pronounced clustering of non-compliance zones, demonstrating heterogeneous distribution of environmental amenities. Population demographic analysis revealed significant correlations between socioeconomic indicators and green space availability, suggesting environmental inequity patterns. Such disparities raise critical equity concerns regarding differential access to environmental services and associated health benefits. These results directly advance United Nations Sustainable Development Goal 11 objectives for establishing inclusive, sustainable cities. The quantitative assessment methodology demonstrates the efficacy of integrating remote sensing, machine learning, and spatial analysis for evidence-based urban environmental policy evaluation. Findings provide empirical foundations for targeted interventions addressing green space deficits in underserved urban communities, enabling data-driven municipal planning strategies that prioritize equitable environmental resource distribution and enhanced public health outcomes.

9:00am - 9:15am

Measuring Heat Stress and Mitigation Capacity Around Transit Stops Using Hyperlocal Microclimate Data

Naser Ahmed, Jinfei Wang, Jinhyung Lee

Department of Geography and Environment, Western University

This presentation examines heat stress and mitigation capacity around transit stops during an extreme heat wave in Vancouver, Canada. Using hyperlocal microclimate modelling and high-resolution urban geometry data, we estimate “feels-like” Mean Radiant Temperature and shade availability to develop two new indicators: the Transit Stop Heat Stress Index and the Transit Stop Mitigation Capacity Index. Results reveal strong spatial and socio-economic disparities, with higher heat exposure and fewer mitigation features in lower-income neighbourhoods. The study demonstrates how microclimate data can guide climate-responsive, equitable transit planning under intensifying heat conditions.

9:15am - 9:30am

Landfill methane emission detection and quantification using a drone-based path-integrated TDLAS sensor

Mohammad Rokhafrouz, Jinfei Wang, James Voogt

Dept. of Geography and Environment, Western University, London, Ontario, Canada

Landfills are among the largest anthropogenic sources of methane, yet accurately detecting and quantifying their emissions remains challenging due to diffuse release patterns, complex terrain, and weather-driven variability. This presentation introduces a drone-based monitoring approach that uses a path-integrated Tunable Diode Laser Absorption Spectroscopy (TDLAS) sensor to detect and quantify methane emissions at a municipal landfill in London, Canada. Methane measurements collected throughout the year, together with on-site meteorological observations, were integrated into an inverse atmospheric plume-dispersion model to estimate emission rates. This contribution demonstrates the potential of drone-based TDLAS measurements to provide practical, high-resolution landfill methane monitoring and to reduce uncertainties in greenhouse gas reporting and mitigation efforts.

9:30am - 9:45am

Coupling dynamic cities and climate: the urbisphere project

Nektarios Chrysoulakis¹, Jörn Birkmann², Andreas Christen³, Sue Grimmond⁴

¹FORTH, Greece; ²University of Stuttgart, Germany; ³University of Freiburg, Germany; ⁴University of Reading, United Kingdom

Climate change and urbanization transform life globally, with direct impacts on each other, yet they are rarely studied together across disciplines. The Synergy Grant urbisphere (https://urbisphere.eu), funded by the European Research Council (ERC), aims to forecast feedbacks between climate and cities. With new synergies between four disciplines (spatial planning, remote sensing, modelling and ground-based observations), urbisphere incorporates city dynamics and human behaviour into climate forecasts/projections, focusing on within-city dynamics of peoples’ activities and how these can be up-scaled to cities globally. urbisphere is studying inter/intra-city form and function (demographics, mobility, climate adaptation and vulnerability planning typologies), exploring human/socio-economic vulnerability, exposure, risk perception, coping/adaptive measures to climatic stressors and settlement/building typologies.

urbisphere is developing new ways to represent city dynamics for weather/climate models. These models are informed by the urbisphere developed Earth Observation system, using space-borne/airborne and ground based sensors with near real-time data transmission, processing and visualization of data from 500+ sensors, including a network of ceilometers, scintillometers, Doppler wind lidars, flux towers combined with street-level and indoor sensors. Combined these measure the 3-dimensional state of the atmosphere and the surface. These observations are providing both new understanding of urban surface-atmosphere processes and datasets for model evaluation at unprecedented detail.

8:30am - 10:00am

ThS5: Large Language Models for Intelligent LiDAR Point Cloud Processing
Location: 716B

8:30am - 8:45am

GeoOpen3D: Geometry-guided training-free open-vocabulary 3D segmentation via visual foundation models

Shuai Zhang¹, Zhuoxiao Li¹, Jing Ou¹, Tengxi Wang¹, Zhecheng Shi^1,2, Wufan Zhao¹

¹The Hong Kong University of Science and Technology (Guangzhou), China; ²School of Computer and Communication Engineering, Northeastern University, China

Open-vocabulary 3D segmentation offers an attractive alternative to closed-set scene parsing, yet directly transferring 2D vision-language models to outdoor point clouds remains difficult because projection disrupts geometric continuity and sparse sampling weakens mask quality. This paper presents GeoOpen3D, a geometry-guided and training-free framework for open-vocabulary 3D point cloud segmentation. GeoOpen3D constructs a geometry-preserving RGB-D representation through projection, super-sampling, and depth enhancement to improve alignment between 3D structure and 2D foundation models. It then combines GroundingDINO for language-driven proposal generation with SAM for mask extraction, while introducing depth-aware regularisation to favour structurally coherent regions and clearer boundaries. The selected masks are back-projected to the original point cloud through pixel-to-point correspondence, yielding point-wise semantic labels without any 3D model training. Experiments on the SensatUrban dataset show that GeoOpen3D achieves 42.1\% mIoU, including 98.5\% IoU for buildings and 97.3\% IoU for vegetation, outperforming existing training-free open-vocabulary baselines. Additional experiments on a custom island dataset further demonstrate promising transferability to unseen categories. These results indicate that geometry-guided 2D-to-3D transfer provides an effective and scalable path towards open-vocabulary understanding of large-scale outdoor scenes.

8:45am - 9:00am

SPARC: Scalable 3D Panoptic Segmentation with Reinforcement-driven Clustering

Haoling Huang, Ting Han, Jie Ouyang, Yiping Chen

Sun Yat-sen University, China, People's Republic of

Large-scale 3D panoptic segmentation is critical for digital twins and geospatial analysis, demanding models that process massive point clouds while distinguishing instances across highly diverse spatial scales. However, prevailing graph-based approaches rely on one-shot optimization, suffering from \textit{short-sighted decisions} where irreversible local errors propagate globally, leading to severe under-segmentation at boundaries between objects of disparate scales. To overcome this short-sightedness, we present \textbf{SPARC}, a scalable framework that reframes graph clustering as a sequential, self-correcting decision process driven by hierarchical reinforcement learning. Specifically, SPARC employs a dual-level agent where a meta-controller adaptively determines instance completeness while a low-level policy iteratively refines edge affinities, enabling the model to revise early mistakes based on long-horizon rewards rather than greedy local cues. Complementing this, we introduce Semantic Voxel Partitioning (SVP) to generate semantically coherent superpoints, ensuring robust primitives that mitigate noise before clustering begins. Extensive experiments demonstrate that SPARC achieves state-of-the-art performance on the DALES dataset with a Panoptic Quality of 62.4\%, surpassing existing methods by 9.8\% and effectively resolving multi-scale segmentation ambiguities.

9:00am - 9:15am

LaSA-Net: A Language-Guided Network for Large-Scale 3D Referring Expression Segmentation on the UrbanRefer Benchmark

Bin Liu^1,2,3, Lingfei Ma^1,2,3,4, Haiyan Guan⁵, Jonathan Li^1,2,3,4

¹Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, Shanghai 200241, China; ²Key Laboratory of Spatial-temporal Big Data Analysis and Application of Natural Resources in Megacities, Ministry of Natural Resources, East China Normal University, Shanghai 200241, China; ³School of Geospatial Artificial Intelligence, East China Normal University, Shanghai 200241, China; ⁴Hinton STAI Institute, East China Normal University, Shanghai 200241, China; ⁵School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China

3D Referring Expression Segmentation (3DRES) aims to segment point cloud scenes based on a given expression. However, existing 3DRES methods face three main challenges: (1) significant progress has been made in indoor scenes, yet large-scale and complex outdoor scenes, captured by airborne or mobile LiDAR, remain fully unexplored; (2) traditional methods often suffer from inefficiency and mis-segmentation due to insufficient attention to the spatial information of instances during query generation; and (3) existing models treat all queries equally in the decoder and predict the final mask in one step, which is inefficient in outdoor road scenes dominated by background point clouds, where objects are sparse and small. To address these challenges, a new outdoor 3DRES benchmark, named UrbanRefer, is introduced. The dataset consists of 100 large-scale outdoor scenes and 1,100 specially designed long textual descriptions, emphasizing geospatial relationships and multi-object contexts unique to outdoor environments. Additionally, the Language-guided Spatial Anchoring Network (LaSA-Net) is proposed for the directional segmentation task in outdoor scenes. Specifically, the Local-Global Aggregation (LGA) module is incorporated into the backbone to enhance local and global context awareness, effectively optimizing point features. Furthermore, a Text-driven Localization (TL) module is introduced, which directly predicts the 3D positions of all entities mentioned in the text, providing robust spatial priors for the decoder. Finally, a Hierarchical Prompt-aware Decoder (HPAD) is designed to locate rough regions by extracting task-driven signals from the interaction between expressions and visual features. Extensive experiments demonstrate that the mIoU metric of LaSA-Net outperforms state-of-the-art methods by 0.9%.

9:15am - 9:30am

Scenereasoner: Decoupled Spatial Tokenization for large-scene understanding with llms

Bohang Shi, Shengjun Tang, Xiaoming Li, Weixi Wang, Linfu Xie, Baoding Zhou

Shenzhen University, Shenzhen, Guangdong, People's Republic of China

Most existing 3D vision-language models focus on object-level or single-room understanding and perform poorly in large-scale,

multi-room indoor environments where task-relevant objects constitute only a small fraction of the total point cloud. When multi-

room point clouds are fed directly into an LLM, critical semantic signals are diluted by the vast amount of redundant background,

making it difficult for the model to focus on truly relevant regions. We propose SceneReasoner, a decoupled spatial tokenisation

framework that addresses this challenge through three core designs: (1) pre-tokenisation text-guided feature weighting that leverages

the shared CLIP embedding space between OpenScene point features and text queries to amplify question-relevant point features

before any spatial compression occurs; (2) 2D–3D feature fusion that integrates top-down 2D CLIP features with 3D sparse tokens,

supplying the model with appearance semantics—such as texture, material, and room layout—absent from raw point clouds; and

(3) layer-wise dense feature injection that inserts local dense features into the LLM attention mechanism layer by layer for fine-

grained perception of key regions. We evaluate on the XR-Scene benchmark, which covers cross-room question answering and

scene captioning over HM3D indoor environments with an average area of 132 m2. SceneReasoner achieves the best CIDEr on

XR-SceneCaption (+0.33 over LSceneLLM), the highest METEOR on XR-QA, and competitive ROUGE-L across all three tasks,

demonstrating the effectiveness of task-guided spatial tokenisation for large-scene understanding.

9:30am - 9:45am

Llm-Supervised Point Cloud Processing: from Unsupervised 3D Scene-Graph Generation to Interactive Scene Manipulation

Florent Poux^1,2, Alex Key³

¹3D Geodata Academy, France; ²Geoscity Lab, University of Liège, Belgium; ³Panoriq AI, Germany

Understanding and manipulating 3D spatial environments remains a fundamental challenge in geospatial sciences, with applications spanning digital twins, facility management, urban planning, and autonomous systems. While point cloud acquisition technologies have matured significantly, the semantic interpretation and interactive manipulation of captured 3D scenes continue to require extensive manual intervention and domain expertise. This paper presents a novel LLM framework that bridges unsupervised graph-based 3D scene understanding with natural language-driven interactive manipulation, enabling context-aware spatial intelligence at scale.

9:45am - 10:00am

Multimodal Large Language Models to road inventory with non-photorealistic Point Cloud visualization

Horia Ameen, Mario Soilán, Henrique Lorenzo Cimadevila, Jesús Balado-Frías

CINTECX, Universidade de Vigo, GeoTECH, 36310, Vigo, Spain

Accurate road inventories are crucial for maintenance, safety, and resource allocation, with automation improving efficiency but often lacking user-friendly human-machine interaction. This paper evaluates how non-photorealistic rendering of 3D point clouds impacts Multimodal Large Language Models (MLLMs) interpretation for road inventory, testing three methods on real road data in Santarém, Portugal. From 3D point clouds coloured with RGB information, non-photorealistic techniques are implemented and compared: Ambient Occlusion (AO), Eye-Dome Lighting (EDL) and Multi Feature-Rich Synthetic Color (MFRSC). Several state-of-the-art MLLMs are also tested: GPT5, Gemini2.5-Pro, Gemini2.5-Flash, CogVLM2, MiniCPM-V, Llama4-scout-17b, Mistral-Small3.2, Qwen2.5vl and Gemma3. The results indicate that non-photorealistic techniques do not hinder the identification of road elements by MLLMs, indicating their potential for 3D point cloud classification tasks even when true RGB colour is not available. Furthermore, the overall performance metrics, with F-scores over 80% for proprietary, state-of-the-art models (GPT5, Sonnet 4.5 and Gemini) show that 2D captures of 3D point clouds can be a suitable data source for zero-shot object classification.

8:30am - 10:00am

ThS4B: Toward Smart Forests: Emerging Tools in Remote Sensing, Artificial Intelligence, and Field Robotics
Location: 717A

8:30am - 8:45am

A Generative Upsampling Framework for Reconstructing High-Density Tree Structures from Low-Density Airborne Lidar

Erfan Hasanpour Zaryabi¹, Liam Bennett¹, Josh Qixuan Sun², Mark Crowley², Laura Chasmer³, Christopher Hopkinson³, Jeff Boisvert¹

¹University of Alberta, Canada; ²University of Waterloo, Canada; ³Western University, Canada

Light Detection and Ranging (lidar) has become an essential tool to quantify forest structure in three dimensions, allowing extraction of tree-level metrics such as height, crown volume, diameter at breast height (DBH), and biomass. Accurate forest structure quantification supports applications such as wildfire management, biodiversity assessment, forest health monitoring, and timber management. This is particularly urgent in countries such as Canada, where wildfires pose a significant challenge to forest management due to their increasing frequency and severity; advanced fire behavior models aid wildfire preparedness by predicting fire behaviour at fine-scale in 3D but require detailed 3D fuel information including canopy and ladder fuels.

Terrestrial Laser Scanning (TLS) and Uncrewed Aerial Vehicle (UAV) lidar provide dense point clouds that allow highly accurate characterization of individual trees, critical for assessing forest attributes and wildfire fuel characteristics. However, their limited spatial coverage makes them neither time- nor cost-effective for mapping extensive forested regions. Airborne Laser Scanning (ALS), in contrast, covers broad areas efficiently by collecting data from higher altitudes, but at the cost of lower point densities (typically 1–100 points/m²), insufficient for precise individual tree characterization.

To address this challenge, this study reconstructs densified tree point clouds from low-resolution ALS data using an upsampling framework based on a deep generative network trained on real and synthetic datasets. This approach bridges the gap between ALS’s extensive coverage and the detailed structural information provided by TLS and UAV lidar, enabling accurate, large-scale quantification of forest structure for applications such as wildfire management and monitoring.

8:45am - 9:00am

Tree Localization Using Integrated Heading, DBH and Ultra-Wideband for Precision Forestry

Zuoya liu¹, Harri Kaartinen¹, Antero Kukko^1,2, Juha Hyyppä^1,2, Ruizhi Chen³

¹Department of Remote Sensing and Photogrammetry, Finnish Geospatial Research Institute FGI in the National Land Survey of Finland, Vuorimiehentie 5, 02150 Espoo, Finland; ²Department of Built Environment, School of Engineering, Aalto University, P.O. Box 11000, FI-00076, Aalto, Finland; ³School of Data Science/School of Artificial Intelligence, The Chinese University of Hong Kong, Shenzhen, China

Accurate tree positions play a vital role in precision forestry and environmental sciences. In this study, we propose an accurate, efficient, and adaptable method for tree localization by integrating heading, diameter at breast height (DBH), and ultra-wideband technology. The proposed method is simple to implement in different forest environments and can determine the position of each tree within a few seconds. Compared with traditional field measures, such as laser rangefinders and inclinometers, the proposed approach is more efficient. In comparison with commonly used measures, such as terrestrial laser scanning (TLS) and mobile laser scanning (MLS), the proposed method is more cost-effective and easier to implement, making it particularly suitable for natural forests that are remote from roads yet require accurate measurements. Field experiments were conducted in a managed boreal forest in southern Finland, characterized by minimal understory vegetation and good visibility, where a total of 50 trees were mapped. Experimental results indicate that the proposed method can accurately determine tree positions with an RMSE of 0.12 m and an MAE of 0.11 m.

9:00am - 9:15am

Automatic phenotyping of the 3D tomato plant based on a clustering algorithm and geometric characteristic

Maurycio Oviedo Espinosa², Eija Honkavaara¹, Aluir Dal Poz², Mauricio Galo², Renan de Oliveira², Rodrigo Gallis³, Gabriel Maciel³

¹Department of Remote Sensing and Photogrammetry, Finnish Geospatial Research Institute, Finland; ²São Paulo State University, Brazil; ³Federal University of Uberlândia, Monte Carmelo, Brazil.

Plant phenotyping has become a fundamental tool in modern agronomic research, enabling quantitative analysis of morphological characteristics that can be collected in three dimensions using photogrammetric techniques or point clouds obtained by LiDAR systems. However, automatic segmentation of plants, especially the main stem and its branches, still poses a challenge for certain crops. This work proposes a non-destructive, geometry-based methodology for morphological phenotyping of tomato plants (Solanum lycopersicum) using photogrammetric point clouds. The proposed methodology consists of the following steps: stratification of the plant into horizontal sections; clustering of each stratum using the DBSCAN algorithm; selection of clusters based on the linearity tensor derived from eigenvalue analysis; and the fitting of a 3D cylinder to the linear clusters to approximate the main stem. The method was validated using manually labeled point clouds from nine tomato cultivars, achieving accuracy between 88% and 97%, with average F1-scores of 63.6% for the stem and 96% for the branches

9:15am - 9:30am

Linking TreeQSM with SAR and ALS to Detect Internal Canopy Allocation Shifts Across Scales

Hanna Elisabet Sorokina¹, Mariana Campos¹, Taiga Korpelainen², Rami Echriti¹, Eetu Puttonen¹, Juha Hyyppä¹, Yunsheng Wang¹

¹Finnish Geospatial Research Institute, Finland; ²University of Helsinki, Finland

Linking remotely sensed forest backscatter with fine-scale tree crown structural dynamics provides insights into tree growth strategies under varying conditions. In this study, we investigate whether branch-scale tree growth allocation dynamics, derived from multi-temporal TreeQSM models, are reflected in SAR and ALS observations. We analyzed branch organization dynamics of silver birch (Betula pendula) using terrestrial laser scanning data from 2021, 2023 and 2025 at a boreal forest site in southern Finland. Branch allocation metrics, including volume-weighted mean diameter (VWMD), small branch fraction (SBF), distal volume fraction, relative branch height, and top canopy volume, were quantified to capture shifts between structural reinforcement and exploratory growth. These metrics were compared with Sentinel-1 SAR features (α, entropy, C11, C22) and ALS-derived canopy metrics (plant area index, vertical complexity index) alongside local structural variables. Results show a consistent trade-off between coarse and fine branching, with strong negative correlations between ΔVWMD and ΔSBF across both periods (ρ = –0.92). SAR-derived α exhibits strong associations with these allocation shifts during 2021–2023 (ρ = –0.81 with ΔVWMD; ρ = 0.75 with ΔSBF), indicating sensitivity to internal redistribution of branch material. ALS metrics from 2021 reflect initial canopy structure and are associated with subsequent allocation shifts. Despite the small magnitude of observed changes, consistent monotonic relationships across datasets suggest that subtle within-crown branch allocation is detectable from satellite and aerial observations, reflecting the surrounding canopy context. However, weakened correlations in 2023–2025 highlight the influence of external factors on SAR signals.

9:30am - 9:45am

Weakly-Supervised Learning for Tree Instances Segmentation in Airborne Lidar Point Clouds

Swann Emilien Céleste Destouches, Jesse Ray Murray Lahaye, Laurent Valentin Jospin, Jan Skaloud

École polytechnique fédérale de Lausanne, Switzerland

Tree instance segmentation of airborne laser scanning (ALS) data is of utmost importance for forest monitoring but remains challenging due to variations in the data caused by factors such as sensor resolution, vegetation state at acquisition time, terrain characteristics, etc. Moreover, obtaining a sufficient amount of precisely labeled data to train fully supervised instance segmentation methods is expensive. To address these challenges, we propose a weakly supervised approach where labels of an initial segmentation result obtained either by a non-finetuned model or a closed-form algorithm are rated by a human operator. The labels produced during the quality assessment are then used to train a rating model, whose task is to classify a segmentation output into the same classes as specified by the human operator. Finally, the segmentation model is finetuned using feedback from the rating model. This in turn improves the original segmentation model by 34% in terms of correctly identified tree instances while considerably reducing the number of non-tree instances predicted.

9:45am - 10:00am

Optimisation of PointNet++ for Tree Species Classification from Drone LiDAR Data

Nada Hamdani^1,4, Imane Abouhat², Kenza Ait El Kadi^1,2, Saloua Bensiali^1,3, Imane Sebari^1,2

¹Research Unit of Geospatial Technologies for a Smart Decision, IAV Hassan II, Rabat 10101, Morocco/Société Topographie Informatique France, Morocco; ²Department of Photogrammetry and Cartography, School of Geomatics and Surveying Engineering, IAV Hassan II, Rabat, Morocco; ³Department of Applied Statistics and Computer Science; ⁴Société Topographie Informatique, 91000 Evry Courcouronnes, France

Trees play a key role in our planet. They regulate climate, preserve biodiversity, and contribute to human well-being. Each species has different contributions to our globe and a specific carbon storage potential. Identify tree species enable better measurement of global carbone, help authorities for better manage forests and green spaces. Unmanned Aerial System (UAS) LiDAR has become a powerful source of 3D point cloud for vegetation analysis, given its ability to captured large area in a short time and its capacity to penetrate canopy layers. Deep learning methods extract discriminative features directly from raw point clouds and generalize well to unseen datasets. This study optimises PointNet++ deep learning architecture for tree species classification by analysing the influence of sampling configurations on the performance of model detection, by using an open-source dataset “FOR-species20K”.Three-point cloud sampling configurations (4 096, 8 192, and 16 384 points per tree) were tested with three random seeds (0,42 and 123) to assess their impact on classification accuracy and ensure stability of prediction. Results on a separate test set of 508 trees show a consistent improvement in performance of PointNet++ with a sampling configuration of 8 192 points per tree, reaching a macro-average F1-score of 89.65%, surpassing the 74.9 % reported by (Puliti et al., 2025) for evaluating the same architecture. Dominant species such as Fagus sylvatica, Picea abies, and Pinus sylvestris achieve F1-scores exceeding 90%, indicating high model robustness.

10:00am - 10:30am

Morning Coffee Break
Location: Exhibition Hall "E"

10:00am - 5:30pm

Exhibition
Location: Exhibition Hall "F"

10:30am - 12:00pm

Plenary Session 3
Location: Exhibition Hall "G"

Keynote 1: Professor Jun Chen
Awards Ceremony:

The ISPRS Fritz Ackermann Award
Giuseppe Inghilleri Award

Keynote 2: Professor Xiaoxiang Zhu

12:00pm - 1:30pm

CRSS-SCT Member Meeting
Location: 709

Awards Ceremony

CRSS-SCT Silver Medal
Best Master’s Thesis
Best PhD Thesis
Student Symposium Travel Grants
Canadian Journal of Remote Sensing Best Paper Awards

12:00pm - 1:30pm

Lunch
Location: Exhibition Hall "E"

1:30pm - 3:00pm

WG III/1C: Remote Sensing Data Processing and Understanding
Location: 713A

1:30pm - 1:45pm

Remote sensing image semantic segmentation sample generation using a decoupled latent diffusion framework

Yue XU¹, Honghao Liu², Ruixia Yang³, Zhengchao Chen¹

¹Aerospace Information Research Institute, Chinese Academy of Sciences, China, People's Republic of; ²University of Chinese Academy of Sciences, China, People's Republic of; ³International Research Center of Big Data for Sustainable Development Goals, China, People's Republic of

Semantic segmentation deep learning algorithms still depend on large quantities of high-quality annotated samples. Because remote sensing imagery spans vast areas and highly variable land surface environments, annotation demands substantial expertise and is both time-consuming and labour-intensive, leaving the field with an acute shortage of first-rate training samples. Moreover, object categories in land cover data are inherently imbalanced. Models trained under imbalance often underperform in small sample categories.

This study proposed a decoupled latent diffusion framework for RS semantic segmentation sample generation, and a proportion-aware loss to optimize balance of sample classes. We tested the proposed method on the ISPRS Potsdam dataset and compared it with two classic image generation baselines. The results show that our approach outperforms the baselines, producing synthetic samples with superior visual quality and semantic consistency. To verify downstream utility, we trained DeeplabV3+, PSPNet, and SegFormer segmentation models with the synthesized data. Across all three networks, overall segmentation accuracy and class balance metrics improved markedly; gains were especially pronounced for the rare “Clutter” and “Car” categories, underscoring the proposed method’s generality and robustness. We further analysed how the proportion of synthetic samples affects performance. As the ratio of synthetic to real samples increased, mIoU and mF1 first rose and then declined; the best results were obtained when the proportion of synthesized samples approached 40%. This indicates that a moderate amount of synthetic sample can significantly boost segmentation performance, whereas excessive synthetic data risks over-fitting or misclassification.

1:45pm - 2:00pm

Bright-CC: A Novel Change Captioning Benchmark for Cross-Modal Remote Sensing Images

Jing Yao¹, Dongwei Sun², Yun Lin³, Zhangqi Zhou³, Haizhen Chen³, Xiangyong Cao², Zhengchao Chen¹

¹State Key Laboratory of Remote Sensing and Digital Earth, Aerospace Information Research Institute, Chinese Academy of Sciences; ²School of Computer Science and Technology, Xi’an Jiaotong University; ³Ningbo Institute of Surveying, Mapping and Remote Sensing

Existing remote sensing change captioning methods are limited to optical-only data, precluding all-weather, all-day monitoring. To address this, we introduce Bright-CC, the first large-scale benchmark for cross-modal (Optical-to-SAR) change captioning. Curated from the newly-proposed BRIGHT dataset, Bright-CC comprises 9,953 paired images focused on building damage assessment. It features dense four-class semantic labels (intact, damaged, destroyed) and a rich corpus of 49,765 GPT-4O-generated sentences (5 per pair), moving beyond simple binary change labels. Furthermore, we propose the Hybrid Feature Alignment Network (HFA-Net) as a robust baseline for this new task. HFA-Net is specifically designed to tackle the significant domain shift between heterogeneous sensors. Its architecture features: (1) a pseudo-siamese alignment module (HFEA) to project features into a common space; (2) a multi-scale atrous convolution module (CSTDF) to refine change context; and (3) a novel Lightweight Caption Generator (LCG), which is a parameter-efficient Transformer trained from scratch to avoid overfitting. Experiments show HFA-Net substantially outperforms adapted optical-only baselines (RSICCFormer, Chg2Cap) on all standard metrics. This work provides the community with a critical dataset and a strong baseline for future cross-modal spatio-temporal intelligence.

2:00pm - 2:15pm

Remote Sensing Change/Damage Image Generator Based on Prior Foundation Model and Multimodal Reference Information

Peng Chen, Guorui Ma, Haiming Zhang, Di Wang, Lunjun Fan

Wuhan University, China, People's Republic of

The scarcity and high cost of acquiring high-quality post-event remote sensing images (due to cloud cover, satellite limitations, and security risks) severely constrain the development and accuracy of change/damage detection models. This data gap is especially critical in disaster or military conflict scenarios. Existing cross-temporal image generation methods often lack precise spatial and semantic control, leading to inconsistent or unrealistic synthetic results. To address this core challenge, this paper introduces the Remote Sensing Change/Damage Generator (RSCDG), a novel method based on the Latent Diffusion Model for high-fidelity simulation of post-event satellite imagery. The RSCDG’s core innovation lies in its multimodal condition embedding framework, which integrates three specialized control pathways:The Pre-event Visual Prompt Adapter (built on PrithviModel) ensures high structural consistency between the pre-event and generated post-event images.The Spatial Location Control Pathway (using a ControlNet architecture and change/damage masks) precisely dictates the geometric location of the simulated change.The Generation Content Controller (using a CLIP Text Encoder) enhances semantic realism by guiding the model with natural language descriptions of the change/damage.Furthermore, we introduce a Mask Alignment Loss to enforce spatial and semantic adherence to detection rules. Results demonstrate that RSCDG accurately simulates complex scenarios like new urban construction and catastrophic building collapse. RSCDG is a powerful new tool designed to augment training data and significantly accelerate high-precision disaster response and urban monitoring.

2:15pm - 2:30pm

Edge Knowledge Distillation Guided Lightweight Change Detection Network

Tingyu Ji^1,2, Yixin Chen^5,6, Ruiqian Zhang¹, Xiaogang Ning¹, Xiao Huang³, Hanchao Zhang^1,7, Weibin Ma¹, Chunquan Cheng¹, Jiaming Wang⁴

¹State Key Laboratory of Spatial Datum, Chinese Academy of Surveying and Mapping, Beijing 100036, China; ²the College of Geodesy and Geomatics, Shandong University of Science and Technology, Qingdao 266590, China; ³the Department of Environmental Sciences, Emory University, Atlanta, GA 30322, USA; ⁴the Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan 430205, China; ⁵Sichuan Institute of Land Science and Technology (Sichuan Center of Satellite Application Technology), Chengdu 610045, China; ⁶Key Laboratory of Investigation, Monitoring, Protection and Utilization for Cultivated Land Resources, MNR, Chengdu 610045, China; ⁷Joint Laboratory of Spatial Intelligent Perception and Large Model Application

Deep-learning methods dominate remote-sensing change detection (CD), yet state-of-the-art models remain parameter-heavy and struggle with crisp boundaries, limiting their use on edge devices. We present LEDGNet, a Lightweight, Edge-knowledge-Distillation-Guided CD Network, that reconciles accuracy, boundary fidelity, and efficiency. LEDGNet integrates three purpose-built components: 1) an Edge Distillation Module that mines multi-scale boundary cues from a high-capacity teacher and transfers them to a compact student through an edge-aware loss; 2) StarLite, a depth-wise separable encoder that preserves fine spatial detail while minimizing floating-point operations; and 3) LiteDecoder, an inexpensive feature-fusion head that restores full resolution without bulky up-sampling. This design halves the parameters and inference time of mainstream fine-grained CD networks while enhancing edge sharpness. On the CDD and LEVIR-CD benchmarks, LEDGNet achieves competitive F1 performance while maintaining a compact footprint of 20.58 M parameters and 35.18 G FLOPs. With an inference time of 255 ms, it strikes a balance between resource consumption and detection efficiency, making it well-suited for high-efficiency remote sensing monitoring.

2:30pm - 2:45pm

Leveraging Pretrained Priors for Weakly Supervised Semantic Segmentation of Remote Sensing Images

Xin Li, Nicola Genzano, Marco Gianinetto, Marco Scaioni

politectinico di milano, Italy

Semantic segmentation of remote sensing imagery (RSI) is essential for urban mapping, land-use monitoring, and many areas. However, pixel-level annotation is expensive, making weakly supervised semantic segmentation (WSSS) that relies on image-level labels an attractive alternative. Leveraging pre-trained models offers strong priors from large-scale learned representations can help the WSSS, yet frozen models often yield sparse and misaligned class activation maps (CAMs) due to domain gaps and static inference. We propose a lightweight and efficient framework that integrates CLIP and DINO to address three challenges: semantic misalignment between generic text prompts and RSI-specific visuals, static CAM quality, and incomplete object coverage. Our design includes: (1) a Textual Prototype-Aware Enrichment (TPE) module that builds an RS-specific knowledge base using LLM generated descriptions to enrich text prompts; (2) a Unified Semantic Relation Mining (USR) module that fuses learnable adapter features with CLIP attention and DINO affinity for online CAM refinement; and (3) a Visual Prototype-Aware Enrichment (VPE) modulemaintains momentumvisualprototypes to complete regions and sharpen boundaries. Using frozen priors while only training a lightweight decoder ensures efficiency and consistently improves segmentation accuracy across diverse remote sensing scenes. Experimental results on the iSAID and ISPRS Potsdam datasets demonstrate the effectiveness of the proposed framework, achieving 38.01% mIoU on iSAID dataset and 47.01% mIoU with 66.89% overall accuracy on Potsdam dataset.

2:45pm - 3:00pm

DeSEO: Physics-Aware Dataset Creation for High-Resolution Satellite Image Shadow Removal

Lorenzo Beltrame^1,2, Jules Salzinger², Filip Svoboda³, Phillipp Fanta-Jende², Jasmin Lampert², Radu Timofte⁴, Marco Körner^1,5,6

¹Technical University of Munich; ²Austrian Institute of Technology; ³University of Cambridge; ⁴University of Würzburg; ⁵Munich Data Science Institute; ⁶ELLIS Unit Jena

Shadows cast by terrain and tall structures remain a major obstacle for high-resolution satellite image analysis. Public resources offering geometry-consistent paired shadow/shadow-free satellite imagery are essentially missing, even though there is a growing body of work on shadow removal in remote sensing, and most large-scale Earth-observation datasets are designed for shadow detection or 3D modelling rather than shadow removal. Existing deep shadow-removal datasets either target ground-level or aerial scenes or rely on unpaired and weakly supervised formulations rather than explicit satellite pairs. We address this gap with deSEO, a geometry-aware and physics-informed methodology that, to the best of our knowledge, is the first to derive paired supervision for satellite shadow removal from the S-EO shadow detection dataset through a fully replicable pipeline. For each tile, deSEO selects a minimally shadowed acquisition as a weak reference and pairs it with shadowed counterparts using temporal and geometric filtering, Jacobian-based orientation normalisation, and LoFTR–RANSAC registration. A per-pixel validity mask restricts learning to reliably aligned regions, enabling supervision despite residual off-nadir parallax. In addition to this paired dataset, we develop a DSM-aware deshadowing model that combines residual translation, perceptual objectives, and mask-constrained adversarial learning. In contrast, a direct adaptation of a UAV-based SRNet/pix2pix architecture fails to converge under satellite viewpoint variability. Our model consistently reduces the visual impact of cast shadows across diverse illumination and viewing conditions, achieving improved structural and perceptual fidelity on held-out scenes. deSEO therefore provides the first reproducible, geometry-aware paired dataset and baseline for shadow removal in satellite Earth observation.

1:30pm - 3:00pm

WG III/3A: Active Microwave Remote Sensing
Location: 713B

1:30pm - 1:45pm

Advanced Persistent Scatterer Interferometry products

Michele Crosetto, Maria Carmelia Ramlie, Anna Barra, María Cuevas-González, Riccardo Palamà, Qi Gao, Oriol Monserrat, Saeedeh Shahbazi

CTTC, Spain

Persistent Scatterer Interferometry (PSI) is a consolidated active remote sensing technique to measure and monitor land deformation. The technique has experienced an intense development in the last 25 years. PSI techniques use large stacks of SAR images that cover a given observation period. The outcome of any PSI processing is a cloud of geocoded measurement points that contain the estimated deformation time series over the observation period. If the analysed area is wide, the corresponding point cloud can be huge. In these cases, the potential users often experience problem in analysing such huge point clouds, and this can limit the PSI exploitation. In this paper we present a set of products that address specific application needs or that offer higher-level products with respect to the standard PSI products, which can facilitate the interpretation and exploitation of the PSI results.

1:45pm - 2:00pm

Back-to-back Approach to SAR Interferometry

Qi Gao¹, Michele Crosetto¹, Oriol Monserrat¹, Marta Monfort², Pavel Pavlovsky²

¹CTTC, Spain; ²GeoKinesia, Spain

Interferometric SAR (InSAR) is a well-established remote sensing technique to measure and monitor land deformation. We focus in this paper on Persistent Scatterer Interferometry (PSI) techniques based on large stacks of SAR images. Several PSI approached have been proposed in the last three decades, see for a review Crosetto et al. (2016). In this paper, we describe an approach the is based on the direct integration of the interferometric phases (back-to-back approach).

2:00pm - 2:15pm

Identification and Analysis of Recurringly Occluded Persistent Scatterers, with Application to Displacement Monitoring in the Oetztal Alps

Nils Dörr, Andreas Schenk

Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology, Germany

The Persistent Scatterer Interferometry (PSI) is a multi-temporal InSAR approach that allows to monitor displacement time series of the Earth's surface. The method identifies and analyzes Persistent Scatterers (PSs) which are phase stable scattering points which dominate the backscatter of their resolution cell. Standard PSI techniques only identify and analyze PSs which are coherent throughout the whole considered SAR time series. However, PSs can fade, appear or be occluded during the time series, forming so called Temporary PSs (TPSs), which should be integrated into the PSI to establish optimal measurement point networks. Previous research has proposed methods to integrate such TPSs into the PSI, however these were exclusively evaluated for construction-related TPSs. In this work, we evaluate the performance of a TPS integration method in handling recurringly occured PSs, and compare the performance of individual components of the algorithm against alternative methods. We evaluate the methods using simulated TPSs with temporal and spatial baseline settings taken from real Sentinel-1 data stacks. Furthermore, we present and discuss the application of the methods to a Sentinel-1 data stack acquired over the Oetztal Alps, which are seasonally covered by snow. We show that the integration of ROPSs significantly increases the measurement pixel density at many locations across the study area, compared to results from the European Ground Motion Service. Even if most of the ROPS did not have identified coherent segments in each covered summer with the current analysis algorithm, their integration leads to a significant information gain compared to standard PSI approaches.

2:15pm - 2:30pm

Semi-Automated Post-Processing Workflow for EGMS InSAR Data in Open-Pit and Dam Deformation Monitoring in the Presence of Sentinel-1 Winter Data Gaps

Pius Kipng'etich Kirui, Anna Reichstein, Andre Cahyadi Kalia

Bundesanstalt für Geowissenschaften und Rohstoffe (BGR), Germany

Deformation monitoring in open‑pit mining and tailings‑dam operations is critical for operational safety, yet conventional in situ geodetic techniques provide only sparse, point‑based measurements. InSAR offers many displacement measurements, but its operational uptake is limited by complex workflows and the difficulty of interpreting analysis‑ready products such as EGMS. In cold regions, seasonal data gaps can introduce phase‑unwrapping artefacts that appear as winter‑only displacement offsets of approximately half the Sentinel‑1 wavelength. We propose a semi‑automated workflow to post‑process EGMS displacement time series, including pre‑filtering to identify and remove points affected by phase‑unwrapping errors and subsequent time‑series clustering in either a reduced‑dimensional representation or the full feature space. Cluster selection is automated using heuristic criteria and a custom metric based on temporal homogeneity and consistency. The findings show that the semi‑automatically detected clusters are plausible with regards to a visual interpretation of the EGMS data. The workflow supports improved interpretation of EGMS time series and avoids hard‑coded thresholds or reliance on velocity‑based estimates.

2:30pm - 2:45pm

Assessment of Hydrocarbon Production induced Surface Deformation over Inglewood oilfield, Los Angeles

Imeime Innocent Uyo¹, Mahdi Motagh², Mahmud H. Haghighi¹, Vamshi Karanam³

¹Institute of Photogrammetry and Geoinformation, Leibniz University Hannover, Germany; ²GFZ Helmholtz Center for Geosciences, Potsdam, Germany; ³Southern Methodist University, Texas, United States of America

The Inglewood Oil Field, located in the Los Angeles Basin, California, is a major urban hydrocarbon production site with a documented history of ground deformation linked to oil extraction. To assess ongoing deformation and validate previous monitoring results, Interferometric Synthetic Aperture Radar (InSAR) analysis was conducted using Sentinel-1 SAR data processed through the Alaska Satellite Facility’s HyP3 platform and the Miami InSAR Time-series software in Python (MintPy). The study analysed ascending and descending datasets acquired between 2020 and 2025 to derive high-resolution deformation time series and velocity maps. Results reveal a localized deformation pattern characterized by low-magnitude vertical motion, with maximum uplift and subsidence rates of approximately +0.8 cm/yr and –1.6 cm/yr, respectively. Minor horizontal displacements (±1.0 cm/yr) suggest limited lateral strain associated with reservoir compaction and stress redistribution. Compared with previous assessments conducted up to 2024, the current findings indicate a marked reduction in deformation magnitude, implying progressive stabilization of reservoir pressure and improved subsurface management. These results demonstrate the effectiveness of InSAR for long-term monitoring of urban oilfields, providing critical insights into the behaviour and contributing to risk mitigation in densely populated environments.

2:45pm - 3:00pm

Evaluating Ground Deformation in Low-Coherence Agricultural Areas Using Multi-Temporal InSAR Analysis

Mingyue Ma¹, Mahmud Haghshenas Haghighi¹, Mahdi Motagh^1,2

¹Leibniz University Hannover, Germany; ²GFZ Helmholtz Centre for Geosciences, Germany

Ground deformation caused by excessive groundwater extraction has become a major environmental concern in agricultural regions worldwide. Interferometric Synthetic Aperture Radar (InSAR) enables large-scale monitoring of ground deformation. However, its performance often decreases in low-coherence areas affected by vegetation growth and irrigation. In this study, we conducted a comparative evaluation of three multi-temporal SBAS-InSAR processing frameworks, MintPy, LiCSBAS, and SARvey, to assess their consistency in monitoring ground deformation across Golestan Province, Iran, using Sentinel-1 data acquired between 2014 and 2024. The analysis included deformation velocity fields, cross-sectional profiles, and time-series displacements, which were compared with temperature and precipitation variations. All three frameworks identified a pronounced deformation zone in the Gorgan Plain, with maximum line-of-sight deformation rates up to 13 cm/year. Quantitative comparisons showed strong correlations among the frameworks (r = 0.80 to 0.89), confirming their mutual reliability even under low coherence conditions. The time-series analysis revealed clear seasonal deformation patterns, with summer subsidence and winter uplift closely related to hydroclimatic fluctuations. Overall, this study demonstrates that multi-temporal SBAS-InSAR approaches can provide consistent and physically meaningful deformation estimates in challenging agricultural environments, offering valuable insights for subsidence monitoring and water resource management.

1:30pm - 3:00pm

ICWG III/IIB: Planetary Remote Sensing and Mapping
Location: 714A

1:30pm - 1:45pm

Refinement of Asteroid Rotation Parameters through Stereo Intersection Angle Optimization and Masked Feature Matching

Zhen Peng^1,2, Xun Geng^1,2, Pengying Liu^1,2, Wenhao Liu^1,2, Haoyu Zhang^1,2

¹State Key Laboratory of Spatial Datum, Faculty of Geographical Science and Engineering, Henan University, Zhengzhou, China, 450046; ²College of Geographic Sciences, Henan University, Zhengzhou, China, 450046

Asteroid exploration is crucial for understanding the solar system’s origin, but establishing a precise body-fixed coordinate system—relying on accurate rotation parameters—remains challenging. Conventional methods like ground-based light curve inversion often lack precision: for example, it yielded ±2° errors for Ceres’ pole and ±10° for Vesta’s, failing to meet demands for topographic mapping and navigation. This study proposes a refinement method combining stereo intersection angle optimization and grayscale threshold masking. First, using the camera’s interior orientation parameters and tie point coordinates, relative orientation of stereo image pairs is conducted to build a stereo model, followed by forward intersection to calculate intersection angles. Only pairs with favorable geometry (intersection angle >5°) are retained to avoid large position errors from nearly parallel sightlines. Second, a grayscale-based binary mask is created to separate the asteroid from the deep-space background, eliminating spurious edge features that cause mismatches; the SIFT algorithm then extracts and matches features exclusively within the masked region. Finally, an “exhaustive search” iteratively refines rotation parameters using optimized matched points. Validated on 127 Hayabusa2 ONC-T images of asteroid Ryugu (captured July 10, 2018, 2.11m/pixel), the method reduced 5,174 initial candidate pairs to 1,454 valid ones (137,191 matched points). After 4 iterations, refined parameters were RA=96.5° and Dec=-66.4°, with minimal errors (δRA=0.069°, δDec=0.0126°) against reference values (RA=96.431°, Dec=-66.387°). Compared to methods without the two strategies, mismatches dropped from 14,949 to 7,369, and forward intersection residuals decreased. Future work will integrate initial parameters into a bundle adjustment model for further refinement.

1:45pm - 2:00pm

Scene recognition-based adaptive SLAM for lunar rover in polar regions

Wenhui Wan¹, Zhanjie Hu^1,2, Yingxiang Xu³, Kaichang Di¹, Man Peng¹, Bin Liu¹, Zhaoqin Liu¹

¹Aerospace Information Research Institute, Chinese Academy of Sciences; ²University of Chinese Academy of Sciences; ³Beijing Institute of Technology XUTELI School

The lunar polar regions have emerged as core targets in lunar exploration, primarily due to the potential water ice resources stored within their permanently shadowed areas. However, the complex terrain and extreme illumination conditions in these polar regions present significant challenges to the navigation of lunar rovers—systems that previously relied on dead reckoning and visual matching techniques. To address this, active 3D sensors such as LiDAR will be integrated into future exploration missions.Simultaneous Localization and Mapping (SLAM) based on multi-sensor fusion via factor graphs can significantly enhance the localization robustness of rovers on the lunar surface. In this context, we propose the Lunar Scene Recognition Adaptive SLAM (LSRA-SLAM) method: a framework that leverages environment-aware pre-training to dynamically adjust factor-graph weights, thereby achieving more consistent fusion of stereo camera, LiDAR, and IMU measurements across diverse lunar scenarios. We also introduce a reinforcement learning-based online training strategy, which enables the network to robustly learn from the system's dynamic behaviors. Simulated experiments validate the effectiveness of the proposed LSRA-SLAM method.

2:00pm - 2:15pm

YOLOLens2.0: A Unified Super-Resolution and Detection Framework for High-Fidelity Crater Mapping in Lunar Permanently Shadowed Regions

Riccardo La Grassa¹, Cristina Re¹, Adriano Tullo¹, Natalia Amanda Vergara Sassarini¹, Gabriele Cremonese¹, Junichi Haruyama²

¹Italian National Institute for Astrophysics, Italy; ²Institute of Space and Astronautical Science, JAXA, Japan

Accurate crater mapping in lunar permanently shadowed regions (PSRs) is hindered by extreme low-light and low-resolution imagery. We present YOLOLens2.0, a unified, end-to-end deep learning framework designed for high-fidelity crater detection and terrain reconstruction in these challenging environments. The architecture integrates a Dense-Residual-Connected Transformer

(DRCT) for multimodal super-resolution (SR) with a YOLO-derived detection module and an affine calibrator to ensure geometric consistency at meter scale. Our framework exploits a bidirectional synergy where SR enhances feature discriminability for detection, while detection-driven supervision refines structural reconstruction. Validation on Kaguya data demonstrates a significant performance leap, achieving a Recall of 89.20% and an mAP@50 of 0.844 an improvement of over 33 percentage points in recall compared to the original YOLOLens. Out-of-distribution validation on ShadowCam imagery, performed without fine-tuning, confirms the model’s robustness and scalability. The framework successfully preserves quantitative elevation fidelity and supports detailed morphometric analyses, including the extraction of the crater size-frequency distributions (SFDs) that align with theoretical lunar production functions. YOLOLens2.0 provides a scalable, high-precision methodology for planetary mapping, offering critical insights for lunar surface evolution studies and future exploration missions.

2:15pm - 2:30pm

Semantic-Gaussian Approach for Cross-View Image Matching and Pose Optimization on Planetary Surfaces

Rongling Zhang, Zhaojin Li, Bo WU

Research Centre for Deep Space Explorations | Department of Land Surveying & Geo-Informatics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong

Reliable localization across the full orbit-descent-ground chain in planetary exploration remains difficult because extreme differences in altitude, viewing geometry, resolution, and illumination cause cross-view image matching to fail. Traditional keypoint pipelines and unified Structure-from-Motion (SfM) struggle to establish robust correspondences across these heterogeneous Satellite-Descent-Ground datasets due to severe domain gaps.

To overcome these limitations, we propose a novel framework based on a joint semantic-geometric optimization paradigm. Rather than forcing a unified SfM pipeline across drastically different viewpoints, our method leverages independent intra-domain SfM outputs and telemetry data as structural priors. We introduce a differentiable rendering approach that tightly couples the optimization of 3D Gaussian Splatting (3DGS) scene parameters with learnable camera extrinsics. Furthermore, by integrating high-level semantic epipolar constraints derived from foundation models, our method dynamically refines initial cross-domain pose estimates during the rasterization loop. This joint formulation effectively bypasses the fragility of low-level pixel matching, enabling accurate and robust alignment across the vast baselines inherent to multi-stage planetary exploration image sequences.

2:30pm - 2:45pm

Crater Graph-Assisted Bundle Adjustment for Precision Topographic Mapping of Mars

Haonan Zhong, Zhaojin Li, Bo Wu

The Hong Kong Polytechnic University, Hong Kong S.A.R. (China)

Mars topographic data are crucial for quantitatively characterizing the Martian surface, supporting exploration missions, and enabling scientific study of surface processes. Photogrammetric processing of Mars orbital imagery is the most representative method for generating 3D terrain models, with bundle adjustment (BA) serving as the key step for mitigating inconsistencies in overlapping regions of different orbital images and further improving the spatial accuracy of the resulting DTMs. However, due to the texture-less surface of Mars and the absence of ground control points, the stability of BA is often compromised. Impact craters, which are prevalent on the Marian surface, have been utilized as an important semantic prior in various image analysis applications. They can also be used to assist the BA process for precision topographic mapping of the Martian surface.

This study introduces a novel BA method assisted by robust crater graph features to address this. The approach involves: (1) extracting craters using a deep learning model (YOLOv5) and constructing a stable graph structure via a minimum spanning tree; (2) establishing crater correspondences across different images based on graph features to generate robust tie points; and (3) formulating a strengthened BA equation with constraints from the graph's angular and edge relationships to mitigate geometric inconsistencies.

Experimental results indicate that the proposed method provides an effective solution for high-precision 3D mapping from Martian surface imagery with limited textures and significant illumination variation. By incorporating crater graph features, it enhances the precision and stability of BA, yielding high-precision topographic mapping results for various applications.

2:45pm - 3:00pm

Image Contrast Response to Surface Roughness Under Direct and Secondary Illumination: Implications for Lunar Polar Regions

Prasun Mahanti

Intuitive Machines, 101 E Jackson St, Phoenix, AZ, USA

Surface roughness influences image contrast by altering illumination, which depends on the surface slope. We conducted Monte Carlo simulations of rough surfaces under both directly illuminated and secondary-illuminated lunar conditions. Our results indicate that PSR secondary illumination yields significantly lower contrast, characterized by soft, diffuse shading and negligible shadow fraction.

1:30pm - 3:00pm

WG II/5: Temporal Geospatial Data Understanding
Location: 714B

1:30pm - 1:45pm

Improved Land Cover Classification of Aerial Imagery and Satellite Image Time Series using Diffusion-based Super-Resolution

Hubert Kanyamahanga, Mareike Dorozynski, Franz Rottensteiner

Institute of Photogrammetry and GeoInformation, Leibniz University Of Hannover, Germany

Accurate land cover classification requires both spatial details and temporal information of remote sensing data. While publicly available satellite image time series (SITS) offer short revisit times, they suffer from limited spatial resolution. In contrast, aerial imagery provides fine-grained spatial details, but its temporal coverage is limited. Thus, combining data from those sensors is of interest as their properties are complementary w.r.t. the problem domain. However, the large gap in spatial resolution between these two sensors makes their integration challenging. Generating super-resolution-SITS (SR-SITS) before fusion can help to reduce this gap. In this work, we propose a new approach that integrates diffusion models for generating SR-SITS into a method for the joint pixel-wise classification of aerial and SITS data. Specifically, we employ a diffusion model to generate SR-SITS at an intermediate resolution from the raw SITS and aerial imagery of the same observed area. The SR-SITS are temporally encoded and fused with the aerial features using a cross attention module to produce pixel-wise classification at the geometrical resolution of aerial image. Experimental results on the existing FLAIR benchmark dataset indicate that our approach achieves state-of-the-art results, with a mean Intersection over Union score of 64.0% and an overall accuracy of 76.6%.

1:45pm - 2:00pm

Sky-NeRF: Learning 4D Cloud Topography in a Dynamic Neural Radiance Field

Theïlo Terrisse¹, Dawa Derksen², David Youssefi², Hugo Meric²

¹CS Group, 6 rue Brindejonc des Moulinais, Toulouse, France; ²CNES, 18 avenue Edouard Belin, Toulouse, France

We present Sky-NeRF, a novel method for cloud topography estimation based on Dynamic Neural Radiance Fields. Similar to NeRF, we propose to model the 3D structure of clouds as a radiance field, encoded in the parameters of a neural representation. Our goal is to reconstruct the 3D geometry, appearance, and motion of the cloud using a stereo-video of high-resolution top of the atmosphere radiance images. In this paper, we evaluate a novel way of modeling the dynamic behavior of clouds, with the goal of extracting added-value physical information regarding the cloud such as advection speed and direction, velocity field and cloud trajectories. We investigate how to include a simple physical prior, advection, into the learning system and evaluate its impact. Our results show that Sky-NeRF is able to provide a more complete 4D reconstruction than traditional stereo-matching-based algorithms. Moreover, thanks to a physics-based interpolation, Sky-NeRF is able to generate coherent new images from unseen viewing angles, and at any time between the observed frames.

2:00pm - 2:15pm

Rigid and Non-Rigid Surface Change Tensors for Topographic Dynamics Monitoring

Shoujun Jia^1,2,3, Lotte de Vugt^2,4, Andreas Mayr², Chun Liu³, Martin Rutzinger²

¹TUM School of Engineering and Design, Technical University of Munich, Munich, Germany; ²Department of Geography, University of Innsbruck, Innsbruck, Austria; ³College of Surveying and Geo-informatics, Tongji University, Shanghai, China; ⁴Institute for Interdisciplinary Mountain Research, Austrian Academy of Sciences, Innsbruck, Austria

3D topographic change estimation is a fundamental task for understanding Earth surface dynamics in fields of photogrammetry and laser scanning. However, at the current state of research, it is still challenging to accurately separate and quantify various components of topographic surface changes (i.e., rigid spatial movement and non-rigid morphological deformation). In this paper, we conceptualize a surface change tensor to describe 3D surface change based on the displacement field, considering contribution of neighboring points to their center point on the surface. With this concept, we design a new method that is able to quantitatively separate rigid and non-rigid topographic change components from the mixed topographic change. Experiments on synthetic datasets demonstrate that our method is accurate and robust to quantify rigid and non-rigid surface changes, with superiority to the baseline method (M3C2). Additionally, real-world experiments on 3D point clouds collected at four epochs show the effectiveness of the proposed method for monitoring topographic dynamics and identifying geomorphological processes in complex large-scale mountain environments.

2:15pm - 2:30pm

Spatiotemporal reconstruction of 4D point clouds at different time scales through implicit neural representations for topographic monitoring applications

Mathilde Letard¹, Peter Naylor², Katharina Anders¹

¹TUM School of Engineering and Design; Technical University of Munich, Germany; ²ɸ-lab, ESRIN, ESA, Frascati, Italy

Monitoring surface change in dynamic environments is essential to preserve the integrity of human infrastructure and livelihoods from natural hazard consequences. With the advent of 4D remote sensing, near-continuous monitoring of dynamic scenes is unlocked. However, the unordered and irregular nature of point clouds, compounded by temporally variable occlusions and diverse acquisition conditions, hinders the accurate analysis of highly information-rich 4D data. This work addresses the challenge of irregular spatiotemporal sampling in time series of 3D point clouds for the case study of a dynamic sandy beach at different time scales. We explore the use of implicit neural representations (INRs) to model 4D data as continuous spatiotemporal functions that are optimised to estimate the beach topography continuously through space and time. By comparing four model variants and assessing their performance to reconstruct spatially and temporally subsampled data, we evaluate the applicability of INRs to high-frequency topographic monitoring, especially in the context of 4D change analysis. Our results show the ability to reconstruct missing epochs from time series of 3D point clouds with centimetric to decimetric accuracy at time scales ranging from seasonal to daily observations. Our findings highlight the importance of hyperparameter tuning to enable the capture of local details in complex spatiotemporal datasets. Through this, our work lays the foundation for continuous spatiotemporal representation of dynamic scenes, supporting a potentially broad range of change analysis applications.

2:30pm - 2:45pm

Topo4d: Topographic 4D STAC Extension for Curating and Cataloging Multi-Source Geospatial Time Series Datasets

Jiapan Wang¹, Xiaoyu Huang¹, Pim Maydhisudhiwongs¹, Mathilde Letard¹, Balthasar Teuscher², Prabin Gyawali¹, Martin Werner², Katharina Anders¹

¹Remote Sensing Applications, TUM School of Engineering and Design, Technical University of Munich, Germany; ²Big Geospatial Data Management, TUM School of Engineering and Design, Technical University of Munich, Germany

Spatiotemporal analysis of geospatial time series data has gained increasing attention with the emergence of 4D point clouds and automatic acquisition technologies such as permanent laser scanning (PLS), time-lapse photogrammetry, and uncrewed aerial vehicle (UAV) platforms, enabling near-continuous monitoring of Earth surface dynamics for change detection and process characterization. However, facing massive data volumes through the temporal domain, current topographic data curation practices often rely on empirically determined data processing and management, which may significantly affect reusability, interoperability, and hence processing efficiency due to the absence or heterogeneous nature of metadata. The need for standardized approaches to manage time-dependent metadata has become critical as the demands for sharing data and reproducing analysis across tools and application domains increase. We propose a topographic 4D extension (topo4d) to the SpatioTemporal Asset Catalog (STAC) framework, which provides an open and extensible specification for automatic metadata curation and FAIR data management practices. This paper demonstrates how the topo4d extension facilitates the interoperability and reusability of 4D datasets and presents the corresponding metadata curation workflows applied to two real-world environmental monitoring applications.

1:30pm - 3:00pm

WG III/5: Remote Sensing for Inclusive Pathways to Equality and Environmental Health
Location: 715A

1:30pm - 1:45pm

Remote Sensing of Urban Asbestos Exposure: Deep Learning for Environmental Risk Assessment

Malgorzata Krowczynska, Ewa Wilk

University of Warsaw, Poland

This study presents an integrated remote sensing and deep learning approach for large-scale detection of asbestos-cement roofing in urban environments. Asbestos remains a major environmental health concern across Europe, where asbestos-cement materials persist in the built environment despite regulatory bans. Accurate identification and quantification of these materials are critical for effective remediation planning and equitable health protection.

The research focused on Poland’s two largest metropolitan areas—Warsaw and Kraków—which differ markedly in morphology and historical development, providing contrasting case studies for model validation. High-resolution orthophotomaps (5 cm and 25 cm) from 2023–2024, combined with national building footprint datasets and field-verified information, were used to train and validate a convolutional neural network (CNN) for binary classification of asbestos and non-asbestos roofs.

The highest producer accuracy (90.4%) and overall accuracy (92.9%) were achieved using 128×128-pixel image windows, confirming that broader spatial context enhances classification precision in dense urban settings. The CNN model demonstrated consistent performance across both cities, highlighting its robustness and scalability.

By integrating open orthophotos with open-source analytical frameworks, the method supports the creation of spatially detailed asbestos inventories aligned with the EU INSPIRE Directive and the 2023 Asbestos Directive (EU 2023/2668). The approach enables cost-effective, standardized monitoring applicable to metropolitan and smaller urban contexts alike.

This study advances data-driven environmental health management by demonstrating that deep learning applied to national aerial imagery can deliver operational tools for mapping asbestos exposure risks and informing sustainable, equitable remediation strategies across Europe.

1:45pm - 2:00pm

Remote Sensing of Urban Greenspace: Two Decades of 30-m FVC and Population Exposure Assessment Across Chinese Cities

Dan-Xia Song^1,2, Ziyi Chen^1,2, Sixuan Qi^1,2, Tao He³

¹Hubei Provincial Key Laboratory for Geographical Process Analysis and Simulation, Central China Normal University, Wuhan, 430079, China; ²College of Urban and Environmental Sciences, Central China Normal University, Wuhan, 430079, China; ³School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, 430079, China

Urban greenspace is essential for ecological resilience, climate regulation, and human well-being, yet long-term, fine-scale assessments of its spatiotemporal dynamics and the extent to which residents benefit from green exposure remain limited. This study develops a 30-m resolution Fractional Vegetation Cover (FVC) dataset to monitor interannual and seasonal variations in urban greenspace across twelve representative Chinese cities from 2000 to 2020. To capture temporal exposure, we introduce the “greendays” metric, defined as the number of days per year that residents experience visible greenery. A population-weighted exposure model was applied to quantify both the magnitude and equality of greenspace exposure. Results show that greenspace increased across all cities over the two decades, with peri-urban areas exhibiting the most substantial gains due to ecological restoration and park development, while core urban areas experienced moderate but consistent improvements linked to renewal and localized greening efforts. Greendays displayed a slight upward trend, indicating an extended duration of annual greenery exposure for residents. Exposure equality remained high and improved in most cities, suggesting that greening initiatives increasingly benefited diverse population groups. Overall, this study provides a robust and scalable remote-sensing-based framework for tracking urban greenspace and exposure equity, offering critical evidence to support nature-based solutions, environmental justice, and sustainable urban planning in alignment with global development goals.

2:00pm - 2:15pm

Analysing the Impacts of Natural-Factor Variability on Lake Water Volume Using the Generalized Method of Moment

Yuxuan Yang¹, Yi Lin^1,2, Jie Yu^1,2, Tinghui Zhang³

¹College of Surveying and Geo-Informatics,Tongji University, China, People's Republic of China; ²Research Center for Remote Sensing Technology and Application,Tongji University, China, People's Republic of China; ³Guangzhou Institute of Geography Guangzhou,China, People's Republic of China

This study develops a generalized method of moments (GMM) framework to quantitatively assess the integrated relationships among climate, vegetation, and lake water volume. Using GSOD precipitation data, SSEBop evapotranspiration, Nino3.4 and MEI indices, and NDVI, we analyzed monthly variations of climatic and vegetation conditions in the Lake Victoria basin from 2000 to 2020. The associations between these factors and lake water-volume changes were first examined, and dynamic GMM was then applied to remove mutual influences among climate variables, allowing for a more reliable attribution of dominant drivers.Results show that precipitation is the primary driver of seasonal to interannual water-volume variations, while evapotranspiration imposes a consistent negative effect on lake storage. ENSO significantly modulates multi-year water anomalies. Vegetation dynamics respond to both climatic variability and lake water-volume changes, with water-level fluctuations providing additional positive feedback after controlling for climate effects.

2:15pm - 2:30pm

Land cover mapping from orthorectified Neo-Pleiades imagery via Object-Based methods

Valerio Baiocchi¹, Francesca Giannone², Chiara Magurano¹, Erica Nocerino³

¹Sapienza Università di Roma, Italy; ²Niccolò Cusano University, Rome, Italy; ³Università degli Studi di Sassari, Sassari, Italy

Posidonia oceanica (Linnaeus) Delile (referred from now on also as P. oceanica) is a marine flowering plant endemic to the Mediterranean Sea, forming extensive underwater meadows that play vital ecological roles, especially as blue carbon reservoirs. Its distribution spans from Gibraltar to Turkey and North Africa to the Adriatic down to 40-50 m of depth (Cocozza et al., 2024). Human impacts, such as pollution, urbanization, and global warming, have reduced its extent by up to 56% in some regions (Robello et al., 2024). Monitoring these meadows is essential, and remote sensing data such as Neo-Pléiades satellite imagery enable their accurate mapping and health assessment. This study applies object-based classification to orthorectified Neo-Pléiades images to evaluate Posidonia oceanica distribution along Sardinia’s eastern coast.

2:30pm - 2:45pm

Using the Soil Brightness Indicator to inform Participatory Community Planning for SDG2 Projects – a case study in Dodoma, Tanzania

Hannah Kemper^1,2, Dr. Valerie Graw¹, Dr. Felicia O. Akinyemi³, Sarah Muir², Dr. Rogerio Bonifacio², Giancarlo Pini², Theo Renouard², Sayana de Gorostizaga², Ivana Milic², Sirio Modugno², Paolo Lucchino², Serena Ciccarello², Lorenzo Bosi²

¹Ruhr University Bochum, Germany; ²United Nations World Food Programme; ³Karlstad University, Sweden

Soil is a crucial component of the ecosystem, affected by climate change, and is often overlooked by remote sensing experts and insufficiently considered while discussing sustainable development projects. To enhance the use of soil related datasets based on earth observation during the planning phase of participatory processes, a specific analysis workflow was piloted during community consultations in Dodoma, Central Tanzania. In order to enhance the integration of the soil conditions during the design of a new community development plan Landsat 8 data from 2023 and 2024 was processed and prepared to make soil information more accessible to non-technical staff and the local communities in Chamwino district. Results confirm the suitability of the SBI as soil indicator thanks to its high resolution, easy interpretability, and context specificity. Preprocessing through experts was identified as viable solution for preparing the data. In addition, field truthing exercises and conversations with the local community members further confirm the accuracy of this dataset for highlighting areas affected by soil salinity or fertility loss and for the final use during participatory planning processes.

1:30pm - 3:00pm

WG II/4B: AI/ML for Geospatial Data
Location: 715B

1:30pm - 1:45pm

From Pixels to Polylines: Extracting City-scale Vectorized Roof Structures with Line Segment Detection Networks

Mehmet Büyükdemircioğlu¹, Fabio Remondino¹, Martin Kada², Sultan Kocaman³

¹3D Optical Metrology (3DOM) Unit, Bruno Kessler Foundation (FBK), Trento, Italy; ²Technische Universität Berlin, Institute of Geodesy and Geoinformation Science, Berlin, Germany; ³GeoPlato Engineering Inc., Bilkent Cyberpark, Ankara, Türkiye

Automatic extraction of vectorized roof structures above LOD2.0 remains challenging due to their geometric complexity and the presence of small and occluded elements over the roofs. Detecting fine-scale roof objects such as chimneys and dormer windows in very high resolution aerial imagery is still an active research topic. This study presents a workflow for automated detection and vectorization roof structures at city scale using Line Segment Detection (LSD) networks. Compared to model-based building reconstruction approaches, LSD networks do not rely on pre-defined roof typologies and are able to extract complex roof structures and small objects over the building roofs. For this purpose, a dataset comprising approximately 139,000 buildings with LOD2.2 roof structures and more than 2.2 million roof segments is generated using 8 cm GSD aerial imagery. An automated end-to-end workflow is developed, trained and tested from the available data. Experimental results indicate that roof structures suitable for LOD2.2 3D roofs can be extracted and vectorized with high accuracy, achieving 58.4% msAP and 73.1% mAPJ with ULSD network. Robustness is further assessed by visual inspection in areas affected by roof-blocking objects such as trees and cast shadows.

1:45pm - 2:00pm

Automatic Large-Scale Topographic Mapping from High-Resolution Aerial Imagery

Weiqin Jiao, George Vosselman, Claudio Persello

University of Twente, ITC Faculty Geo-Information Science and Earth Observation, Netherlands, The

Topographic maps provide structured, polygonal representations of the Earth’s surface, delineating land-cover classes such as buildings, roads, water bodies, and vegetation.

They form the foundation of national geospatial data infrastructures and support a wide range of applications, including urban planning, environmental monitoring, and cadastral management. However, the production and maintenance of such large-scale topographic maps still rely heavily on manual photo-interpretation and vector editing.

While such human-in-the-loop workflows ensure geometric accuracy, they are labor-intensive, costly, and non-reproducible, limiting scalability and update frequency. However, most existing polygonal outline extraction methods are restricted to single-class, which typically leads to overlaps, gaps, and inconsistent shared boundaries when extended to multi-class mapping. Moreover, few studies have demonstrated nationwide implementation or validation, leaving the scalability and generalization of current methods largely unexplored. To address these challenges, this study develops a fully automated framework for large-scale topographic mapping directly from high-resolution aerial imagery. The framework aims to produce seamless, multi-class topographic maps in a single run that remain topologically consistent across diverse urban and rural regions in the Netherlands and beyond.

2:00pm - 2:15pm

Todo Fir Crown Instance Segmentation in dense Plantation Forest using Polar-FFT and Treetop Queries

Yusuke Iuchi¹, Soki Nishiwaki¹, Fumio Takeuchi², Mika Takiya², Takanori Emaru³

¹Graduate School of Engineering, Hokkaido University; ²Forestry Research Institute, Hokkaido Research Organization; ³Faculty of Engineering, Hokkaido University

Instance segmentation of individual trees from UAV-derived orthomosaics and DSMs remains challenging in dense planted forests in Japan because SfM-derived DSMs often have blurred crown boundaries and unstable quality. We propose a PFFT-based method that encodes the local DSM shape around treetop candidates and integrates it into Mask2Former to suppress unreliable candidates and improve crown separation.

Experiments on Abies sachalinensis plantation (Todo fir) data from two sites in Hokkaido showed that the method improved mAP75 from 52.18% to 55.47% and F1 at a confidence threshold of 0.5 from 89.86% to 92.08%, while reducing false positives by 41% without increasing false negatives. The results indicate that treetop-centered local shape cues are useful for instance segmentation in densely planted forests.

2:15pm - 2:30pm

An integrated yolo-seg and geometric analysis framework for construction zone detection and tubular marker damage assessment

Lee Jubin¹, Youn Junhee², Choi Kanghyeok³, Kim Changjae¹

¹Department of Civil and Environmental Engineering, College of Engineering, Myongji University,; ²Department of Future & Smart Construction Research, Korea Institute of Civil and Building Technology; ³Department of Geoinformatic Engineering, Inha University

This study presents an integrated framework combining YOLOv9e-Seg and photogrammetric geometric analysis for detecting road-safety assets and assessing their condition using UAV imagery. Traffic cones and tubular markers, which define construction-zone boundaries, are difficult to detect due to their small size in high-resolution images. To address this, a crop-tiling strategy (512×512 pixels) was applied to enhance the representation of small objects. Polygon-based labeling was used to preserve fine object geometry, and YOLOv9e-Seg was trained to output instance masks and polygon coordinates. During testing, tiled predictions were restored to the global coordinate frame, and duplicate detections were removed by retaining only the highest-confidence results. Geometric analysis utilized segmentation-derived polygons to compute centroids and principal axes, distinguishing intact and damaged tubular markers through vector angle difference analysis. For traffic cones, convex hulls constructed from centroid positions accurately delineated construction-zone boundaries. The proposed approach achieved the highest F1 score at a 512-pixel tile size, improving detection and segmentation of small, slender objects. These results demonstrate that the framework goes beyond basic detection and segmentation by enabling quantitative geometric interpretation and reliable construction-zone reconstruction from UAV data.

2:30pm - 2:45pm

From Aerial to Satellite: Can Super-Resolution Enable Label-Free Model Transfer?

Nina Merkle, Corentin Henry, Sandeep Kumar Jangir, Jens Hellekes, Felix Rauch, Pablo d'Angelo, Franz Kurz

German Aerospace Center (DLR), Germany

Satellite imagery enables large-scale remote sensing applications by providing frequent and large-scale coverage. However, its limited spatial resolution often restricts the use of satellite images in tasks that require detailed, fine-scale information. In contrast, aerial images offer a much higher spatial resolution, allowing the extraction of fine-grained features, but typically cover smaller, more localized areas. In this work, we investigate whether super-resolution (SR) methods can bridge the gap between aerial and high-resolution satellite imagery, enabling a label-free model transfer without additional manual annotations. The idea is to enhance the spatial resolution of high-resolution satellite images, allowing models trained on aerial data to be directly applied to satellite images. Towards this goal, a state-of-the-art SR algorithm is used to upscale three high-resolution satellite images, matching the resolution of the aerial training data. Then, a segmentation network trained on an aerial image dataset is applied to segment roads and parking areas in the super-resolved satellite images. The approach is evaluated on an annotated dataset and compared to the results in the original satellite images. Additionally, we investigate its performance on a low-resolution aerial image. Our results demonstrate that SR facilitates the utilization of models trained on aerial image datasets for large-scale satellite applications without requiring new labels.

2:45pm - 3:00pm

Beyond Vision: How Language effects Visual Grounding in UAV Imagery

Jue Chen¹, Penghui Huang², Ran Ding², Zhentao Zou², Xue Yang², Xue Jiang², Yue Zhou¹, Hongxin Yang¹, Jonathan Li^1,3

¹Hinton STAI Institute and Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, Shanghai 200241, China; ²Shanghai Jiao Tong University, Shanghai 200241, China; ³Department of Geography and Environmental Management, University of Waterloo,Waterlo0,ON N2L 3G1,Canada

This study tackles multilingual and explicit-implicit gaps in Visual Grounding (VG) for UAV imagery, focusing on real-world UAV needs (e.g., disaster response) that require implicit reference understanding. It evaluates Qwen2.5-VL-7B’s cross-linguistic robustness via Acc@0.5% across nine languages (Chinese, English, Japanese, Russian, Korean, German, French, Spanish, Portuguese). Key results: Explicit VG (using visual attributes) outperforms implicit VG (needing context/common sense) universally. East Asian languages lead in both tasks; Indo-European languages (e.g., Portuguese, 48.63% implicit accuracy drop) lag. Attention analysis shows the model better aligns with East Asian linguistic structures. This work informs LVLM optimization for multilingual UAV applications, guiding future cross-model comparisons.

1:30pm - 3:00pm

IvS6B: Canadian Remote Sensing for Urban Applications
Location: 716A

1:30pm - 1:45pm

Advances in 3D urban Reconstruction and Building Mesh Extraction using Gaussian Splatting and Google Earth

Kyle Gao¹, Jonathan Li^1,2, Linlin Xu³, David Clausi¹

This invited talk showcases two linked advances in Canadian urban remote sensing from the University of Waterloo. The first work presents large-scale 3D urban scene reconstruction and point-cloud densification using Gaussian Splatting with Google Earth Studio imagery. It recovers geometry and photorealistic radiance for the Kitchener–Waterloo region, benchmarking against NeRF baselines and achieving higher view-synthesis quality with faster training. The study demonstrates practical pipelines for city-scale digital twins and urban analytics.

The second study advances building-level reconstruction through the Gaussian Building Mesh (GBM) framework. GBM automatically extracts metrically accurate 3D building meshes from open-access imagery using segmentation models such as SAM2 and GroundingDINO, combined with Gaussian Splatting for dense, photorealistic surface generation. This pipeline enables efficient, data-driven modeling of urban structures, supporting applications from municipal infrastructure documentation to heritage reconstruction.

Together these contributions deliver scalable 3D reconstruction, object-level meshing, and data-driven urban modeling. They strengthen Canada’s leadership in remote sensing research and support resilient urban planning, infrastructure monitoring, and Earth observation–driven decision systems for Canadian cities.

1:45pm - 2:00pm

Semantic-Aware Harmonization Model (SAHM) for Improving Consistency In Large-area, Fine-resolution Urban Land Cover Mapping

Yindan Zhang¹, Yuhong He¹, Gang Chen², Ying Zhang³

¹University of Toronto Mississauga, Canada; ²University of North Carolina at Charlotte, USA; ³Natural Resources Canada, Canada

Fine-resolution urban land-cover (ULC) mapping is essential for understanding intra-urban heterogeneity and monitoring rapid land-use change. However, large-area mosaics from CubeSat constellations such as PlanetScope often suffer from strong radiometric inconsistencies caused by varying sensor calibration, viewing geometry, and illumination, leading to unreliable classification and visual artifacts. This study introduces a Semantic-Aware Harmonization Model (SAHM) that jointly addresses spectral and semantic inconsistencies across multi-source imagery. SAHM integrates two synergistic components: a Spectral Harmonization Module (SHM) for radiometric alignment between PlanetScope and Sentinel-2 imagery, and a Semantic Consistency Module (SCM) inspired by prompt-based architectures to enforce category-level coherence. Through bidirectional interaction, semantic features guide spectral correction, while harmonized representations improve segmentation reliability. Applied to the Toronto and Region Conservation Authority area (TRCA), SAHM achieved an overall accuracy of 91.9%, with F1-scores exceeding 94% for impervious surfaces and 97% for agriculture. Harmonized PlanetScope mosaics demonstrated high spectral fidelity (PSNR = 34.2 dB, SSIM = 0.93) and reduced inter-scene NDVI/NDWI bias (< 0.05). The results highlight SAHM’s capability to produce spatially coherent, semantically reliable urban maps from radiometrically inconsistent high-resolution imagery. This framework offers a scalable solution for consistent urban monitoring across CubeSat constellations, paving the way toward semantic-driven harmonization in next-generation Earth observation.

2:00pm - 2:15pm

Individual tree crown delineation and classification in urban landscapes from multi-source remote sensing by integrating SAM and watershed segmentation

Jingyi Yuan¹, Chuofan Zhang¹, Lu Liang², Hong Zhang^1,3,4, Qin Ma^5,1,3,4

¹School of Geography, Nanjing Normal University, Nanjing, China.; ²Department of Landscape Architecture and Environmental Planning, University of California, Berkeley, USA.; ³Key Laboratory of Virtual Geographic Environment (Nanjing Normal University), Ministry of Education, Nanjing, China.; ⁴Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, China.; ⁵State Key Laboratory of Climate System Prediction and Risk Management, Nanjing, China.

Urban trees enhance the urban environment through various ecosystem services. Individual tree delineation and species classification provide information on the location, structure, and species of each tree from remote sensing datasets, offering valuable data support for efficient and refined urban greening management. However, existing individual tree delineation algorithms developed based on single-source remote sensing datasets struggle to address the complexity of urban green landscapes, such as conifer-broadleaf mixtures, tree-shrub complexes. Additionally, the relationship between classification accuracy and individual tree delineation quality remains unclear.

This study integrates the Segment Anything Model (SAM) and Marker-Controlled Watershed Segmentation (MCWS), combining imagery and LiDAR features, to optimize individual tree delineation in complex urban landscapes. Species classification was then performed on crown datasets from different algorithms to investigate how classification accuracy responds to varying crown qualities.

The results demonstrate that the proposed SAM-WS algorithm effectively enhances individual tree delineation accuracy, achieving the highest F1-Score of 0.75, with improvements of 0.20 and 0.27 over SAM and MCWS, respectively. The classification accuracy based on SAM-WS crowns was the highest among all algorithm-derived crown datasets, with an Overall Accuracy (OA) of 0.79 and a Kappa of 0.64. As the average F1-Score of crown delineation dropped from 1.00 to 0.48, the OA for classification decreased from 0.86 to 0.74, and Kappa from 0.77 to 0.38. Additionally, the classification accuracy of conifers and shrubs was more sensitive to the crown quality. This research offers new methodologies and insights into the application of remote sensing-based urban vegetation monitoring.

2:15pm - 2:30pm

Satellite-based Detection of Invasive Shrubs in Urban Woodlands

Carolyne Qiang¹, Yuhong He²

¹University of Toronto, Canada; ²University of Toronto, Canada

This study develops a satellite-based framework for detecting invasive shrub presence, focusing on common buckthorn (Rhamnus cathartica), across urban woodland environments in southern Ontario. Invasive shrubs exhibit extended leaf phenology compared to native understory species, leafing out earlier in spring and retaining foliage later into fall. Leveraging this phenological contrast, the workflow integrates multi-season Sentinel-2 MSI composites with higher-resolution PlanetScope imagery, combined with 2025 field observations collected across mixed-canopy woodlands in the Greater Toronto Area.

Spectral features (NDVI, EVI, NDWI, red-edge indices, Tasseled Cap transformations) and contextual variables (distance to woodland edges, canopy-openness metrics) are incorporated into a Random Forest classifier designed to distinguish buckthorn presence under complex understory conditions. A presence-background training strategy and spatially blocked cross-validation are implemented to reduce label uncertainty and spatial autocorrelation.

Preliminary results show that early-spring and late-fall imagery substantially improve detection sensitivity, with late-season spectral indices supporting the hypothesis that extended leaf persistence is a reliable cue for invasive shrub identification. This cost-effective workflow highlights the potential of multi-sensor satellite data to support early warning, invasion-risk mapping, and more efficient allocation of ground-validation efforts in urban conservation planning.

2:30pm - 2:45pm

Seasonal analysis of surface temperature and vegetation dynamics using drone-based thermal and multispectral remote sensing

Haoxuan Ge, Yuhong He

Department of Geography, Geomatics and Environment, University of Toronto Mississauga, Ontario, L5L 1C6, Canada

Drone remote sensing offers unique potential for capturing fine-scale variations in land surface temperature and vegetation condition, two tightly coupled variables that jointly regulate surface energy balance, evapotranspiration, and local microclimates. Understanding their interactions is crucial for assessing ecosystem function, evaluating the impacts of land use, and informing nature-based climate adaptation strategies. Yet, despite growing interest, UAV-based thermal and multispectral data have largely been used individually, and their integration for quantifying coupled seasonal dynamics in vegetation function and surface temperature remains limited. To address this gap, this study introduces a commercial off-the-shelf dual-drone multisensory data collection framework. The system integrates thermal infrared and multispectral imaging to analyze seasonal variations in surface temperature and vegetation health. The study area is a suburban-naturalized mixed landscape located at the University of Toronto Mississauga, Canada. Ten monthly drone flights were conducted from August 2024 to August 2025, with thermal and Normalized Difference Red Edge (NDRE) indices mosaiced for analysis. Results revealed distinct seasonal patterns, with impervious surfaces consistently exhibiting the highest surface temperatures, followed by vegetation and water, which were the coolest. NDRE values exhibited summer maxima and winter minima, aligning with the expected phenological cycles of vegetation. Regression analyses indicated that higher NDRE generally corresponded to lower surface temperatures, particularly for maintained trees and evergreen vegetation, highlighting the role of vegetation in moderating local heat. The developed workflow demonstrates the potential of drone-based remote sensing for cost-effective, fine-scale, multi-temporal environmental monitoring. It provides an adaptable framework for future applications in microclimate assessments.

1:30pm - 3:00pm

Forum3A: Legacy Project: How to Secure Funding to Support Geospatial Activities
Location: 716B

1:30pm - 3:00pm

Forum8A: Wildfire Remote Sensing - Bridging Public and Private Solutions
Location: 717A

1:30pm - 3:00pm

InS5: Industry Tech Session
Location: 717B

1:30pm - 5:00pm

General Assembly 2
Location: 701A

3:00pm - 3:30pm

Afternoon Coffee Break
Location: Exhibition Hall "E"

3:30pm - 5:15pm

WG III/1D: Remote Sensing Data Processing and Understanding
Location: 713A

3:30pm - 3:45pm

Spatio-temporal Modeling of Bridge Deformations from Sentinel-1 SAR Images Validated with Multiple In-situ Surveys

Roberto Monti, Federica Gaspari, Rohollah Naeijian, Rebecca Fascia, Mirko Reguzzoni, Lorenzo Rossi, Livio Pinto

Politecnico di Milano, Department of Civil and Environmental Engineering (DICA), 20133 Milan, Italy

Aging bridge infrastructure requires efficient, network-scale monitoring, especially in remote areas where traditional in-situ sensors are costly and logistically challenging. This paper presents a remote sensing framework for structural health monitoring based on spaceborne Synthetic Aperture Radar (SAR). The approach combines Persistent Scatterer Interferometry (PSI) and Least Squares Collocation (LSC), implemented through the PHASE open-source MATLAB software, to derive a millimeter-level spatio-temporal displacement model.

The methodology is applied to a reinforced-concrete viaduct in the Alpine foothills of Lombardy, Italy, using five years of Copernicus Sentinel-1 data. A custom elevation-based spatial filtering strategy enables the isolation of structural displacements from the surrounding topography. The resulting spatio-temporal displacement model captures the expected seasonal thermal behavior of the structure and highlights localized deviations from the dominant cyclic response.

Finally, the SAR-derived model is integrated with UAV photogrammetry and official inspection reports within the P.O.N.T.I. 3D viewer. This multi-source, Digital Twin-like environment facilitates the joint interpretation of remote sensing observations and in-situ evidence, providing a scalable framework to support infrastructure monitoring and management.

3:45pm - 4:00pm

Large-Scale InSAR Deformation Monitoring Using Realistic Simulation-Based Training of a Temporal Convolutional Network: Application to the Phlegraean Fields, Italy

Kourosh Shahryarinia, Mohammad Omidalizarandi, Ingo Neumann

Geodetic Institute Hannover, Leibniz University Hannover, Germany

Large-scale land surface deformation monitoring using Interferometric Synthetic Aperture Radar (InSAR) requires robust detection of changes in long-term deformation trends. However, accurate change point (CP) detection remains challenging due to complex time series characteristics, including seasonal and quasi-periodic components and noise. Classical methods and many existing deep learning approaches rely on restrictive assumptions and training data that do not fully represent real-world InSAR time series, limiting their generalization and scalability in large-scale, real-world applications. In this study, we propose an integrated, fully supervised framework for CP detection in InSAR displacement time series based on Temporal Convolutional Networks (TCNs). The proposed TCN model employs dilated convolutions with multi-scale receptive fields to capture long-term temporal dependencies and complex deformation patterns, enabling robust identification of significant trend changes under noisy conditions. To effectively train this model, we introduce a deep learning-based InSAR time series simulation framework trained on real time series. This simulation framework produces physically consistent InSAR time series that retain essential temporal characteristics while introducing predefined, credible trend changes. Finally, we integrate the trained model into a large-scale anomalous change-detection pipeline that aggregates detected CPs from individual time series into spatially coherent deformation heatmaps suitable for operational monitoring. The proposed framework is evaluated using simulated data and real InSAR time series from the Phlegraean Fields caldera (Campi Flegrei), Italy. The results show clusters of anomalous behavior in the central Campi Flegrei–Pozzuoli area and in parts of Ischia and Procida, consistent with known unrest zones, associated periods, and independent measurements.

4:00pm - 4:15pm

Geometry-conditioned Pix2Pix: leveraging explicit Conditioning on SAR projected local Incidence Angle for SAR-to-EO Translation Quality Improvement

Jinmin Lee, Minkyung Chung, Aisha Javed, Youkyung Han

Seoul National University of Science and Technology, Korea, Republic of (South Korea)

Electro-optical (EO) imagery is intuitive but highly dependent on weather and illumination, whereas synthetic aperture radar (SAR) imagery provides reliable all-weather observations yet offers limited spectral information. To complement these modalities, recent studies have applied cGAN-based image-to-image translation for SAR-to-EO translation. However, side-looking SAR introduces spatial distortions such as foreshortening and layover that cause relative misalignment with EO imagery, undermining pixelwise supervision and yielding structural discrepancies between translated outputs and reference EO imagery. In this study, we propose Geometry-Conditioned Pix2Pix (GC-Pix2Pix), which explicitly conditions on projected local incidence angle (PLIA) information derived from SAR imagery to better preserve structure and alignment in translated EO imagery. The method is based on Pix2Pix and comprises a 2-branch generator and a PatchGAN discriminator. The generator consists of a main network that processes SAR polarimetric channels (VV, VH) and a conditioning subnetwork that extracts PLIA features. The subnetwork uses multi-layer convolutional blocks to capture local PLIA patterns, and the extracted features are then fused with features from the main branch and emphasized via a spatial attention module. For training and evaluation, we assembled a dataset over South Korea that combines Sentinel-1A GRD VV/VH with PLIA and Sentinel-2B Level-2A RGB imagery. We compared GC-Pix2Pix against representative baselines. Across multiple image quality assessment metrics and complementary qualitative analyses, the proposed approach consistently improved SAR-to-EO translation performance.

4:15pm - 4:30pm

Temporal-Spatial Tubelet Embedding for Cloud-Robust MSI Reconstruction using MSI-SAR Fusion: A Multi-Head Self-Attention Video Vision Transformer Approach

Yiqun Wang, Lujun Li, Meiru Yue, Radu State

SEDAN, SnT, the University of Luxembourg, Luxembourg

Cloud cover in multispectral imagery (MSI) significantly hinders early-season crop mapping by corrupting spectral information. Existing Vision Transformer(ViT)-based time-series reconstruction methods, like SMTS-ViT, often employ coarse temporal embeddings that aggregate entire sequences, causing substantial information loss and reducing reconstruction accuracy. To address these limitations, a Video Vision Transformer (ViViT)-based framework with temporal-spatial fusion embedding for MSI reconstruction in cloud-covered regions is proposed in this study. Non-overlapping tubelets are extracted via 3D convolution with constrained temporal span t=2, ensuring local temporal coherence while reducing cross-day information degradation. Both MSI-only and SAR-MSI fusion scenarios are considered during the experiments. Comprehensive experiments on 2020 Traill County data demonstrate notable performance improvements: MTS-ViViT achieves a 2.23% reduction in MSE compared to the MTS-ViT baseline, while SMTS-ViViT achieves a 10.33% improvement with SAR integration over the SMTS-ViT baseline. The proposed framework effectively enhances spectral reconstruction quality for robust agricultural monitoring.

4:30pm - 4:45pm

Evaluating Deep Matching Models for SAR-Optical Image Pairs using the SpaceNet9 Dataset

Constantin Günzel, Michael Schmitt

Department of Aerospace Engineering, University of the Bundeswehr Munich, Germany

This paper focuses on cross-modal image matching between Synthetic Aperture Radar (SAR) and optical imagery, a long-standing challenge due to disparate sensing physics, radiometric behaviour and geometric distortions. Beyond applicational needs in satellite data fusion and downstream mapping, the study is additionally motivated by the rapid advances of feature matching in the field of Computer Vision. Under a unified, lightweight pipeline, the authors evaluate a classical handcrafted baseline (SIFT) against modern deep matchers, including a modality-invariant approach (MINIMA), as well as a SuperPoint+LightGlue pipeline, using the SpaceNet9 dataset with provided ground truth. The aim is to assess each models' ability to establish reliable correspondences without retraining or modality-specific adaptation, aiming to provide practical guidance for other researchers working with SAR-optical fusion. The paper highlights where pretrained multimodal models already yield consistent correspondences, where they still struggle and outlines possible next steps.

4:45pm - 5:00pm

Detecting Marine Pollutants Using Sentinel-1 SAR and Sentinel-2 Optical Imagery

Jason Manesis¹, Paraskevi Mikeli¹, Katerina Kikaki², Ioannis Kakogeorgiou³, Athanasios Karmas², Konstantinos Karantzalos¹

¹National Technical University of Athens; ²Hellenic Space Center; ³IIT, NCSR "Demokritos"

Marine pollution, including Marine Debris and Oil Spills, poses a serious environmental threat that requires systematic monitoring. While satellite observations and machine learning models have been widely applied in this domain, the use of advanced deep learning techniques remains limited. To support progress in this area, we construct a new annotated Sentinel-1 SAR dataset derived from the MADOS Sentinel-2 marine pollution dataset, including labels for oil spills, sea surface, look-alikes, ships, and offshore platforms. We evaluate several deep learning architectures on this dataset, including traditional models such as U-Net, state-of-the-art segmentation models such as SegNeXt and domain-specific frameworks such as MariNeXt. Our results show that MariNeXt achieves the best performance with an F₁-macro score of 92.7%, significantly outperforming U-Net and SegNeXt. Qualitative analysis using paired Sentinel-2 imagery further validates these findings. The study also highlights the persistent difficulty of detecting marine debris in SAR imagery, particularly when complementary optical data are unavailable.

5:00pm - 5:15pm

A coarse-to-fine cross-view localization framework with BEV-guided retrieval and fine-grained pose alignment

Qiong Wu¹, Panwang Xia¹, Yi Wan^1,2, Xinyi Liu^1,2, Yongxiang Yao¹, Dong Wei¹, Yongjun Zhang^1,2

¹Wuhan University, China, People's Republic of; ²Technology Innovation Center for Collaborative Applications of Natural Resources Data in GBA, Ministry of Natural Resources, Guangzhou, 510075, Guangdong, China

This paper introduces a coarse-to-fine cross-view localization framework that unifies image-level retrieval and geometry-level alignment within a single pipeline. The proposed approach first employs a Bird’s-Eye-View (BEV)-guided retrieval module to establish a perspective-consistent intermediary space, enhancing cross-view consistency and retrieval precision. In the fine stage, a geometry-aware alignment module estimates the 3-DoF pose through interpretable point-plane matching based on BEV correspondences. This hierarchical design bridges global retrieval and local geometric reasoning, achieving both scalability and high localization accuracy. Extensive experiments on the VIGOR benchmark demonstrate that the proposed framework achieves state-of-the-art performance in both retrieval and alignment, significantly improving end-to-end localization precision while maintaining computational efficiency.

3:30pm - 5:15pm

WG IV/2C: Artificial Intelligence and Uncertainty Modeling in Spatial Analysis
Location: 713B

3:30pm - 3:45pm

Comparison of Solar Radiation Estimates of GIS, Satellite, In-Situ, and SDT-based Solar Modelling for Rooftop Solar Energy Planning

Athenee Ballesteros Teofilo, Chayn {Qian} Sun

RMIT University, Australia

Urban rooftop solar planning relies on solar radiation inputs, yet estimates vary across models and measurement methods. This study compares radiation estimates from ArcGIS Solar Analyst, NASA solar radiation values, in-situ observations from research-grade and personal weather stations, and SDT-based Solar Radiation Modelling. We derive hourly global horizontal irradiance (GHI) values from these solar radiation data centres, model building-level estimates, harmonise all sources through temporal alignment, and then evaluate the values. The comparison reveals the hourly modelling of solar radiation models and common solar radiation centres, highlighting where an urban-adjusted local sensor provides lower solar radiation values because of the limited representation of the built and urban environment. Results show that utilising gridded or terrain-based models over urban-adjusted solar radiation values overrepresent due to the uncaptured localised shadings, roof placement effects, and increasing systemic errors for downstream rooftop PV terrain-based assessments. The cross-validated workflow of sensor-based city-scale solar radiation modelling is reproducible and scalable, offering local governments a more nuanced understanding of their solar capacity, and paves the way for carbon emission budget management.

3:45pm - 4:00pm

Uncertainty Quantification for Regression Tasks in Earth Observation

Ritu Yadav, Andrea Nascetti, Yifang Ban

KTH Royal Institute of Technology, Sweden

Deep learning, in particular, has driven hundreds of new studies in remote sensing each year. However, ensuring the reliability of these models requires robust uncertainty quantification, an aspect that remains insufficiently explored. Current remote sensing deep learning models typically yield single, deterministic predictions, such as a class label for each pixel or a single biomass value for a given location or region. While commonly used metrics such as RMSE or classification accuracy summarize overall model performance, they fail to convey the reliability of individual predictions, leaving users without guidance on how much confidence to place in each output. Uncertainty estimation addresses this critical gap by quantifying the variability or confidence associated with model predictions. This enables practitioners to interpret not only what the model predicts but also how confident it is in those predictions, providing a more nuanced understanding that is essential for informed decision-making.

We address aleatoric uncertainty using Sentinel-1 and Sentinel-2 time series, proposing two approaches: (i) Gaussian UC, which predicts mean and standard deviation, and (ii) Quantile UC, which estimates the 10th, 50th, and 90th quantiles to capture asymmetric errors. We evaluate these approaches on three representative EO tasks: building height, canopy height, and aboveground biomass estimation. Our results (ID and OOD) show that both models achieve accuracy comparable to deterministic benchmarks while providing well-calibrated, interpretable, and operationally useful confidence estimates. Notably, both models outperform existing global canopy height products on evaluated sites, including the recent 1 m canopy height maps produced by vision transformers.

4:00pm - 4:15pm

Evaluation of OpenStreetMap Data of the Built Environment with the Help of Spatio-Temporal Digital Elevation Models

Ruiqi Liu, Martin Breunig, Paul Kuper, Mulhim Al-Doori

Karlsruhe Institute of Technology, Germany

Recent advances in remote sensing have shifted the focus from the analysis of individual image scenes to the understanding of complex earth systems. This means that the analysis of dynamic evolutions replaces previous static examinations for fixed time points. Furthermore, interdisciplinary research and the integration of heterogeneous data sources are characterizing this transformation process. Digital Elevation Models (DEMs) are predestined for supporting this process by supplementing orthophotos and map data. Promising applications include city planning, landslide analysis, and flood risk assessment where spatio-temporal change detection is a central concept to be applied. Concerning map data, the OpenStreetMap project, based on the idea of Volunteered Geographic Information, has revolutionized the effective production and update of digital maps. However, OSM data does not include elevation information and often contains incorrect geometric information. In this paper, we introduce a self-training framework for validating OSM building footprints with the aid of high-resolution DEMs. The framework supports building segmentation with a self-supervised approach to improve the representation of OSM building footprints. The availability of Digital Elevation Models is used to check the quality of OSM data. The applicability of the approach is demonstrated by a case study conducted in Karlsruhe, Germany. The promising results are described in detail. With our approach, change detection of OSM data can also be carried out using different temporal versions of DEM and OSM data.

4:15pm - 4:30pm

Uncertainty quantification of laserscanning point clouds for road asset classification

Shirin Malihi¹, Ioannis Brilakis¹, Nima Arij², Abbas Kiani², Stefano Cavazzi³, Qiuchen Lu⁴, Graham Starkey⁵, Matt Peck⁶, George Economides⁷

¹Civil Engineering Department, University of Cambridge, United Kingdom; ²Babol Noshirvani University of Technology, Iran; ³Innovation and Research Department, Ordnance Survey, United Kingdom; ⁴Bartlett School of Sustainable Management, University College London (UCL), United Kingdom; ⁵BIM Department, Costain, United Kingdom; ⁶AtkinsRéalis, & University of Birmingham, United Kingdom; ⁷Digital Twins Department, UK Government’s Department for Transport (DfT), United Kingdom

Accurate and reliable road extraction from LiDAR data remains a major challenge when spectral cues are limited or spatial heterogeneity increases model uncertainty. This study introduces a comparative, entropy-driven framework for evaluating the performance and reliability of road asset detection using three supervised machine learning algorithms—XGBoost, Random Forest (RF), and Support Vector Machine (SVM). Using a high-density aerial point cloud, a reproducible computational pipeline was implemented, to help practitioners in real-world scenarios for selecting the most robust and reliable machine learning methods for large-scale road assets mapping. Beyond traditional accuracy metrics (Overall Accuracy, F1-score, and Kappa coefficient), uncertainty-based evaluation of the outputs has been conducted using KPIs of entropy and sensitivity to training sets to quantify model reliability and spatial instability. Results reveal that the inclusion of RGB significantly reduces entropy across all models. XGBoost achieved the lowest mean entropy (0.084–0.143) and the most consistent probabilistic behaviour, reflecting confident and well-calibrated model. SVM, while statistically the most accurate (OA and Kappa > 0.97), exhibited higher local entropy (≈ 0.23–0.26), implying precise yet less certain classification. RF demonstrated the highest entropy (≈ 0.65–0.70) and the greatest variability, underscoring its sensitivity to feature noise. Under the WOR configuration, mean entropy rose markedly—most for RF_WOR (≈ 0.93) and moderately for SVM_WOR (≈ 0.39)—while XGBoost retained low uncertainty.

Spatial entropy maps further highlighted that uncertainty concentrates along road edges with RGB data but expands diffusely under WOR conditions, emphasizing the critical role of spectral–spatial synergy in constraining ambiguity. entropy-based evaluation provided insights beyond conventional accuracy metrics, revealing paradoxes between correctness and confidence.

4:30pm - 4:45pm

S2PT: Spatio-Sequential Point Transformer for Efficient 3D Scene Understanding

Yuqing Shen¹, Akram Akbar^1,2, Yijun Liu¹, Yanyi Li¹, Chun Liu¹

¹College of Surveying and Geo-informatics, Tongji University; ²College of Electronic and Information Engineering, Tongji University

Efficient processing of large-scale 3D point clouds acquired from Terrestrial or Airborne Laser Scanning (TLS/ALS), presents a significant computational challenge. While transformer-based architectures excel at modeling the global context crucial for interpreting these complex scenes, their quadratic computational complexity makes them infeasible for direct application on massive point sets. To address this scalability bottleneck, we propose the Spatio-Sequential Point Transformer (S2PT), a novel hierarchical architecture for efficient and effective large-scale point cloud processing. Our approach begins by serializing the point cloud into an ordered sequence, which enables the use of attention with linear complexity. This not only circumvents the quadratic bottleneck of standard transformers but also establishes a global receptive field at every layer. To compensate for potential information loss during serialization, we further introduce a novel Spatio-sequential Positional Encoding (S2PE) that synergistically combines 3D local geometric features with 1D sequential order information, enhancing the model’s spatial awareness. Experiments on multiple benchmarks demonstrate that S2PT achieves performance comparable to state-of-the-art methods while being significantly more efficient during training and inference, offering a promising path towards scalable representation learning for large-scale 3D scenes.

4:45pm - 5:00pm

Boundary cues for improved 3D semantic segmentation

Waseem Iqbal, Jens-André Paffenholz

Institute of Geotechnology and Mineral Resources – Geomatics, Clausthal University of Technology, Germany

Accurate semantic segmentation of 3D point clouds is a fundamental task in photogrammetry, robotics, and large-scale scene understanding. Despite recent advances in point-based architectures such as PointNeXt, segmentation performance remains limited near semantic boundaries, where local neighborhoods often contain points from multiple classes, leading to feature ambiguity and oversmoothing.

In this paper, we propose a lightweight boundary-aware learning framework that explicitly models boundary regions during training. Boundary supervision is automatically derived from local semantic label disagreement, eliminating the need for additional annotations. An auxiliary boundary prediction head is introduced to learn boundary-sensitive features, which are subsequently integrated into the segmentation process through a late-stage feature fusion mechanism. In addition, a boundary-aware loss formulation emphasizes boundary regions during optimization, encouraging improved feature discrimination at class transitions.

Experimental results on the S3DIS dataset using the standard 6-fold cross-validation protocol demonstrate consistent improvements over the PointNeXt baseline. The proposed method achieves gains of 3.22% in mean Intersection over Union (mIoU) and 2.85% in mean class accuracy (mACC), with notably improved segmentation quality at object boundaries. Importantly, these improvements are obtained without modifying the backbone architecture or increasing inference complexity.

The results indicate that incorporating boundary-aware supervision provides an effective and efficient strategy for improving segmentation performance in challenging regions.

5:00pm - 5:15pm

Identification of nonlinearity and spatial non-stationary effects of local drivers on the synergy between air quality management and carbon mitigation in the Yangtze River Delta urban agglomeration

Man Guo, Nicholas Hamm

University of Nottingham, China, People's Republic of

China is actively pursuing synergistic governance to address air pollution and carbon mitigation issues. This study, focusing on concentration as a key feature, assesses the synergy performance in the Yangtze River Delta Urban Agglomeration (YRDUA), revealing fluctuating trends with only seven cities showing improvement. To further understand the influences from local drivers, we employed an explainable spatial machine learning approach, integrating Geographical Weighted Regression (GWR), Random Forest (RF), and Shapley Additive Explanation (SHAP) to capture nonlinear, threshold, and interaction effects among explanatory variables. The analysis identifies longitude, SO2 emissions from industrial sources, wind speed, latitude, and the proportion of GDP from tertiary sector as the top five influencing factors, emphasizing the importance of geographical position, local air pollution emission, and meteorological condition. Most drivers exhibit nonlinear impacts and interactions with clear thresholds. Such as, wind speeds, exceeding 9.3 m/s negatively impact synergy. Furthermore, spatial heterogeneity of drivers' influence is evident across cities and regions. Specifically, cities along the eastern coast benefit from geographical advantages that enhance synergy in air quality improvement and carbon mitigation. Meteorological conditions, especially wind speed, significantly influence synergy, with notable differences between northern and southern coastal cities. These findings underscore the need for locally tailored governance strategies that leverage each city's unique geographical and socioeconomic attributes to enhance synergistic governance effectiveness. This research contributes to understanding the complex interplay of local drivers influencing synergistic governance in the YRDUA, providing valuable insights for policymakers aiming to improve air quality and promote sustainable development in rapidly urbanizing regions.

3:30pm - 5:15pm

WG III/4B: Landuse and Landcover Change Detection
Location: 714A

3:30pm - 3:45pm

DAL-UNet: A Dual Attention-Coupled ConvLSTM Network for Multi-Temporal Urban Building Change Detection

Qiang Chen, Peng Yue

Beijing University of Civil Engineering and Architecture, China, People's Republic of

With the acceleration of global urbanization, dynamic change detection of urban buildings is vital for urban planning, resource management, and public safety. Traditional bi-temporal remote sensing-based methods fail to capture gradual building evolution and are prone to noise-induced missed detections and false alarms. While multi-temporal imagery provides continuous temporal information, its sequential and high-dimensional nature poses greater challenges. Existing deep learning models like CNNs excel at spatial feature extraction but lack temporal modeling, while LSTM/ConvLSTM struggles with spatial detail preservation and small-target recognition.

To address issues including insufficient temporal modeling, channel redundancy, weakened spatial attention, and small-target loss, this study proposes the Dual Attention-coupled ConvLSTM Network (DAL-UNet). Its encoder embeds a dual attention module: channel attention selects change-related features and suppresses redundancy, while spatial attention enhances key region responses to improve building edge and small-target discrimination. A fully convolutional LSTM module models temporal evolution while preserving spatial topology. The decoder adopts a dual-branch multi-task framework to optimize change feature upsampling and semantic segmentation, enhancing subtle change perception and spatial detail restoration.

Experiments on the SpaceNet7 dataset show DAL-UNet outperforms state-of-the-art methods, with maximum improvements of 13.04% in F1-score, 1.32% in Precision, and 16.52% in Kappa coefficient. It performs exceptionally in high-rise shadow areas and dense small-target regions, reducing shadow interference via attention mechanisms and alleviating class imbalance through class-weighted loss.

3:45pm - 4:00pm

Efficient Fine-Tuning for Building Damage Assessment with High-Resolution Optical Satellite Imagery: A Case Study for War Damage in Ukraine

Sebastian Gapp¹, Corentin Henry¹, Pablo d'Angelo¹, Friedrich Fraundorfer^1,2, Nina Merkle¹

¹Deutsches Zentrum für Luft- und Raumfahrt, Germany; ²Graz University of Technology

In the aftermath of a disaster, whether natural, industrial, or war-related, a rapid and accurate assessment of building damage is crucial for rescue forces to conduct an effective emergency response. Very high-resolution satellite imagery enables such assessments and serves as an important indicator for understanding the scale of destruction, supporting time-critical rescue operations, and guiding resource allocation. While deep learning models have shown promising results in automating building damage assessment (BDA) from pre- and post-disaster optical satellite imagery, they often fail to generalize to new disasters due to domain shifts. This paper studies the challenge of rapid domain adaptation for BDA in the context of the war in Ukraine. We create a new, challenging dataset annotated with damage grades across six cities in Ukraine, using pre- and post-disaster optical imagery. To facilitate rapid adaptation, we propose an efficient fine-tuning workflow using Low-Rank Adaptation. Our experiments show that this approach substantially improves performance in both out-of-domain and in-domain settings, presenting a practical and data-efficient study for deploying BDA models in time-critical emergency scenarios.

4:00pm - 4:15pm

Urban Expansion, Entropy Dynamics, and Ecological Quality: A District-Based Assessment

Anali Azabdaftari¹, Filiz Sunar²

¹Western Sydney University, Australia; ²Istanbul Technical University

This study examines district-level urban expansion and ecological change in the Hills Shire LGA using multitemporal Landsat imagery, Shannon’s entropy, RSEI, and hotspot analysis to identify spatial patterns of growth and environmental stress.

4:15pm - 4:30pm

Urban sprawl analysis using multi-dimensional Urban Sprawl Index (USI) in Bulacan, Philippines

Benedick Agamao¹, Cale Andriane Herrera¹, Bienvenido III Carcellar^1,2, Carlyn Ann Ibañez¹

¹Department of Geodetic Engineering, University of the Philippines Diliman, Philippines; ²Yamaguchi University

Urban sprawl, characterized by land discontinuity, low population density, and inefficient land use, hinders sustainable urbanization, particularly in rapidly growing regions such as Bulacan, Philippines. This phenomenon places strain on existing infrastructure, contributes to environmental degradation, and exacerbates socio-economic disparities. While previous studies have analyzed urban sprawl, these often neglect the integration of socio-economic factors, thereby reducing the accuracy of their analysis and policy relevance for developing regions. This research seeks to analyze urban sprawl patterns within Bulacan through the integration of socio-economic variables and identify key factors driving this sprawl. The study employs urban sprawl analysis, using the Multidimensional Urban Sprawl Index (USI) to assess land discontinuity, population density, and land use efficiency. Additional analysis using fractal analysis and factor analysis through Geodetector was also employed. The study found a positive shift toward more efficient, compact growth in Bulacan from 2005 to 2020, though mild and severe sprawl remain ongoing challenges. Fractal analysis revealed that complex urban forms encourage infill, while open areas are prone to leapfrog development. Land use benefit and road access consistently drove sprawl, with key factors like population and proximity to the city center changing over time. The study recommends stricter enforcement of zoning regulations to mitigate fragmented growth and the integration of additional socio-economic indicators (e.g., GDP, employment rates, and land values) into future analysis.

4:30pm - 4:45pm

A Two-Stage Pipeline of Segmentation and Classification Using Optical Satellite Imagery for Monitoring Inappropriate Embankments

Yasunobu Shimazaki, Yasunari Morita

PASCO Corporation, Tokyo, Japan

This study demonstrated that a two-stage architecture—comprising a segmentation model followed by a classification model—is effective for embankment extraction. By constructing a large, wide-area training corpus from medium-resolution SPOT imagery, transfer learning to higher-resolution satellites (e.g., Pleiades) was readily achieved. For operational use, exhaustively proposing candidates with the AI model and inserting a brief human check (embankment/non-embankment) per candidate can reduce false positives while limiting missed detections, making the approach sufficiently practical for deployment.

4:45pm - 5:00pm

A High-Precision Land-Sea Segmentation Model Based on the Deep Otsu Method

Tengteng Dong, Mi Wang, Tiyou Zhou, Qianyu Wu

State Key Laboratory of information Engineering in Surveying, Mapping and Remote Sensing(LIESMARS), Wuhan University

Land-sea segmentation is crucial for tasks such as marine target detection and coastline extraction in remote sensing imagery. However, complex and diverse background environments and land-sea boundaries can easily lead to inaccurate segmentation. To address this issue, a high-precision land-sea segmentation model based on the deep Otsu method is proposed. This method first utilizes our proposed remote sensing image texture enhancement algorithm based on Retinex theory and the Canny operator to enhance the remote sensing image and its edge information, further improving the segmentation accuracy of the land-sea boundary. Then, we combine deep learning concepts, the maximum inter-class variance method, and our proposed density space clustering method based on the difference innovation optimization algorithm to propose a deep maximum inter-class variance method for segmenting the ocean and land in the image. Simultaneously, an adaptive multi-scale fragmentation region removal method is proposed to remove small, fragmented regions extracted during the segmentation process. Experimental results show that the proposed method achieves an overall prediction accuracy of 98.41% and an average intersection-union ratio of 96.07%, demonstrating its ability to effectively perform land-sea segmentation tasks.

5:00pm - 5:15pm

From Super-Resolution to Superior Land-Cover Detection: Cross-Channel Attention Network for Aerial Images

Yuwei Cai, Zhimeng He, Meiliu Wu, Brian Barrett

University of Glasgow, United Kingdom

Low-resolution imagery is a major constraint for remote sensing tasks (e.g., urban land cover detection) where accurate classification of buildings, roads, vegetation, and small objects is required. Deep learning-based segmentation models are highly sensitive to image quality, resulting in degraded performance on low-resolution inputs. Super-Resolution (SR) techniques offer a promising solution by enhancing image fidelity to support downstream tasks. This work applied MAPSRNet, a Multi-Attention Pyramid SR Network to aerial images used for multi-class land cover detection. Evaluated on the ISPRS Potsdam dataset, MAPSRNet achieves state-of-the-art SR performance with PSNR of 32.92 dB and SSIM of 0.87, outperforming existing methods such as SRCNN (31.54 dB, 0.83) and DRRN (31.03 dB, 0.82) while maintaining competitive inference speed. Beyond image quality, MAPSRNet significantly improves multi-class land cover segmentation when integrated with a ConvNeXtV2-based U-Net, achieving an overall accuracy of 80.60%, mean IoU of 62.54%, and FwIoU of 68.34%, surpassing not only low-resolution inputs (Overall Accuracy: 65.28%, mIoU: 40.20%, FwIoU: 50.12%) but also high-resolution(HR) ones (Overall Accuracy: 80.50%, mIoU: 62.40%, FwIoU: 68.01%), especially in certain classes such as impervious surface and clutter. These results demonstrate that perceptual and structural fidelity, rather than pixel-level similarity, can drive superior performance in urban land cover segmentation. MAPSRNet offers a practical solution for scenarios where HR imagery is limited or unavailable, highlighting its potential for large-scale remote sensing applications.

3:30pm - 5:15pm

WG IV/9C: Spatially Enabled Urban and Regional Digital Twins
Location: 714B

3:30pm - 3:45pm

A Conversational Multi-Agent Platform for BIM Data Intelligence

Amirhossein Nourbakhshrezaei, Nolan Porther, Mojgan Jadidi

Department of Civil Engineering, Lassonde School of Engineering, York Univeristy, Canada

This paper proposes the development of a multi-agent system (MAS) for Building Information Modeling (BIM) environments, where users interact with a 3D model and a chat-bot to query, validate, and analyze building elements. By leveraging conversational AI and modular agents capable of semantic understanding and geometric computation, this system allows users to retrieve data, perform quality checks, and visualize computed results directly using the BIM information. The approach supports diverse tasks, from attribute completion and filtering to volumetric calculations, thus enabling a more intelligent and accessible BIM experience for analytical purposes.

3:45pm - 4:00pm

Bridging geometric Gaps between digital Survey and BIM through open-source IFC-3D Tiles Integration

Raphaël Vouilloz^1,2, Ken Percy², Nicolas Arellano Risopatron², Sabrina Liu², Philippe Marin¹, Stephen Fai²

¹Université Grenoble-Alpes, ENSAG, MHA (Méthodes et Histoire de l'Architecture) - Grenoble, France; ²Carleton University, CIMS (Carleton Immersive Media Studio) - Ottawa, Canada

The adoption of innovative digital heritage workflows in the Architecture, Engineering, and Construction (AEC) sector faces significant challenges, particularly in integrating digital survey data with Building Information Modeling (BIM) into a unified model. This paper begins with a literature review that outlines the geometric and software-environment constraints complicating such integration and examines various proposed solutions, with particular attention to open-source tools and standard formats.

Building on this foundation, the paper introduces an innovative two-stage method: (1) segmenting, classifying, and enriching digital survey data into a BIM model; and (2) developing a web viewer that hybridizes this BIM model with the original survey data. The proposed workflow relies exclusively on open-source tools and open standards, with Industry Foundation Classes (IFC) used as the native editing format. A seamless continuity is established between the Bonsai add-on for Blender, used as a BIM authoring environment, and the web library That Open Engine, which serves as a dissemination tool enabling interactive querying of BIM data within a web browser. This library shares a common dependency on Three.js with 3DTilesRendererJS, allowing the overlay of a tiled photomesh of the asset. This integration enables the combination of an accurate geometric and visual representation with structured metadata interaction within a unified web environment. Overall, the proposed approach provides a robust and flexible framework for supporting practical applications such as dissemination, documentation, and diagnostic studies of heritage assets.

4:00pm - 4:15pm

A comprehensive framework for multi-LoD 3D building model generation using multi-source LiDAR point clouds for Digital Twin development

Mohamed Ismail, Ahmed Shaker, Songnian Li

Department of Civil Engineering, Toronto Metropolitan University, 350 Victoria Street, Toronto, Ontario, M5B2K3 Canada

This study presents a comprehensive and semi-automated framework for generating multi-Level of Detail (LoD) 3D building models using multi-source LiDAR point clouds to support digital twin development. By integrating airborne, drone-based, mobile, and terrestrial LiDAR platforms, the framework addresses limitations of single-source datasets and enables scalable reconstruction across urban and building scales. A robust preprocessing workflow—encompassing subsampling, denoising, colorization, and two-stage registration—significantly enhances point-cloud quality and achieves seamless fusion of heterogeneous datasets with millimetre-level accuracy.

The framework supports outputs ranging from city-scale footprints (LoD0) to detailed parametric building models (LoD4), enabling applications in smart city planning, facility management, and heritage documentation. A knowledge-based segmentation layer further enables the creation of “Smart Point Clouds,” facilitating component-level querying and efficient generation of floor plans, elevations, and façade models. Real-world evaluations in downtown Toronto demonstrate high accuracy and strong computational performance, with LoD0–LoD2 models produced in minutes on a standard workstation.

By ensuring compatibility with CityGML and IFC standards, the framework enhances interoperability within digital twin ecosystems and supports integration with simulation and decision-support systems. While detailed LoD3–LoD4 modeling still requires manual refinement, the workflow establishes a foundation for future automation through AI-driven segmentation and cloud-based parallel processing. Overall, this research advances scalable 3D modeling practices and provides a practical pathway toward comprehensive, data-rich digital twins for smart cities.

4:15pm - 4:30pm

3D Modelling of vegetation from optical and LiDAR point clouds for inclusion in basic nationwide built environment model

Alex Šrollerů^1,2, Jindřich Horák², Petr Dušánek², Markéta Potůčková¹

¹Charles University, Faculty of Science, Department of Applied Geoinformatics and Cartography, Albertov 6, Prague 2, Czech Republic; ²Land Survey Office, Pod Sídlištěm 1800/9, Kobylisy, 182 11 Prague 8, Czech Republic

With the Czech Republic's impending "BIM Act" driving the creation of a basic built environment model, the study proposes a compliant workflow for incorporating 3D models of two key vegetation feature types from the fundamental geographic vector database: "Forest ground with trees" and "Significant or lonely tree, grove." Modelling relies on nationwide datasets, the digital terrain model, the digital surface model based on image matching of aerial imagery, and supplementary aerial laser scanning data. For the forest features, the process comprised optical point cloud filtration and constrained triangulation, resulting in height-extruded forest base polygons with canopy cover tops. The 3D representation uses MultiSurface geometry, recorded as a PlantCover object in CityGML/3DCityDB, and is in line with the LoD2 standard for buildings. For solitary trees, predefined prototypes were scaled and positioned based on individual tree detection and parameters extracted from point clouds. Features were mapped to the CityGML/3DCityDB SolitaryVegetationObjects class, utilizing Implicit geometry to optimize for data volume and visualization speed. While the digital surface model, which can be easily generated from periodically acquired optical imagery, was sufficient for the forest features, aerial laser scanning data was superior in individual tree modelling. The number of extractable parameters increases with point density and is dependent on the platform used. However, the availability of such higher-density laser scanning data in Europe is limited and varies across countries and regions. The results demonstrate the generation of LoD2 compliant 3D models from nationwide datasets for both vegetation features, visually enriching the basic built environment model.

4:30pm - 4:45pm

Developing Construction Supply Chain Management Digital Twins: An Integrated BIM–GIS and Logistics Information Framework

Raafat Khankan, Mojgan Jadidi

Department of Civil Engineering, Lassonde School of Engineering, York University, Canada

Despite the rapidly evolving and widely adopted tools in the Architecture, Engineering, Construction, and Operations (AECO) industry, Construction Supply Chain Management (CSCM) remains a fragmented practice with poor integration and interoperability between Building Information Modelling (BIM), Geographical Information Systems (GIS), and logistics systems. This research aims to bridge the gap between BIM, GIS, and logistics information by developing a unified, data-informed Digital Twins (DT) framework necessary to support multi-criteria decision-making (MCDM) in CSCM. They key characteristics of this work include: (1) a repeatable integration for heterogenous BIM-GIS environments powered by IoT networks; (2) a short-horizon predictive module optimized for construction logistics and Just-in-Time (JIT) delivery; and (3) a democratized analytics interface.

3:30pm - 5:15pm

WG III/8I: Remote Sensing for Agricultural and Natural Ecosystems
Location: 715A

3:30pm - 3:45pm

Automated Coastline Mapping Using Sentinel-2 NDVI on Google Earth Engine: A Decision Support Tool for Diachronic Coastal Monitoring

Abdelaziz Ouzaka¹, Vincent Lecours¹, Maxime Boivin¹, Yannick Duguay²

¹Laboratoire d'Expertise et de Recherche en Géographie Appliquée (LERGA), Université du Québec à Chicoutimi, Chicoutimi, Québec, Canada; ²Centre de géomatique du Québec (CGQ), Cégep de Chicoutimi, Chicoutimi, Québec, Canada

This study introduces an automated decision-support tool implemented on Google Earth Engine for mapping vegetated shorelines using Sentinel-2 NDVI. The tool enables reproducible diachronic coastline extraction, rapid processing of large datasets, and supports coastal change monitoring and management applications.

3:45pm - 4:00pm

Dynamic Shoreline Analysis (1984-2024) in the Municipality of Bragança, Amazon, Brazil

Rafael Alexandre Alves Menezes¹, Kayque da Silva Dias¹, Henrique Gabriel Marques Moura¹, Milena Marília Nogueira de Andrade^1,2, Eder MS de Paula¹

¹Graduate Program in Geography of Federal University of Para, Brazil; ²Federal Rural University of the Amazon, Brazil

Average rates of shoreline change are key indicators for assessing coastal evolution. The study area is located in Bragança, on the northeast coast of Pará, Brazil, covering urban, estuarine and natural areas. Between 1984 and 2024, despite a general trend of increasing coastline, areas with increasing human occupation experienced significant coastal erosion, causing building retreat, partial loss of homes, and damage to beach access roads. Using the Digital Shoreline Analysis System (DSAS) and time series of dense satellite images processed in Google Earth Engine, the coastline was analyzed in the study area. As a result, the average linear rate of variation showed a slight general retreat of the coastline, accompanied by high morphodynamic variability and low statistical consistency in linear trends. Urbanized sectors exposed to ocean forces were the most vulnerable to erosion, while estuarine and mangrove areas were more stable. The high supply of sediments from the estuaries contributed positively to the addition of the coastline in several regions. These findings emphasize the importance of strategic coastal management considering natural and human influences on shoreline dynamics.

4:00pm - 4:15pm

Cross-Sensor Harmonization and temporal Estimation of Mangrove Leaf Reflectance using Multi-Platform hyperspectral data

-- Zohaib¹, Majid Nazeer¹, Man Sing Wong^1,2,3,4

¹Department of Land Surveying and Geo-informatics, The Hong Kong Polytechnic University, Hong Kong S.A.R. (China); ²Research Institute for Sustainable Urban Development, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ³Research Institute for Land and Space, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ⁴Otto Poon Research Institute for Climate-Resilient Infrastructure, The Hong Kong Polytechnic University, Hong Kong, China

This study proposes a practical pipeline for cross-sensor harmonization and short-term temporal estimation of mangrove leaf reflectance using multi-platform hyperspectral data. We combine laboratory (HySpex VNIR-1800; Days 1/3/7), field (Specim IQ; Day 1), and UAV (Cubert X20 Plus; Day 1) measurements over 400–900 nm for three species (Ceriops tagal, Avicennia marina, A. germinans). Field and UAV spectra are interpolated to the HySpex grid, and multiplicative change factors derived from HySpex Day-1→Day-3/7 trends are used to estimate later-day reflectance for non-lab sensors. Accuracy is assessed via RMSE and Pearson’s r, with focus on chlorophyll-sensitive regions (~450, 680, 720–750 nm). Systematic platform effects appear: in-field spectra exceed HySpex by ~2.5% (A. germinans), ~5.7% (A. marina), and ~11.5% (C. tagal), while HySpex exceeds UAV by ~4.38%, ~7.89%, and ~11.5%, respectively. After harmonization, temporal consistency is strong for A. germinans (RMSE ≈0.047–0.050; r ≈0.958–0.981) and solid for A. marina (Specim RMSE ≈0.066–0.081; r ≈0.943–0.970), with higher UAV variability. Spectral trajectories track post-harvest stress: ~15–20% decline near 680 nm for C. tagal and ~10% for A. germinans, alongside expected green and red-edge/NIR shifts. The workflow enables comparable, temporally resolved spectra across instruments, supporting scalable vegetation phenotyping and long-term mangrove monitoring where single-sensor continuity is limited.

4:15pm - 4:30pm

UAS-Based Spectral Imaging for Coastal Vegetation Monitoring and Management – A Case Study

Nithish Manikkavasagam¹, Sudhagar Nagarajan¹, Pandiyan Kesavan¹, Tucker Hindle¹, Peter DeWitt²

¹Florida Atlantic University, United States of America; ²U.S. Department of Interior Bureau of Land Management

Coastal vegetation provides essential protection against shoreline erosion, wave action, storm surge, and supports biodiversity in low-lying tidal environments. This research discusses methods of using UAS based hyperspectral and multispectral sensors and a deterministic Spectral Information Divergence approach to monitor and preserve the ecosystem in coastal environments. The work focusses on implementing the methodology for monitoring different species of mangrove in a protected natural area located in Florida, USA. The achieved accuracy of 90% proves the ability of UAS based remote sensing system to support a resilience-based restoration and long-term monitoring.

4:30pm - 4:45pm

Monitoring Tropical Moist Forest Loss in Sierra Leone’s Protected Areas: Remote Sensing Insights from the Western Area Peninsula National Park

Sayana de Gorostizaga¹, Sarah Muir¹, William Hopkins², Hannah Kemper^1,3, Ivana Milic¹, Paola Di Francesco¹, Giancarlo Pini¹

¹United Nations World Food Programme (WFP) Headquarters, Rome, Italy; ²United Nations World Food Programme (WFP) Sierra Leone Country Office, Freetown, Sierra Leone; ³Ruhr-Universität Bochum, Germany

Deforestation remains a critical global challenge with profound implications for food security, ecosystem resilience, and disaster risk reduction. In Sierra Leone, the Western Area Peninsula National Park (WAPNP), one of the country’s last remaining tracts of primary tropical moist forest, faces increasing pressures from illegal logging, mining, and land encroachment despite legal protection since 2012. These activities threaten essential ecosystem services, including water provision, fertile soils, and local climate regulation, while exacerbating vulnerability to floods, landslides, and droughts.

This study evaluates the extent of WAPNP’s closed-canopy forest cover using Sentinel-2 imagery from 2020 to 2024, complemented by very-high-resolution (VHR) data and ground-truth observations for validation. The analysis identifies the main human drivers of forest loss and maps the spatial distribution and remaining extent of forest cover within the park. The results highlight the power of combining Copernicus Sentinel-2 imagery with open-access forest datasets to provide a reproducible, and cost-effective monitoring of forest cover in data-limited tropical regions, offering a valuable tool for conservation planning and management.

4:45pm - 5:00pm

Model ensemble to constrain uncertainties in the estimation of water needs in woody crops by Remote Sensing

Juan Manuel Sánchez¹, Alejandro Moya², Héctor Nieto³, Álvaro Sánchez-Virosta¹, Joan Miquel Galve¹, José González-Piqueras¹

¹Remote Sensing and GIS Group, Universidad de Castilla-La Mancha, Spain; ²Departamento de Ingeniería Informática, Universidad Autónoma de Madrid, Spain; ³Instituto de Ciencias Agrarias (ICA-CSIC), Madrid, Spain

The expansion of irrigated crops such as almond and pistachio in arid and semi-arid regions poses a challenge in a context of water resource scarcity. Understanding crop water requirements across large areas has become feasible thanks to remote sensing techniques and the growing availability of satellite imagery with increasingly higher spatial and temporal resolution. However, models have shortcomings that lead to uncertainties in their estimates. In this study, we introduce the model ensemble technique as a method to constrain uncertainty in crop water requirements, with a particular focus on woody crops. This study is centered in the province of Albacete, for the period 2022–2024, and combines two surface energy balance models, METRIC and SenET_TSEB, with a water balance model asssited by NDVI imagery to obtain time series of daily actual crop evapotranspiration (ETa), with a spatial resolution of 20–30 meters. Comparison with in situ measurements recorded at two eddy-covariance towers located in almond and pistachio orchards shows better correlation of the results using the ensemble. At a weekly scale, an average error of 4.9 mm d⁻¹ and 2.8 mm d⁻¹ are obtained for the almond and pistachio crops. Accumulated ETa values over the growing season are consistent and provide confidence to assist in irrigation scheduling, detect stress conditions, and/or quantify water needs at a plot scale. These results reinforce the role of satellite remote sensing in water resources management, in particularly relevant crops for our region such as almond and pistachio orchards.

5:00pm - 5:15pm

GNSS-R Vegetation Water Content Retrieval Considering Surface Types

Chuanchi Hao, Kegen Yu, Yu Cao

China University Of Mining And Technology, China, People's Republic of

This study verifies the effectiveness and advantages of spaceborne GNSS-R technology for VWC retrieval, and clarifies that the intercept feature of vegetation observations and Γpeak reflectivity are the core components for constructing high-precision models. The proposed method provides a new technical means for large-scale and efficient VWC monitoring, and has positive significance for improving the assessment of vegetation health and disaster risks.

3:30pm - 5:15pm

WG III/6A: Remote Sensing of the Atmosphere
Location: 715B

3:30pm - 3:45pm

Deep Pretraining Unleashes the Potential of Aerosol Size Information Retrieval

Xing Yan

Beijing Normal University, China, People's Republic of

Aerosol size information, typically represented by fine- and coarse-mode aerosol optical depth (fAOD and cAOD), is crucial for understanding anthropogenic emissions and radiative effects. However, satellite-based retrievals suffer from limited labeled data and high uncertainty over land. To address these challenges, we developed a novel deep pretraining framework capable of mining latent representations from unlabeled satellite pixels, thereby enhancing the accuracy and generalization of aerosol size information retrieval. The framework leverages a self-supervised pretraining stage to capture intrinsic spatiotemporal correlations in multispectral satellite data and transfers these latent features to a supervised fine-tuning model. Using MODIS data combined with AERONET observations, our pretrained model achieved a 10% improvement in correlation and a 15% enhancement in regions without ground observations compared to conventional deep-learning models. The retrieved global fAOD from 2001–2020 reveals a significant decreasing trend (−1.39 × 10⁻³ yr⁻¹), with regional differences—most notably, a threefold stronger decline over China than the global average. These results demonstrate that deep pretraining can effectively exploit unlabeled satellite information, bridging the gap between sparse ground networks and dense global observations, and offering a transformative approach for large-scale aerosol characterization and climate studies.

3:45pm - 4:00pm

Retrieval of aerosol optical/microphysical parameters of FY-4A geostationary satellite based on Transformer

Siyu Liu¹, Lina Xu¹, Minghui Tao², Xincai Chang¹, Huang Zhang¹, Jianxin Ling¹, Dandi Liao¹

¹Hubei Subsurface Multi-scale Imaging Key Laboratory, School of Geophysics and Geomatics, China University of Geosciences, Wuhan, 430074, China; ²Hubei Key Laboratory of Regional Ecology and Environmental Change, School of Geography and Information Engineering, China University of Geosciences, Wuhan, 430074, China

Atmospheric aerosols are a key factor influencing the Earth's radiation balance and climate change, and the accuracy of their retrieval is crucial for environmental monitoring and climate research. FY-4A AGRI, with its high-frequency observation capability, can provide aerosol data at high temporal resolution. Combined with deep learning technology, it enables efficient monitoring of dynamic aerosol variations. This study develops a retrieval algorithm for aerosol optical and microphysical parameters based on the Transformer deep learning model, specifically designed for the FY-4A geostationary satellite. The algorithm achieves multi-parameter collaborative retrieval of aerosol optical depth (AOD), fine/coarse-mode aerosol optical depth (FAOD/CAOD), and single scattering albedo (SSA). This research overcomes the reliance on prior assumptions inherent in traditional physical retrieval methods. By integrating multi-band spectral features, geometric observation parameters, and data from 104 AERONET sites, it significantly enhances retrieval accuracy under the complex surface conditions of East Asia. Experimental results demonstrate high accuracy in validation against AERONET sites, with correlation coefficients of R=0.915 for AOD, R=0.897 for FAOD, R=0.851 for CAOD, and R=0.536 for SSA. Comparative validation of various aerosol product spatial distributions highlights the advantages of the proposed algorithm in capturing aerosol diurnal variations (such as haze dissipation processes) and extreme events (e.g., dust storms and biomass burning). This study provides a new technical approach for regional air quality monitoring and climate effect assessment, advancing the application of China’s geostationary meteorological satellites in aerosol monitoring.

4:00pm - 4:15pm

Bioaerosol-driven heavy metal deposition and Biospheric response: A remote sensing-assisted Phytoremediation study in the Pin Valley National Park, North-Western Himalayas

Abhinav Galodha¹, Deepika Sharma²

¹School of Interdisciplinary Research (SIRe), Indian Institute of Technology Delhi, IIT Delhi, India; ²Department of Botany, Himachal Pradesh University (HPU), Shimla, Himachal Pradesh, India

Heavy metal pollution presents a formidable challenge to global ecosystems, threatening biodiversity, soil and water quality, and human health. The atmosphere serves as both a source and long-range conveyor of bioaerosols, complex particles that include bacteria, fungal spores, and dust-bound heavy metals, profoundly influencing biosphere health and ecosystem function. In this study, we investigate atmosphere-biosphere interactions in Pin Valley National Park, a cold desert ecosystem in the Western Himalayas, by analyzing how bioaerosol-mediated deposition of heavy metals shapes vegetation stress and phytoremediation dynamics. Integrating field spectroscopy, in-situ chemical analysis (ICP-MS), and multi-temporal satellite data, we mapped heavy metal hotspots (Pb, Cd, Ni, Cr) and linked them to shifts in vegetation health and thermal indices. We observed significant spatial overlap between elevated metal concentrations likely introduced via long-range atmospheric transport and suppressed vegetation indices. Phytoremediator species such as Brassica juncea and Populus exhibited strong metal uptake, revealing natural biospheric buffering capacity against airborne contaminants. Additionally, iron oxide and hydrothermal indices indicated that soil mineral conditions, modulated by deposition, may influence microbial and root zone dynamics. This multidisciplinary assessment underscores the role of the atmosphere not merely as a depositor but as a dynamic bioreactor influencing terrestrial microbiomes and plant stress responses. By offering a scalable, remote sensing–assisted framework for monitoring ecosystem health and contaminant transport, our work directly supports SDG 13 by identifying atmospheric pathways of pollutant stress under warming trends, contributes to SDG 15 by protecting fragile alpine ecosystems through phytoremediation, and aligns with SDG 17 as an interdisciplinary approach.

4:15pm - 4:30pm

Assessing cross-season, AOD-PM2.5 Relationships as a Function of Meteorological Parameters in Sherbrooke, Québec, Canada

Alireza Zhaleh Doost, Norman T. O'Neill, Yacine Bouroubi, Mickaël Germain

Université de Sherbrooke, Canada

The relationship between aerosol optical depth (AOD) and surface PM2.5 concentrations remains a significant difficulty in remote sensing-based air quality assessments due to meteorological conditions and aerosol vertical structure. This relationship is investigated using daily observations from 2021 to 2024 in Sherbrooke, Quebec, Canada.

Ground-based AERONET AOD500 and satellite-based MAIAC AOD at 550 nm are analyzed separately, together with surface PM2.5 measurements from a local PurpleAir sensor. Meteorological parameters such as relative humidity, boundary layer height, temperature, and wind speed are available from ERA5 reanalysis. Vertically resolved aerosol information from MPLNET lidar is used to identify elevated aerosol layers associated with transported wildfire smoke.

The approach combines Pearson and Spearman correlations, partial correlation analysis, multivariate regression, and Random Forest (RF) modeling to capture nonlinear interactions. Results indicate weak but statistically significant correlations between AOD and PM2.5 (r ≈ 0.26-0.30), with stronger monotonic relationships. A pronounced seasonal dependence is observed, with the strongest coupling in autumn and weak or insignificant relationships in winter. Partial correlation analysis suggests that a residual association between AOD and PM2.5 remains after accounting for meteorological influences.

RF models improve predictive performance (R² ≈ 0.39), although performance degrades in winter. Sensitivity analysis indicates that transported smoke plumes can influence the AOD-PM2.5 relationship, particularly when partial mixing into the boundary layer occurs.

4:30pm - 4:45pm

First global XCO2 Observations from spaceborne Lidar

Hongyuan Zhang, Ge Han, Yiyang Huang, Wei Gong

Wuhan University, China, People's Republic of

Over the past decade, nearly ten satellites dedicated to atmospheric CO2 concentration monitoring have been launched, significantly advancing our understanding of the global carbon cycle. In 2022, China launched the DaQi-1 (DQ-1) satellite, which carries the Aerosol and Carbon Dioxide Lidar (ACDL)—the first spaceborne lidar sensor for CO2 monitoring. Relying on laser-based active sensing, ACDL can detect global XCO2 at nighttime, serving as an important complement to existing passive optical CO2 satellite missions. This study aims to introduce the scientific community to the XCO2 retrieval methodology of ACDL and its initial XCO2 product. The first version of ACDL XCO2 products scheduled for release is called “v1.0”. This paper presents a comparison between XCO2 at daytime and nighttime. Nonetheless, challenges remain, including reliance on meteorological reanalysis data and uncertainties in spectroscopic parameters. In future product versions, we plan to improve data quality through enhanced denoising techniques and signal processing methods for low signal-to-noise ratio (SNR) cases. We hope that this initial ACDL XCO2 product will spark broader interest and participation from the scientific community, thereby contributing fresh momentum to climate change research.

4:45pm - 5:00pm

Cross-city transfer learning for Sentinel-5P-driven NO2 prediction in data-sparse urban environments

Fjoralba Janku¹, Francesco Mauro¹, Luigi Russo², Babak Memar³, Alessandro Sebastianelli⁴, Paolo Gamba², Silvia Liberata Ullo¹

¹University of Sannio, Benevento, Italy; ²University of Pavia, Pavia, Italy; ³University La Sapienza, Rome, Italy; ⁴CMCC Foundation - Euro-Mediterranean Center on Climate Change, Caserta, Italy

Traditional forecasting methods of air pollutants show intrinsic limitations due to the complexity of atmospheric interactions. Recent research has moved toward the employment of artificial intelligence (AI)-based approaches and satellite data processing. The framework proposed in this study is a transfer learning (TL) model to estimate surface-level NO2 concentrations across multiple locations by using satellite and environmental data.

The approach integrates Sentinel-5P TROPOMI-derived tropospheric NO2 columns, meteorological variables (temperature, precipitation etc), spatial coordinates and temporal features.

A CatBoost regression model is implemented, leveraging a Leave-One-City-Out (LOCO) TL framework across five cities (Berlin, London, Madrid, Paris and Toronto) in the world. This enables the model transfer from multiple source domains to a new target city with minimal ground-based data. Experimental results are outperforming city-specific baseline models, by showing an increased prediction accuracy, a reduced Root Mean Square Error (RMSE) by approximately 7% and a Coefficient of Determination (R2) higher by 2.7%. Toronto, which represents an environment with a low monitoring density, benefits most from TL, with R2 improving from 0.58 (baseline) to 0.66 (transfer) and RMSE dropping from 6.44 µg/m3 to 5.84 µg/m3.

A detailed Leave-One-Block-Out (LOBO) ablation study shows how each group of features contributes to the performance of the model. Spatial coordinates and meteorological features are the most influential predictors of NO2 concentration, while the satellite NO2 data increase model generalization. These results highlight the potential of cross-city TL and remote sensing synergy for scalable urban air pollution monitoring, especially in limited ground-based monitoring scenarios.

5:00pm - 5:15pm

Enhanced Ozone Downscaling in Megacities Using a SHAP-Optimized U-Net Model

A.A. Kakroodi, Hossein Barekati, Hamid Kiavarz Moghaddam

University of Tehran, Iran, Islamic Republic of

High-resolution mapping of tropospheric ozone is essential for urban environmental assessment; however, satellite-derived ozone products are generally too coarse to capture neighborhood-scale variability in complex megacities such as Tehran. This study introduces an interpretable deep-learning framework that downscales coarse Sentinel-5P ozone observations to a 30-m spatial grid by integrating a U-Net convolutional architecture with SHapley Additive exPlanations (SHAP). A diverse suite of predictors—including land-surface indicators, meteorological parameters, terrain morphology, and chemical precursors—was harmonized and resampled to a unified spatial resolution. SHAP analysis was applied to quantify each predictor’s contribution, enabling the removal of redundant or low-impact variables before model training. Using spring 2020 as the evaluation period, the optimized U-Net successfully reconstructed fine-scale ozone gradients and reproduced Tehran’s characteristic north–south pattern driven by topography and emission density. Comparative analysis with preliminary outputs demonstrates that feature optimization enhances spatial coherence, reduces noise artifacts, and improves the representation of localized hotspots. Statistical evaluation further showed strong agreement between the downscaled ozone estimates and observational data at both station and district scales, demonstrating effective generalization across heterogeneous urban environments. Overall, the findings highlight the potential of combining deep learning with interpretability techniques to refine coarse satellite ozone observations and provide a scalable, high-resolution framework for urban air-quality monitoring and exposure assessment.

3:30pm - 5:15pm

IvS9: Remote Sensing and Geospatial Technologies for Vegetation Fire Management and Recovery Resilience
Location: 716A

3:30pm - 3:45pm

A new Canadian radar satellite mission to retrieve snow water equivalent

Benoit Montpetit¹, Chris Derksen¹, Vincent Vionnet¹, Marco Carrera¹, Julien Meloche¹, Nicolas Leroux¹, Franck Lespinas¹, Jean Bergeron²

¹Environment and Climate Change Canada, Canada; ²Canadian Space Agency

This talk will highlight the future Canadian radar satellite mission, currently named the Terrestrial Snow Mission, under development by Environment and Climate Change Canada, in partnership with the Canadian Space Agency and Natural Resources Canada.

The mission concept will be presented, as well as recent scientific advancements made in the field of snow radar remote sensing, modeling and data assimilation, to continue the advancement of the mission's science readiness level.

This Canadian radar mission will provide weekly coverage of the northern hemisphere with Ku-band SAR data, and, coupled with modeled data, will provide daily snow water equivalent data, to assist hydrological applications and decision-making.

3:45pm - 4:00pm

Airborne Lidar derived Snow Water Equivalent outputs to improve spatialized Raven hydrologic Snowpack Water simulation

Celeste Barnes¹, Ryan MacDonald², Brandi Newton³, Chris Hopkinson⁴

¹University of Lethbridge, Alberta, Canada; ²MacDonald Hydrology Consultants Ltd., Cranbrook, BC, Canada; ³Alberta Environnent and Protected Areas, Alberta, Canada; ⁴Western University, Ontario, Canada

River systems originating from the Southern Alberta Canadian Rocky Mountains provide snowpack meltwater to an extensive downstream reservoir and irrigation network. Future water supplies have the potential to be significantly decreased due to changes in climate and reduced winter snowpack melt regimes. Estimating accurate water volumes in mountain regions is especially challenging. Current practices for estimating snow water equivalent (SWE) over a large mountain region use single point field-based snow measurements generally at valley or sub-alpine elevations. These field measurements are not spatially representative of basin-wide snowpack variability. The Alberta River Forecast Centre uses the Raven hydrological modelling framework to estimate daily winter snow water equivalent (SWE). To address the need for more accurate simulations of spatially explicit SWE, a combined airborne lidar and field snowpack sampling and modelling framework was compared with a Raven Model simulation. “Single point in time” SWE estimates were obtained between 2014 to 2021 using a combination of a) airborne lidar snow depth models, and b) public field sampled snow density. However, annual water yields cannot be generated from this type of snow sampling. The goal of this study was to improve spatialized Raven modelled SWE using the spatially-explicit lidar-based gridded SWE estimates across the West Castle Watershed (WCW, approximately 100 km^2). Results indicated Raven modelled SWE outputs were underestimated in comparison to the lidar-derived SWE with the largest deviation in the sub-alpine forested and grassland areas. Further research aims to use these comparative data to improve Raven-simulated wintertime headwater SWE estimates.

4:00pm - 4:15pm

Assessing SWOT WSE retrievals and monitoring karst-influenced surface water dynamics in Bruce Peninsula National Park

Ethan Parker, Ben DeVries, Aaron Berg

University of Guelph, Canada

This study evaluates water surface elevation (WSE) retrievals from the Surface Water and Ocean Topography (SWOT) mission and investigates lake dynamics in the karst influenced environment of Bruce Peninsula National Park, Ontario. SWOT derived WSE measurements are validated against high frequency in situ depth logger data referenced to a consistent vertical datum using GNSS. The analysis compares multiple SWOT products, quality filtering approaches, and pixel aggregation methods to determine optimal workflows and assess performance under varying surface conditions, including open water, small surface area (<1km2), vegetation, and ice cover. Results demonstrate that SWOT accuracy is strongly dependent on surface conditions and lake characteristics, with reduced performance in smaller or vegetated systems. The study also examines spatial correlations in lake level variability to identify potential karst influences on hydrological connectivity. These findings provide guidance for the effective use of SWOT in monitoring inland water systems and highlight its potential and limitations for hydrological applications in complex environments.

4:15pm - 4:30pm

Snowpack Water Resource Forecasting and Public Education using Airborne Lidar Sampling, Imputation, Melt Simulation and Game Engine Visualisation

Chris Hopkinson¹, Celeste Barnes², James Craig³, Ryan MacDonald⁴, Brandi Newton⁵, James Graham⁶

¹Western University, Canada; ²University of Lethbridge; ³University of Waterloo; ⁴MacHydro; ⁵Govt Alberta; ⁶Neospatial Corp

Comparing airborne lidar datasets collected during snow-free and snow-covered ground conditions enables snow depth mapping at high accuracy and resolution (Hopkinson et al. 2004, Deems et al. 2013). Imputation of snow depth samples combined with field-based or modeled density can produce SWE for small to meso-scale (~100 km2) watersheds (Barnes et al, Submitted, Cartwright et al. 2020, Hopkinson et al. 2012). The goal of this study was to test lidar-based sampling and imputation in an operational regional (>20,000 km2) basin-scale SWE and runoff forecasting framework. Following initial tests in the winter of 2023, two lidar sensors were flown in March (Teledyne Optech Galaxy) and April (Teledyne Optech Titan) 2024 (and again in 2025 and 2026 – results not reported here), to collect 76 snow depth transects (~1 km wide, >2,000 km2) over the Bow and Oldman River Basin headwaters (>400 km north-south, >50 km east-west) near coincident with field samples at 28 sites. For 85 transect intersections, snow depth covariance was high (r2 0.70, RMSE 0.12m), with a small but acceptable bias of -0.04m or -5% (r2 0.94, n 198). An online digital twin platform is being developed to host the snow depth modeling results, as well as real-time weather telemetry and landscape change for public education and data dissemination purposes.

4:30pm - 4:45pm

A Deep Learning-Based Approach for Field-Scale Surface Soil Moisture Estimation Using SAR and Optical Satellite Data

Elahe Jahan Nejadi, Ramata Magagi, Kalifa Goïta

Université de Sherbrooke, Département de géomatique appliquée, Centre d’applications et de recherches en télédétection (CARTEL), QC, Canada

Surface soil moisture (SSM), representing the moisture content within the top layer of soil, provides valuable information and plays an important role in agricultural management. This study presents a deep learning (DL)-based method to estimate field-scale SSM time series over vegetated agricultural areas in Manitoba, Canada, by combining microwave and optical remote sensing (RS) data with auxiliary information. The input dataset was built using Sentinel-1 Synthetic Aperture Radar (SAR) and Harmonized Landsat Sentinel-2 (HLS S30) optical imagery, together with meteorological variables, soil temperature, crop type, topography, and soil texture. Since Sentinel-1 and HLS images were not acquired simultaneously, temporal interpolation was applied to align optical feature values with SAR acquisition times. Features were extracted at 30 m around nine Real-Time In-Situ Soil Monitoring for Agriculture (RISMA) stations.

A one-dimensional convolutional neural network (1D-CNN) was developed to learn local temporal patterns from the multi-source input dataset. The model was trained on multi-year data from 2016 to 2024 and externally validated on 2017 and 2021. On the validation dataset, the model achieved strong accuracy, with R² = 0.815, RMSE = 0.036 m³/m³, and MAE = 0.026 m³/m³. Model interpretation using Shapley additive explanations (SHAP) highlighted a physically coherent set of predictors, including vegetation cover and structure indices, radar backscatter features, solar radiation, minimum air temperature, and precipitation. Overall, the proposed DL framework provides accurate and interpretable field-scale SSM estimates suitable for agricultural monitoring and downstream water-management applications.

4:45pm - 5:00pm

Issues and potentials of multi-sensor water level monitoring: lesson learned at Recentino Lake, Italy

Alireza Hamoudzadeh¹, Lorenza Ranaldi¹, Veronica Laici¹, Alessandra Maria De Pace¹, Valeria Belloni¹, Roberta Ravanelli², Mattia Crespi^1,3

¹Geodesy and Geomatics Division, Sapienza University of Rome, 00184 Rome, Italy; ²Geomatics Unit, University of Liège, 4000 Liège, Belgium; ³Sapienza School for Advanced Studies, Sapienza University of Rome, 00161 Rome, Italy

Surface water monitoring is critical due to increasing climate impacts, yet small reservoirs (0.01–1 km²) often lack the in-situ infrastructure required for consistent observation. This study evaluates the reliability of the Surface Water and Ocean Topography (SWOT) satellite mission for monitoring such water bodies by integrating UAV-based Digital Elevation Models (DEMs) and traditional gauge station data. A UAV survey was conducted at Recentino Lake (Umbria, Italy) in December 2024 to generate a high-resolution DEM (1.56 cm/pixel) with a vertical accuracy of 3.4 cm. Parallelly, SWOT data were processed by strictly retaining high-quality flags and applying a temporal outlier removal filter based on water level change velocity. The water surface elevation (WSE) derived from the DEM was compared with the processed SWOT data and in-situ gauge records. Results indicated high consistency between the UAV-DEM and SWOT-derived levels (110.78 m and 110.76 m, respectively) after harmonizing height reference frames. Conversely, comparisons with the gauge station revealed significant systematic biases (+18 cm vs. DEM; +44 cm vs. SWOT), attributed to the gauge’s undefined vertical datum. Despite this bias, the SWOT and gauge time series showed a reasonable correlation. These findings demonstrate the applicability of SWOT data for monitoring small reservoirs but underscore the critical challenge of vertical inconsistency across observing systems. Also, the study highlights the urgent need for unified vertical reference frames to ensure the accurate integration of heterogeneous hydrological data from different sources (satellite, aerial, and ground).

5:00pm - 5:15pm

Physics-Based and Machine Learning Approaches for Adjacency Effect Correction in Small Inland Water Bodies: A Case Study of Canadian Lakes Using Sentinel-2 Data

Marc-Antoine Lecours-Toutloff, Guy Armel Fotso Kamga, Samuel Foucher, Yacine Bouroubi, Yannick Huot, Norm O’Neill

Department of Applied Geomatics, Université de Sherbrooke, Canada

This presentation focuses on the challenge of atmospheric correction for high-resolution optical satellites (Sentinel-2) in the presence of adjacency effects, a major source of radiometric bias over small inland water bodies. Because water reflectance is extremely low in the visible and near-infrared, even small contributions of photons scattered from surrounding land surfaces can distort surface reflectance estimates of the observed water body. Traditional physics-based models such as 6SV offer radiative consistency but are limited by assumptions of atmospheric homogeneity and Lambertian surfaces, while empirical and semi-empirical approaches struggle to generalize across diverse atmospheric and geometric conditions.

This project addresses these limitations by developing a Physics-Informed Machine Learning (PIML) pipeline. We emulate heavy 3D Monte Carlo simulations to generate synthetic point-spread function (PSF) datasets. These datasets feed a tabular foundation model (TabPFN), leveraging In-Context Learning to capture the adjacency effect's non-linear dynamics without architectural retraining. We compare TabPFN against classical machine learning (XGBoost) using Sentinel-2 and in situ data. Results demonstrate TabPFN's superiority in resolving complex higher-order scattering, offering a rapid, physically consistent operational pipeline.

3:30pm - 5:15pm

Forum3B: Legacy Project: How to Secure Funding to Support Geospatial Activities
Location: 716B

3:30pm - 5:15pm

Forum8B: Wildfire Remote Sensing - Bridging Public and Private Solutions
Location: 717A

3:30pm - 5:30pm

InS6: Industry Tech Session
Location: 717B

3:30pm - 5:30pm

P3: Poster Session 3
Location: Exhibition Hall "E"

Concealed Object Discrimination in Forested Areas using PolTomoSAR with various Baseline Configurations

Youssra El Bennioui¹, Laurent Ferro-famil^1,2, Yue Huang³

¹ISAE-SUPAERO, Toulouse, France; ²CESBIO, University of Toulouse, France; ³Meteo-France, Toulouse, France

Detecting objects hidden beneath a forest cover with Synthetic Aperture Radar (SAR) is challenging due to strong vegetation scattering, canopy attenuation, and ground returns. This work investigates two methods for detecting concealed targets using Polarimetric tomographic SAR (PolTomoSAR). The first approach exploits full-rank polarimetric tomographic focusing to achieve high-resolution separation of scattering sources and estimate their polarimetric responses. Target detection is then carried out using descriptors derived from decomposition techniques, such as the polarimetric entropy, and double-bounce scattering intensity, enabling the identification of man-made objects embedded within a dense vegetation layer. The second approach considers a compact configuration using only two interferometric SAR (InSAR) images. Coherent ground-notching suppresses the dominant ground scattering contribution, while preserving responses from above-ground scatterers. It is demonstrated that the baseline value plays a significant role in the detection process, and an optimum value is selected. Both methods are evaluated using L-band data set acquired by the DLR F-SAR over Dornstetten, Germany. Results demonstrate successful detection of concealed objects for varying baseline configurations.

Crop Classification Using Time-Series Landsat Data: A Comparison of Attention-Based LSTM, GRU, and TCN Models

Yuta Tsuchiya, Rei Sonobe

Shizuoka University, Japan

This study aimed to develop a highly accurate crop classification framework using multi-temporal Landsat 9 imagery and advanced deep learning architectures for the Tokachi Plain, a major agricultural region in Japan. Six time-series scenes, acquired between May 2 and September 16, 2024, were used to classify six crop categories: beans, beetroot, grassland, maize, potato, and winter wheat. Three temporal models with attention mechanisms were evaluated: long short-term memory (LSTM), bidirectional gated recurrent unit (Bi-GRU), and temporal convolutional network (TCN). Of the models tested, the TCN + Attention architecture achieved the highest overall accuracy (81.3%), significantly outperforming LSTM and Bi-GRU (p < 0.001). The Near-Infrared (NIR) band (Band 5) consistently exhibited the highest importance, highlighting its sensitivity to vegetation structure and chlorophyll content. Despite relying on only six optical scenes, the proposed model demonstrated robust performance comparable to or exceeding previous multi-sensor studies. These results underscore the potential of combining freely available Landsat 9 time-series data with attention-enhanced deep learning methods for efficient and scalable crop classification. The findings emphasize the important role of NIR reflectance during key growth stages and the effectiveness of TCN architectures in modeling temporal spectral variations for agricultural monitoring applications.

Evaluating GAN-Based RGB Image Translation Using ALOS-2 Polarimetric SAR Data for Agricultural Monitoring

Hirofumi Shimaoka¹, Rei Sonobe¹, Kan-ichiro Mochizuki²

¹Shizuoka University, Japan; ²Pasco,Japan

Optical satellite imagery plays a vital role in agricultural monitoring but is often constrained by cloud cover and illumination conditions. Synthetic aperture radar (SAR) offers an all-weather alternative, and recent advances in deep generative models provide opportunities to reconstruct optical-like imagery directly from SAR data. In this study, we investigated the potential of generating realistic red-green-blue (RGB) images of croplands using generative adversarial networks (GANs) trained on ALOS-2/PALSAR-2 quad-polarimetric (quad-pol) data. A distinctive feature of our work is the evaluation of not only backscatter coefficients (Gamma nought) but also polarimetric parameters derived from quad-pol decompositions, including the generalised Freeman–Durden, H/A/Alpha, and Yamaguchi four-component methods. Our results showed that paired image-to-image translation methods, such as feature-guiding GAN and pix2pixHD, achieved high similarity to PlanetScope reference imagery, with mean structural similarity index values exceeding 0.98 across all SAR inputs. In contrast, unpaired approaches demonstrated more variable performance depending on the input features. Notably, PUT showed significant improvement when H/A/Alpha or Yamaguchi decompositions were used, whereas Freeman–Durden produced results comparable to Gamma nought. The performance gap between paired and unpaired frameworks was most evident in heterogeneous landscapes, such as areas with adjacent grasslands and forests. These findings demonstrate the effectiveness of GAN-based translation from polarimetric SAR to RGB imagery for agricultural monitoring. The integration of polarimetric information adds value to unpaired learning schemes, and the ability to generate optical-like imagery under challenging observation conditions has strong potential for practical use in crop monitoring and assessment.

Evaluating Mask R-CNN for instance segmentation of ceramic roofs in a Brazilian urban area using UAV imagery

Marcelo Henrique de Oliveira¹, Danilo Marques de Magalhães¹, Antonio Vieira dos Santos Júnior¹, Diandra dos Santos Moura¹, Tomás Lyra Lutterbach¹, Vitória Diniz de Oliveira¹, Luisa da Cunha Vieira²

¹Dept. of Geography and Environmental Planning, São Paulo State University, Rio Claro, Brazil; ²Geotechnical Project Management, BVP Geotecnia e Hidrotecnia, Belo Horizonte, Brazil

The performance of the Mask R-CNN model for instance segmentation of ceramic rooftops was evaluated using a high-resolution orthomosaic generated from UAV-based photogrammetry. Model training and inference were performed in ArcGIS Pro 3.5.3 with a ResNet-50 backbone. The model demonstrated high detection reliability, achieving a Precision of 96.62%, a Recall of 78.81%, and an F1-score of 86.81% at an Intersection over Union (IoU) threshold of 0.5. Most omission errors were associated with light-colored, elongated rooftops, highlighting limitations in the representativeness of the training sample and morphological variability. Fragmentation of larger rooftops into multiple segments was also observed, which affected accuracy metrics. To address this, a topological post-processing step was implemented to merge overlapping polygons, thereby improving segmentation consistency. These results indicate that Mask R-CNN is effective for high-resolution rooftop mapping, especially in applications requiring high precision. The approach is operationally feasible and transferable to similar datasets, enabling scalable analyses. It serves as a complementary tool for urban mapping, supporting the monitoring of urban dynamics and the analysis of construction patterns related to building standards and socioeconomic conditions.

Assessing applications of self-supervised learning for tree species classification from LiDAR point clouds

Jake Pott¹, Baoxin Hu¹, F. Wayne Bell²

¹Dept. of Earth and Space Science and Engineering, York University, Canada; ²Forest Ecology and Silviculture, Ontario Forest Research Institute, Canada

Individual tree species classification from LiDAR (Light Detection And Ranging) point clouds has significant potential to support forest inventory and management, yet remains challenging due to complex three-dimensional canopy structures and the limited availability of labelled ground truth data. This study investigates self-supervised learning for tree species classification from LiDAR point clouds by comparing the PointMAE, a masked autoencoder-based model, with two supervised baselines, PointNet and PointNet++. Using the FOR-species20k dataset, two xperiments were conducted: a 33-species classification and a 6-species classification, each evaluated with point cloud sizes of 2048 and 8192 points. Using 2048 points, the PointMAE achieved the highest overall accuracy in both experiments (0.67 and 0.89 respectively), utperforming PointNet++ (0.63 and 0.84) and PointNet (0.39 and 0.75). Across all models, performance decreased when using 8192 points, indicating sensitivity to point cloud density

and sampling. Per-species analysis showed that coniferous species with distinctive crown geometries were the easiest to classify, while broadleaf species with similar crown forms, particularly Carpinus betulus, were the most challenging. These results show that self-supervised pretraining can improve classification accuracy over fully supervised approaches, highlighting its value for forestry applications where labelled data are limited.

The POD-HAR framework: deriving latent space dynamics for land surface evolution

Weidong Tang¹, Zheng Zhang²

¹Beijing University of Posts and Telecommunications, China, People's Republic of; ²Aerospace Information Research Institute, CAS, Beijing, China

This paper introduces the POD-HAR framework, a novel approach for deriving latent space dynamics in land surface modeling. The framework leverages Proper Orthogonal Decomposition (POD) to reduce data dimensionality by extracting dominant orthogonal modes and their temporal coefficients. It then applies Harmonic Analysis Regression with Sparsity (HAR) to identify sparse, interpretable nonlinear dynamical systems from this low-dimensional representation. By integrating these methods, POD-HAR establishes a regression-based technique for discovering parsimonious, often nonlinear, models that efficiently represent high-dimensional land surface evolution.

Quality Inspection and Intelligent Fusion Method for Automated Production of Large-Scale Remote Sensing Image Tiles

Zhen Wang¹, Cong Wang¹, Chang Gao^2,3, Hongping Zhang¹, Dejin Tang¹, Heng Li¹, Xinyan Zheng¹, Lei Ding¹

¹National Geomatics Center of China, China, People's Republic of; ²BGP INC., China National Petroleum Corporation, Hebei, China; ³Kunlun Digital Technology Co., Ltd. Beijing, China

To address inefficiencies in manual inspection and color/geometric inconsistencies in tile production for web map services, this study develops an automated intelligent post-processing workflow. It integrates three core modules: automatic metadata quality inspection, computer vision-based image quality inspection (targeting invalid regions and color anomalies), and intelligent color uniformity adjustment with seamless edge fusion. By combining rule engines and image processing algorithms, automatic quality control and consistent fusion of produced/online tiles are achieved, significantly improving tile production automation and product reliability.

A study on the role of wake patterns in ship type classification using medium resolution SAR imagery

Benyamin Hosseiny, Kamirul Kamirul, Odysseas Pappas, Alin Achim

University of Bristol, United Kingdom

Classification of vessel types in Synthetic Aperture Radar (SAR) imagery is essential for maritime surveillance, yet distinguishing between ships with similar geometric characteristics—such as cargo and tanker vessels—remains challenging, particularly in medium-resolution images. This study investigates the role of wake patterns in improving ship-type classification using NovaSAR S-band imagery with 6 m spatial resolution. A dataset comprising 319 image patches (205 cargo, 114 tanker) was curated, including both centered ship patches and extended patches capturing wake structures. Experimental results demonstrate that incorporating wake information yields a 2–9% improvement across multiple evaluation metrics compared to ship-only scenarios. These findings highlight the potential of wake patterns as complementary features for enhancing classification accuracy in SAR-based maritime applications.

Super Resolution of Sentinel-2 Imagery Using Latent Diffusion Models For Photovoltaic Site Assessment

Ferdaous Chaabane¹, Mohamed Amin Chamtouri¹, Moez Zouari¹, Ghaith Kouki², Florence Tupin³

¹Higher school of Communication of Tunis, Tunisia; ²State University of New York College of Environmental Science and Forestry, Department of Environmental Ressources and Engineering, United States; ³Department of Image and Signal Processing, Telecom ParisTech, France

The growing demand for renewable energy has emphasized the importance of detailed geospatial information for photovoltaic (PV) site assessment and planning. Sentinel-2 imagery provides a valuable and widely accessible resource, yet its native 10-meter spatial resolution limits the ability to identify small structures such as rooftops, narrow roads, and compact built-up zones. This constraint affects the accuracy of solar suitability analyses and highlights the need for enhanced-resolution imagery capable of capturing finer spatial details.

This paper presents a photovoltaic (PV) assessment and optimization framework that integrates a resolution enhancement module based on latent diffusion models. This module operates in the latent space and relies on an iterative diffusion process to reconstruct fine urban and peri-urban structures, leading to higher-resolution products that support more accurate PV potential analysis and solar deployment.

Cloud-filtered Sentinel-2 L2A scenes are processed through this framework to produce ×4 enhanced imagery with an effective 2.5-meter resolution. Pretraining on

cross-sensor datasets can support realistic recovery of buildings, roads, and other small features while maintaining spectral coherence. The enhanced imagery enables more accurate rooftop segmentation, which serves as input for comprehensive photovoltaic potential assessment.

The installation optimization integrates multiple factors including solar radiation data, atmospheric conditions, shading analysis, rooftop orientation, tilt angles,

and panel layout efficiency to maximize energy generation capacity while considering technical and economic constraints. Qualitative evaluation demonstrates high-quality visual enhancement, confirming the relevance of this resolutionenhancement step within the overall workflow dedicated to PV site suitability analysis and installation optimization under real-world environmental conditions.

A robust and transferable AI workflow for segmenting ground-mounted Photovoltaic Systems

Kilian Sperber, Jonathan Kraus, Henry Meissner, Ulf Kreuziger

OTH Amberg-Weiden

The given contribution describes an efficient artificial intelligence (AI) workflow for the detection and segmentation of ground-mounted photovoltaic (PV) systems in Bavaria (Germany), which can be transferred to any region. A two-stage approach was developed based on digital orthophotos (DOP) with a resolution of 20 cm (DOP20) or 100 cm (DOP100). Two different AI models, U-Net and YOLO, are used to identify and segment PV systems. The combined approach, which first analyses low-resolution DOP100 images and then uses targeted high-resolution DOP20 tiles, increases efficiency, by processing only relevant image areas with high resolution. Initial tests in three Bavarian districts show a high level of accuracy for both AI models. The approach is designed to be used for area-wide segmentation in Bavaria and thus contribute to change detection and quality assurance of the Digital Basic Landscape Model (ATKIS Base-DLM). Furthermore, the generalisation capability of the workflow was validated using an independent high-resolution dataset from the Piedmont region in Italy, where the models achieved promising recognition rates even without applying the post-processing pipeline.

Super-Resolution and Multi-Resolution Biomass Mapping from Coarse Labels via Weak Supervision and Spatial Priors

Rachel Nubwimana

University of Copenhagen, Denmark

We present a novel deep learning framework for above-ground biomass (AGB) estimation that produces high-resolution and multi-resolution biomass maps from coarse labels. The method is designed for the cases where dense pixel-level labels are unavailable. Using only 100 m scalar AGB values as supervision, our model predicts spatially detailed AGB maps at 100 m, 10 m, 3 m, and 1 m resolutions from PlanetScope imagery.

The task is formulated as a mass-conserving super-resolution problem, where each low-resolution label is reallocated over a high-resolution patch via learnable spatial weights. Our architecture is a lightweight encoder-decoder with four output heads, one per resolution scale. The final prediction is constrained to preserve total biomass per patch.

To guide spatial distribution without dense ground truth, we incorporate self-supervised learning (contrastive and equivariant losses), learnable pooling modules, and ecological priors such as NDVI/SAVI to suppress model hallucinations.

Trained on PlanetScope mosaics and ESA CCI-derived 100 m AGB maps, the model is evaluated on independent LiDAR-derived field plots. It explains 86% of the observed AGB variance (R² = 0.86) with only 2% bias, outperforming the baseline AGB map and recent CHM-based models in fine-scale detail.

This work demonstrates that both high-resolution and multi-resolution biomass mapping can be achieved from coarse supervision alone. It opens new opportunities for scalable AGB monitoring especially in data-scarce landscapes, with applications in ecological modeling, carbon stock estimation, and resolution-adaptive remote sensing.

A Multi-Stage Deep Learning Framework for Shadow Detection in Aerial Orthophotos

Yohei Kobayashi, Mitsuteru Sakamoto, Sho Nakamura, Toshiaki Satoh

PASCO, Japan

Shadow correction is an important preprocessing step not only for visual enhancement but also for improving object recognition performance in remote sensing imagery. Although many datasets and deep learning models have been proposed for shadow detection and removal, most of them focus on natural images. In contrast, high-resolution aerial orthophotos contain large continuous shadows caused by tall buildings, especially in urban areas, and existing models often fail to handle such large-scale structures effectively.

In this study, we construct a new shadow annotation dataset specifically designed for aerial orthophotos with spatial resolutions of 20 cm/pixel and 5 cm/pixel. Furthermore, we propose a three-stage multi-resolution segmentation framework that progressively refines shadow predictions from low to high resolution. Predictions from lower-resolution stages are used as auxiliary information to guide higher-resolution prediction. Experimental results demonstrate that the proposed approach improves fuzzy Intersection over Union (IoU) by approximately 0.05 compared with a previously published shadow detection model, and also outperforms a single-stage baseline, particularly for large continuous shadow regions. The framework is also applicable to other large-scale segmentation tasks requiring extensive receptive fields.

From Urban 3D Imagery to Low-Altitude Flight Risk Perception: A Construction Method for the Low-Altitude Flight Safety Zones of Surveying and Mapping UAVs and Its Application in Shanghai

Sijian Wang, Wen Zhang

Shanghai Municipal Insititue of Surveying and Mapping, China, People's Republic of

With the in-depth penetration of UAV technology in fields such as geographic information surveying and mapping, the urban low-altitude economy has ushered in a critical period of rapid development. Among these fields, the safety issues in geographic surveying and mapping are particularly prominent. UAVs in this field are mainly used for field data collection of geographic information products such as digital orthophoto maps (DOM) and 3D oblique models. They realize fully automated flight mode through pre-set route planning, which significantly improves operational efficiency and operational convenience. However, they are confronted with the core technical challenge of "how to accurately determine the safety of flight routes within the survey area". This issue has become a key bottleneck restricting the safe and efficient operation of surveying and mapping UAVs.

This study takes remote sensing images and 3D geographic data as core supports, and combines multi-source data fusion technology and related algorithms to construct the "low-altitude flight safety field for urban surveying and mapping UAVs", drawing on the concept of “low-altitude safety corridors”. In essence, this field is a standardized digital 3D spatial grid system that covers the airworthy area of urban surveying and mapping UAVs, features three-dimensional connectivity, and supports intelligent coding. Shanghai was selected as a typical research area for data testing and verification. The test results show that the data achievements of this system can efficiently provide flight safety guarantees for the operation of surveying and mapping UAVs.

MSCTFormer: A High-Resolution Water Body Extraction Network for Hyperspectral Remote Sensing Images Based on a Hybrid CNN-Transformer Architecture

Zhiyong Zhang¹, Bo Song^1,2, Bo Li¹, Qingyang Wang¹, Hong Huang¹

¹College of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541004, China; ²College of Environmental Science and Engineering, Guilin University of Technology, Guilin 541006, China

Precise monitoring of water resources is crucial for addressing global climate change. Water body extraction based on remote sensing imagery constitutes a core technical approach. Existing methods which based on CNN or Transformer (Chen et al., 2018; Gu et al., 2022; Lu et al., 2024), still encounter challenges when processing high-resolution imagery, including blurred boundaries, significant scale variations, and low computational efficiency. This makes it difficult to achieve a high degree of balance between accuracy and efficiency in water body extraction. To address these restrictions, this study proposes a residual network model integrating multi-scale contextual attention, called as MSCTFormer. It provides a novel approach for achieving high-precision and high-efficiency water extraction.

MCAM: A Multi-scale Cyclic Adaptive Mamba Network for Hyperspectral Image Classification

Yihang Zou¹, Lina XU¹, Yanni Dong²

¹Hubei Subsurface Multi-scale Imaging Key Laboratory, School of Geophysics and Geomatics, China University of Geosciences, Wuhan, 430074, China; ²School of Resource and Environmental Sciences, Wuhan University, Wuhan, 430079, China

This paper proposes the MCAM model to address key challenges in hyperspectral image (HSI) classification. The core of the model comprises a cyclic adaptive scanning module, which achieves multi-view feature fusion through dynamic weights, and a multi-scale convolutional block, designed to extract hierarchical spatial features. Combined with an improved loss function, the model significantly enhances the discriminative capability for confusing land-cover categories. Experimental results on several public datasets demonstrate that MCAM outperforms existing methods in classification accuracy.

Modular Fusion for Individual Tree Crown Delineation from Airborne LiDAR Data

Josue Gourde, Baoxin Hu, Qian Li

Department of Earth and Space Science and Engineering, York University, Toronto, Ontario, Canada

This paper proposes a modular fusion framework for delineating individual tree crowns from airborne LiDAR-derived canopy height models in a temperate mixed-wood forest in Ontario, Canada. Current instance segmentation models require expensive polygon annotations and tightly couple detection with segmentation, making cross-architecture fusion difficult. Limited forestry training data further causes transformer detectors to collapse on small datasets. The proposed framework decouples detection, fusion, and segmentation into independent stages. Two detectors, Faster R-CNN and DINO, are implemented with both ResNet-50 and domain-specific Masked Autoencoder backbones, with supplementary Finnish Taiga data stabilizing transformer training. A threshold-anchored score normalization maps each detector's confidence to a common scale before Weighted Box Fusion, enabling fair combination of architectures with incompatible confidence distributions. The fused bounding boxes prompt the Segment Anything Model (SAM) to generate per-tree polygon masks without domain-specific mask annotations. SAM's automatic mask generator additionally fills gaps where both detectors missed trees; SAM 1 is preferred over SAM 2, which produced fewer than half the automatic masks and missed smaller understory crowns. On two test plots with 233 and 107 ground truth trees, the framework achieves mask F1 scores of 0.79 and 0.61 at IoU thresholds of 0.25 and 0.50, matching 193 of 233 trees on the primary plot. Visual inspection indicates that many SAM-generated boundaries align more closely with canopy structure than the reference polygons. The modular design allows components to be independently replaced or upgraded, providing a practical pathway from LiDAR-derived CHMs to polygon-level crown delineation in data-limited forestry applications.

Remote Sensing Image Captioning via Dual-Stream Fusion and Spatial Relation-Aware Encoding

Rong Chen, Guorui Ma, Lunjun Fan

State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China

Remote sensing image captioning (RSIC) aims to describe key objects in remote sensing images using natural language, with significant applications in disaster assessment, land-use identification, and scene understanding. Existing methods face two critical challenges: insufficient cross-modal alignment due to the domain gap between generic visual representations and remote sensing semantics, and inadequate spatial relation modeling among regions in complex scenes, which compromises the semantic precision and logical coherence of generated descriptions. To address these issues, this paper proposes the Dual-Stream Relation-Aware Transformer (DSRAT) for remote sensing image captioning. On the visual encoding side, multi-scale CNN features serve as the foundation, fused with domain-specific semantic priors from RemoteCLIP through a gated dual-stream fusion module to achieve adaptive alignment of multi-source visual information. Subsequently, a spatial relation-aware mechanism is introduced into the encoder self-attention, which explicitly encodes geometric relationships such as relative position, distance, and orientation between regions as attention biases, enhancing the model’s capability for structured representation of complex spatial layouts and multi-object interaction scenarios. Finally, adaptive weighted aggregation of multi-layer encoder outputs generates discriminative cross-modal memory representations for the decoder. Experiments on the RSICD and NWPU-Captions datasets demonstrate that DSRAT achieves state-of-the-art performance across six metrics on RSICD and all seven metrics on NWPU-Captions. In particular, DSRAT achieves a significant performance improvement of +14.45 CIDEr on NWPU-Captions compared to the state-of-the-art method, validating the effectiveness of the proposed approach.

Evaluating a Weighted Ensemble of Deep Learning Models for Individual Tree Crown Delineation from LiDAR Data

Dylan Gaspar, Baoxin Hu, Qian Li

York University, Canada

This study investigates a weighted ensemble framework for individual tree crown (ITC) delineation using LiDAR-derived canopy height models (CHMs). Three deep learning models, Mask R-CNN, U-Net, and YOLO were first independently evaluated to establish the baseline performance under consistent training and evaluation conditions. A weighted ensemble was then constructed by combining model outputs through a voting‑based fusion scheme, with an exhaustive search performed across multiple weight configurations to identify the ones that maximize common evaluation metrics. While certain weighting configurations yielded improvements in quantitative measures such as intersection over union (IoU), recall, F1 score, and accuracy relative to individual models, qualitative analysis revealed that these gains often coincided with substantial under segmentation, manifested as large, merged crown regions. This discrepancy highlights the limitations of binary map voting for instance level delineation and indicates that metric driven ensemble optimization may not reliably reflect instance level segmentation quality. The findings suggest that more expressive fusion strategies may be necessary for effective ensemble based ITC delineation in future work.

Mapping sediment texture variability of carbonate beach sediments of Nogas Island using Sentinel-2 , hyperspectral spectroscopy, and granulometry

Og Locaba¹, James Cesar Refran¹, Patrick Fernandez², Sheila Mae Santander - de Leon²

¹Philippine Space Agency, Philippines; ²University of the Philippines Visayas

This paper presents an integrated approach using hyperspectral spectroscopy, granulometric analysis, and Sentinel-2 multispectral imagery for detailed mapping of carbonate beach sediments on Nogas Island, Philippines. By constructing a spectral library from field and laboratory data and employing the Spectral Angle Mapper (SAM) algorithm alongside the Grain Index, this study characterizes spatial variability in sediment grain size and carbonate composition. The methodology combines field sampling with remote sensing to generate maps that reveal sediment texture patterns influenced by hydrodynamics and depositional environments. The findings demonstrate that finer carbonate sediments exhibit higher reflectance and distinct spectral absorption features, enabling differentiation from coarser grains. This research highlights the potential of integrating multispectral satellite data with hyperspectral spectral libraries to provide rapid, reliable coastal sediment assessments critical for environmental monitoring, biodiversity conservation, and sustainable management of vulnerable tropical island beach systems.

Land Cover Classification of multi-Source airborne Data using conventional and deep-learning-based unsupervised Domain Adaptation

Edwin Deisling, Raphael Zipperer, Benedikt Kottler, Kevin Qiu, Melanie Böge, Dimitri Bulatov

Fraunhofer IOSB Ettlingen, Germany

For an increasing number of applications, land cover maps can be generated from remote sensing imagery using conventional and deep-learning-based semantic segmentation models.

Relying on a large pool of training data, the networks struggle with the spatial-temporal-spectral heterogeneity in the complex and diverse remote sensing imageries, leading to a significant number of errors in the model predictions.

This paper presents a workflow comprising domain adaptation and classification.

In particular, we analyze two domain adaptation techniques: First, a conventional histogram-matching method, which has turned out to be a surprisingly fast and reliable tool in a previous study, and second, a CycleGAN, which we applied both in its standard form and with the perceptual loss, thereby penalizing style inconsistencies on deeper layers.

By applying the workflow to three remote sensing datasets and six directions of domain adaptation, we show that there is ``no free lunch'' in the sense that all domain adaptation methods have their advantages.

Depending on the dataset, classification method, and especially on the availability of 3D data, the performance gap can be reduced to up to 1.5\% of the mean F1 score, demonstrating the soundness of the proposed method.

Road Segmentation from Satellite Imagery Based on an Improved SAM Model

Bingquan YAO, Haolin Liu, Wenjuan Mao, Jin Zhou

National Quality Inspection and Testing Center for Surveying and Mapping Products, China, People's Republic of

Road network is an important infrastructure of urban spatial structure and traffic system. Its accurate acquisition is of great significance for urban traffic analysis, automatic driving map construction and disaster emergency response. With the wide acquisition of high-resolution remote sensing images, automatic extraction of road masks from remote sensing images has become an important research direction in the field of remote sensing image understanding. However, the existing deep learning methods still face the problems of obvious modal differences and insufficient modeling of road structure continuity in remote sensing scenes. To solve the above problems, this paper proposes a remote sensing image road segmentation model LR-SAM based on SAM (Segment Anything Model). In this model, the LoRA (Low Rank Adaptation) fine-tuning strategy is introduced to achieve efficient parameter updating, and the MS multi-scale feature interaction module is designed in the coding phase to enhance the expression ability of the linear structure and fine-grained information of the road. At the same time, the original prompt encoder is removed and a lightweight ad decoder is constructed to achieve multi-scale feature fusion. In the reasoning stage, TTA (Test Time Augmentation) strategy is introduced to improve the stability and segmentation accuracy of the model. Experimental results based on chn6-cug and SAT-MTB datasets show that the proposed method achieves 97.20% and 85.06% mIoU and 96.67% and 84.94% F1-score, respectively, which is significantly better than the mainstream road segmentation method, and verifies the effectiveness of the proposed improvement points.

Research and Implementation of Key Technologies for High Resolution Satellite Image Instant Service System

Jin Liu, Ye Du

Beijing SatImage Information Technology Co., Ltd,, People's Republic of China

With the development of Earth observation technology, China's domestic high-resolution satellite remote sensing technique has achieved high-quality development. Currently, in orbit land resource satellites can obtain over 4500 images globally every day. With the explosive growth of data volume, traditional image processing method can not meet users demand for spatial information services with high frequency and large area. How to achieve automated image processing with massive data volume, and to provide real-time image services to users efficiently has become an urgent problem to be solved.

Combining UAV SAR Tomography and Photogrammetry to study an Active Volcanic Vent in Iceland

Alina V. Shevchenko¹, Eduardo Freitas², Christian Wimmer², Egill Á. Gudnason³, Magnus T. Gudmundsson⁴, Gro B.M. Pedersen⁵, Luciano P. de Oliveira⁶, Magdalena S. Vassileva¹, Daniel Müller¹, Dibakar K. Ritushree^1,7, Mahdi Motagh^1,7, Thomas R. Walter¹, João Moreira², Martin Herold¹, Frieder M. Knabe⁸, Gian C. Oré Huacles⁹, Hugo E. Hernandez-Figueroa⁹

¹GFZ Helmholtz Centre for Geosciences, Germany; ²Radaz S.A., Brazil; ³Iceland GeoSurvey ISOR, Iceland; ⁴University of Iceland, Iceland; ⁵Icelandic Meteorological Office, Iceland; ⁶Technology Innovation Institute, United Arab Emirates; ⁷Leibniz University Hannover, Germany; ⁸Wissenschaftsladen Potsdam e.V., Germany; ⁹University of Campinas, Brazil

The recent volcanic unrest on Iceland's Reykjanes Peninsula was an excellent opportunity to better understand volcanic processes and develop hazard mitigation strategies. The eruption was studied using various direct and remote-sensing techniques. Here, we present an innovative UAV-based TomoSAR approach application, combined with photogrammetry, to explore the external and internal structures of an active volcanic vent within the Sundhnúkur crater row, where nine eruptions have occurred since December 2023. The surveys were conducted on 20 May 2024 (12 days after the end of the March–May eruption) and on 1 August 2024 (40 days after the May–June eruption). For optical data collection, we used a DJI Mavic 3T quadcopter, equipped with an RGB camera and an infrared sensor. The radar data were acquired using a UAV-based interferometric SAR system, Explorer RD350, which is capable of collecting P-band data in helical-trajectory mode. The optical data were processed using the standard photogrammetric workflow, and the SAR data were processed using the Refractive Back Projection algorithm, which enabled the extraction of amplitude images as slices at given depths with a ground penetration of up to 20 m. Our results show that the higher-intensity areas in the subsurface images correspond to the vent's crater center, while the lower-intensity areas correspond to the slopes of its cinder cone, composed of loose volcanic material. We assume that the higher-intensity areas in the amplitude images represent structures of denser material at depth, e.g., a lava conduit within the volcanic cone.

Space–Time Analysis of Nighttime Light Intensity in Phoenix, Arizona (1992–2024)

Scotty Fogle, Zhiyong Hu

University of West Florida , United States of America

Analysis of the Phoenix area between 1992 to 2024, using DMSP-OLS and VIIRS Data.

Comparative Study of Edge Losses for Remote Sensing Image Super-Resolution

Minkyung Chung, Youkyung Han

Seoul National University of Science and Technology, Korea, Republic of (South Korea)

Image super-resolution (SR) techniques have achieved significant performance improvements with the advancement of deep learning. Accordingly, deep learning-based SR methods have become the mainstream approach in SR research and are widely applied across various fields, including remote sensing. However, most state-of-the-art SR studies are primarily driven by computer vision research and tend to focus on generating visually realistic images rather than preserving structural fidelity with respect to the input images. In remote sensing applications, maintaining structural fidelity is particularly important because SR outputs are often used in downstream analytical tasks such as object detection.

In this study, we investigate the use of edge loss to enhance the structural fidelity of SR images for remote sensing imagery. The effectiveness of edge loss was evaluated using multiple benchmark datasets on both convolutional neural network (CNN)- and generative adversarial network (GAN)-based SR models. Several representative SR network architectures and GAN training frameworks were employed to assess the impact of integrating edge loss into the training objective. The experimental results demonstrate that incorporating edge loss improves both the structural fidelity and perceptual quality of SR images. Among the evaluated edge operators, the Prewitt-based edge loss showed the most consistent improvements compared with the Sobel- and Laplacian-based edge losses. These results indicate that edge loss is an effective and easily implementable strategy for improving SR reconstruction quality in remote sensing imagery. Furthermore, it can be combined with other edge-aware techniques to further enhance perceptual quality.

A multi-granularity distributed parallel processing method for time-series InSAR and application to mapping ground deformation of whole China

Yonghong Zhang, Yonghui Kang, Hongan Wu, Jujie Wei

中国测绘科学研究院, China, People's Republic of

InSAR parallel processing become very attractive in recent years with the exponential growth of SAR data volume. Many InSAR parallel algorithms are deployed on cloud platforms with fixed hardware and network environments, or adopt a single granularity (e.g., scene-level or pixel-level), leading that the computing resources are not fully explored.

This research proposes a novel multi-granularity distributed parallel processing framework for time-series InSAR (TS-InSAR). The framework integrates three granularity levels (data granularity, task granularity, and algorithm granularity) and designs an adaptive scheduling strategy to dynamically adjust granularity based on task characteristics and computing resource status.

The proposed proposed multi-granularity parallel TS-InSAR processing framework has been employed to map ground deformation of the whole China territory annually since 2022, facilitating national-scale geohazard assessment.

Comparative Evaluation of Machine Learning Models for Gold Prospectivity Mapping: A Case Study from Labrador, Canada

Yahya Chebab¹, Kongwen Zhang¹, Shuyun Xie², Jiangtao liu³

¹University of the Fraser Valley, Canada; ²University of Geosciences, China; ³China Geological Survey, China

Machine learning has become an increasingly important tool for quantitative prediction of complex mineralization patterns, offering new opportunities for improving mineral prospectivity mapping. Recent studies have shown that algorithms such as neural networks, support vector machines, and gradient boosting can capture nonlinear relationships and integrate diverse geoscientific variables with high predictive power. At the same time, traditional knowledge driven approaches such as the fuzzy weights of evidence method continue to demonstrate competitive performance, especially in geologically heterogeneous regions. This study provides a comparative evaluation of four machine learning models including logistic regression, support vector machine, backpropagation neural network, and extreme gradient boosting, together with the fuzzy weights of evidence method. The analysis is applied to a distinct environmental and geological predictor dataset from Labrador, Canada, a region characterized by complex lithological variation and limited historical exploration data. The goal of the study is to assess the robustness, stability, and generalization ability of these methods when transferred to previously unused datasets and differing geological conditions. Model evaluation is performed using cross validation, feature importance analysis, and spatially aware performance metrics. The resulting prospectivity maps highlight similarities and differences among the algorithms and identify areas with high potential for gold mineralization. The findings provide insight into the strengths and limitations of machine learning and knowledge based methods for mineral exploration and support the development of reproducible and interpretable workflows for regional scale mineral prediction.

A hybrid framework for indoor UAV-based 3D point cloud segmentation

Mohsen Shahraki, Ahmed Elamin, Ahmed El-Rabbany

Department of Civil Engineering, Toronto Metropolitan University (TMU), Toronto, Ontario, Canada

Accurate segmentation of indoor 3D point clouds is essential for applications such as autonomous navigation, robotic interaction, and augmented reality mapping. Indoor scenes, however, remain difficult to segment due to clutter, occlusions, and repetitive structural patterns that often mislead conventional geometric or rule-based approaches. While deep learning models have improved segmentation accuracy by learning features directly from raw points, they typically require large annotated datasets and significant computational resources.

This paper presents SAMNet++, a hybrid segmentation framework that combines unsupervised segment generation with supervised refinement to achieve high accuracy while reducing annotation effort. In the first stage, a SAM-based LiDAR module—adapted from the Segment Anything Model—produces coarse, label-free segment proposals by leveraging fused LiDAR–RGB data. These proposals capture object boundaries and structural regions without manual labelling. In the second stage, a refined PointNet++ network enhances semantic precision and class consistency through targeted supervised learning.

To develop and evaluate the system, a dedicated indoor dataset was collected using a UAV equipped with a LiDAR sensor and an RGB camera, covering multiple rooms and corridor environments. Experimental results demonstrate that SAMNet++ outperforms state-of-the-art baselines in precision and F1-score, particularly when segmenting fine architectural details or navigating cluttered indoor spaces. With its balanced accuracy, efficiency, and reduced dependence on annotations, SAMNet++ offers a practical solution for real-time indoor mapping and scene understanding.

Prototype Design of a Data Warehouse for Determining, Mapping, Monitoring and Visualizing Urban Heat Islands: the Case of Zagreb and Split, Croatia

Andrija Krtalić, Ana Kuveždić Divjak

University of Zagreb Faculty of Geodesy, Croatia

The research presented in this paper focuses on monitoring the phenomenon of urban heat islands (UHI) and provides local authorities with decision-making assistance in preventing their occurrence or mitigating the consequences of existing ones.

This paper proposes the design of a prototype design data warehouse for structured management, integration and analysis of multi-source geospatial data related to UHI detection and mitigation, focusing on two major Croatian cities: Zagreb and Split. Research in this area is the result of two started projects about UHI.

The proposed system is expected to provide a consistent and scalable framework for managing the heterogeneous geospatial datasets needed to understand urban climatic conditions. By standardising data handling and building on open data sources, the system creates the conditions for robust analysis of UHI patterns and for the development of tools that can support both research activities and the operational needs of local authorities.

Designed as a foundation for future monitoring mechanisms, planning tools and mitigation strategies, the system also aims to encourage broader use of open geospatial data in environmental and urban-climate studies. Its reproducibility and transparency should contribute to establishing a stable framework for further research and for practical applications in climate-resilient urban development.

Development and Application of an Automated Full-Process Framework for Unauthorized Land-Use Parcel Verification Driven by a UAV Hangar System: A Case Study in Shanghai, China

Yang Li, Ming Hu, Wen Zhang

Shanghai Surveying and Mapping Institute, Shanghai 200063, P.R. China

Unauthorized land-use parcels are key targets in territorial spatial governance. Featuring diverse types, scattered distribution, strong concealment, traditional monitoring—satellite remote sensing with time lag and manual inspections with limited coverage—fails to meet the demand for rapid localization and verification.

This study proposes an automated verification framework driven by UAV hangars, integrating five links: intelligent scheduling, automatic data collection, real-time transmission, semantic interpretation, result dissemination. Adopting a "cloud-edge-terminal" architecture, it incorporates direct georeferencing, parcel segmentation, and improved A*+ algorithm-based path planning, achieving closed-loop automation of "detection-verification-evidence collection."

Field tests in Shanghai with 6 UAV hangar stations and 120 parcels showed 100% coverage, 75% less manual work, and adaptability to diverse scenarios. It addresses "slowness, omission, inaccuracy" in traditional workflows, providing a technical paradigm for data-driven territorial governance.

Long-term Analysis of Rainfall Variability and Gridded Precipitation Product Performance in Coastal Southeast China

Yuli Wang¹, Joseph Awange¹, Ting On Chan²

¹Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, 999077 Hong Kong (SAR), China; ²School of Geography and Planning, Sun Yat -sen University, 510275 Guangzhou, China

Accurate precipitation estimation is essential for hydrological applications and hazard monitoring in coastal regions, where complex terrain and strong land–sea interactions pose major challenges. This study investigates long-term rainfall variability and evaluates the performance of six gridded precipitation products—PERSIANN, IMERG, CHIRPS, ERA5-Land, GSMaP, and MSWEP—over the Guangdong–Hong Kong–Macao Greater Bay Area during 2001–2023. The results reveal pronounced spatial heterogeneity in precipitation trends: coastal subregions show a clear drying tendency, whereas the inland mountainous region remains comparatively stable. Despite these spatial differences, all regions exhibit synchronized interannual variability, suggesting the dominant influence of large-scale climatic drivers. All evaluated products successfully capture the unimodal seasonal cycle associated with the South China Monsoon, but notable discrepancies emerge during the peak rainy season, when intense convective rainfall leads to greater uncertainty. Among the six datasets, GSMaP and IMERG consistently outperform the others, showing higher correlation coefficients and lower RMSE across most months. In contrast, PERSIANN performs less reliably during low-intensity rainfall periods, while ERA5-Land systematically underestimates peak rainfall intensity. Overall, this study highlights the importance of region-specific evaluation of precipitation products in complex coastal environments and provides practical guidance for hydrological applications, hazard assessment, and disaster risk management.

An Early Detection Method for Heavy Rainfall Using Satellite Data

Seokhwan Hwang, Heejun Park, Jungsoo Yoon, Narae Kang

Korea Institute of Civil Engineering and Building Technology, Korea, Republic of (South Korea)

This study presents an operational framework for the early detection of heavy rainfall based on the temporal dynamics of Cloud-Top Temperature (CTT) observed by geostationary meteorological satellites. The central hypothesis is that a characteristic “rapid rise followed by a sharp fall” in CTT serves as a precursor signature of subsequent convective intensification, as verified by radar-observed rainfall surges. The temporal pattern is analytically decomposed into the rise–peak–fall–trough phases, and the temperature drop amplitude (swing) between the peak and trough is quantified to define the WATCH (Warning and Threshold-based Convective Hotspot) window that indicates potential heavy-rain development. Two categories of lead time are formulated: the observed lead time, representing the exact temporal offset between the onset of CTT cooling and radar-detected rainfall intensification; and the estimated lead time, inferred from the gradient of the CTT decrease when radar data are unavailable or delayed. An edge-enhancement algorithm is implemented to minimize omission at the temporal boundaries, while adaptive thresholding and regional calibration enhance the algorithm’s transferability across diverse climatic and topographical environments. The proposed method is designed for real-time satellite operations and can be seamlessly integrated into existing satellite-radar hybrid nowcasting systems. By detecting convective growth phases preceding radar reflectivity increases, the method extends the effective warning lead time and improves the reliability of short-term rainfall forecasts. The findings demonstrate that CTT-based dynamic monitoring provides a physically consistent and computationally efficient tool for flash-flood preparedness, early warning, and rapid situational awareness in operational meteorological and hydrological applications.

Can 2000–2024 Daily Historical Records Alone Project Next-Year Wildfire State Transition? A Case Study in British Columbia, Canada Using a Conditional Categorical Generative Model

Yimin Zhu, Zhengsen Xu, Lincoln Xu

University of Calgary, Canada

this paper, we define a new wildfire risk prediction task from the perspective of wildfire state transition of next year, and hence, propose a novel approach named Wildfire State Transition Discrete Diffusion Model (WildfireSTDDM), that can directly capture the high-dimensional distribution of wildfire risk only through available and on hand historical wildfire events, with the following characteristics: (1) A 25-year-long-term daily wildfire historical record for British Columbia (BC) province, Canada is built deriving from the Fire Information for Resource Management System (FIRMS) with $10\text{km} \times 10\text{km}$ spatial resolution, using spatial aggregation. We define four wildfire state transition types based on the presence or absence of fire in a three-year historical period versus the fourth year: Persistent no-fire, New ignition, Fire cessation, and Persistent fire. (2) The proposed model can capture the categorical distribution of wildfire state transition type conditioning on the historical records and is trained in an end-to-end fashion, contributing to less cumulative error. (3) The proposed model can generate a high confidence map of next year's wildfire risk only through the long-term daily historical wildfire event without any other driving factors, and also correlate with the complex and stochastic wildfire pattern. (4) Since our model depicts the discrete wildfire state of each pixel forward as a discrete-time-inhomogeneous stochastic process, making it well-suited for characterizing next year's wildfire state transition uncertainty in model projections by performing multiple posterior sampling through Monte Carlo.

Remote Sensing Image Strip Removal Technology Based on the Ultralytics Model

远姚, 莉冯, 景秋罗

Hohai University, China, People's Republic of

This study proposes a stripe removal method for remote sensing grayscale images based on ultralytics. First, we have got images from GEE, and stripes were annotated via Label Studio. Second,we have trained the ultralytics model with the annotated dataset, and adopting the best weights combined with pre-trained model for new image annotation. Finally, for stripe removal, the trained model detected stripe regions in remote sensing images and located their bounding box coordinates. Non-stripe areas were marked, with the largest normal area selected as the reference. Stripe region pixel data were segmented using detected bounding boxes, followed by histogram matching between stripe regions and the reference area to align grayscale distribution. Corrected stripe regions were replaced back to original positions to generate and save stripe-free images. This method achieves accurate stripe detection and effective grayscale correction, providing a reliable solution for remote sensing image preprocessing.

GEMAUT (2006–2026): A Brief History of a Robust and Open-Source Tool for the Automatic Generation of High-Resolution Digital Terrain Models from Satellite-Based Surface Models

Nicolas Champion

IGNF, France

This contribution presents GEMAUT, a robust and open-source tool dedicated to the automatic generation of Digital Terrain Models (DTMs) from high-resolution satellite-based Digital Surface Models (DSMs). The paper provides a historical overview of the methods used for DTM extraction over the past twenty years, from early morphology-based filters to physically based optimization models and recent deep learning approaches. This retrospective is complemented by an analysis of the evolution of Earth-observation sensors, whose increasing spatial resolution now enables the application of LiDAR-oriented ground-filtering techniques directly to satellite DSMs.

The latest version of GEMAUT removes one of the main limitations of earlier implementations by eliminating the need for an external ground mask. Ground points are automatically extracted from the DSM using either the slope-based filter implemented in SAGA or the Cloth Simulation Filter available in PDAL. The terrain is then reconstructed through an energy-based surface optimization approach that combines robust data fidelity terms with curvature-based regularization.

A second major contribution is the introduction of a fully automatic quality assessment module. By analysing local DSM–DTM elevation differences, GEMAUT produces a spatialized precision mask that estimates the relative vertical accuracy at pixel level. This capability supports reliable quality control in operational and industrial workflows.

The tool has been fully refactored, relies exclusively on open-source libraries, and is publicly released on GitHub to encourage transparency, reproducibility, and collaboration within the ISPRS community.

Using NGRDI index to assist in forest canopy gaps classification of UAV RGB imagery

Yu-xuan Zhan^1,2, Yong-Jun Zhang³, Chun-Hong Wei³, Jan-Chang Chen³

¹R&D Center, National Pingtung University of Science and Technology(NPUST), 1, Shuefu Road, Neipu, Pingtung 912301, Taiwan (R.O.C.); ²Doctoral Program in Bioresources, National Pingtung University of Science and Technology(NPUST), 1, Shuefu Road, Neipu, Pingtung 912301, Taiwan (R.O.C.); ³Department of Forestry, NPUST, 1, Shuefu Road, Neipu, Pingtung 912301, Taiwan (R.O.C.)

The formation of canopy gaps alters forest microclimates, influencing understory regeneration, soil organic matter decomposition, and nutrient cycling, thereby playing a crucial role in forest ecology. Traditional methods for detecting canopy gaps typically rely on multispectral imagery or LiDAR data, which are accurate but costly and technically demanding. In recent years, several studies have explored the feasibility of using UAV-based RGB imagery for gap detection. This study utilized UAV RGB imagery to analyze the temporal dynamics of canopy gaps to assess the feasibility of employing RGB-based vegetation indices for canopy gap detection. The Normalized Green–Red Difference Index (NGRDI) combined with DSM differencing was used for analysis. Results show that when NGRDI < 0.03, forest areas can be effectively categorized into two classes: “canopy gaps” and “canopy cover.” The overall classification accuracy reached 93% with a Kappa coefficient of 0.68. However, the omission error was 44.44%, which suggesting that the model requires improvement in detecting small or edge gaps. It is recommended that identified threshold is used as a preliminary criterion for “canopy versus non-canopy” classification, supplemented with DSM or CHM data to improve detection accuracy.

Using Deep Learning–Extracted Road Networks for More Accurate Small Satellite Geometric Correction

Aleš Marsetič^1,2, Peter Pehani¹, Nina Krašovec^1,3

¹ZRC SAZU, Novi trg 2, 1000 Ljubljana, Slovenia; ²SPACE-SI, Aškerčeva 12, 1000 Ljubljana, Slovenia; ³Faculty of Civil and Geodetic Engineering, University of Ljubljana, 1000 Ljubljana, Slovenia

Imagery from small satellites has been available for decades, yet automatic and accurate geometric correction remains a persistent challenge, especially when dealing with imagery which exhibit higher radiometric variability and a lower signal-to-noise ratio. This study introduces an enhanced version of the geometric processing module within the STORM processing chain, designed to perform fully automated orthorectification of images from small satellites. The module leverages publicly available ancillary data and deep learning-based road extraction techniques to eliminate the need for manual data collection and preprocessing. Ground Control Points (GCPs) are automatically generated by matching roads extracted from satellite imagery with corresponding vector roads obtained from open-access web databases. The orthorectification pipeline integrates several key components: ancillary data preparation, road extraction, GCP extraction, and final orthorectification using a digital elevation model. Experimental results on NEMO-HD small satellite imagery demonstrate that the proposed method can achieve accuracies of less than two pixel. The integration of deep learning for road detection provides a novel and effective approach for the fully automated orthorectification of satellite data of various types.

A Dual-Task Optimization Approach for Digital Elevation Model Correction with Spaceborne LiDAR Data

Yibo Ling, Ting On Chan, Wing Sum Yan

School of Geography and Planning, Sun Yat-sen University, China, People's Republic of

Digital Elevation Models (DEMs) are essential for terrain analysis and environmental applications, yet freely available global DEMs such as the Shuttle Radar Topography Mission (SRTM) DEM often contain noticeable elevation errors. Recent advances in space-borne LiDAR, particularly Ice, Cloud, and land Elevation Satellite-2 (ICESat-2), provide highly accurate elevation observations for DEM correction. However, most existing studies treat DEM correction as a single regression task and pay limited attention to correction direction, although direction errors may further degrade the corrected DEM. To address this issue, this study proposes a dual-task optimization framework for DEM correction using ICESat-2 data and auxiliary topographic and environmental variables. The network includes a shared feature extraction backbone, a regression branch for estimating correction values, and a classification branch for predicting whether DEM elevation should be increased or decreased. Kent County, New Brunswick, Canada, was selected as the study area, where 35,823 ICESat-2 elevation points were used for model training and validation. Results show that the proposed method outperforms Random Forest, XGBoost, and a conventional deep neural network, achieving a root mean square error (RMSE) of 1.76 m, a mean absolute error (MAE) of 1.37 m, and a direction consistency rate (DCR) of 75.05%. Compared with the original SRTM DEM, the corrected DEM reduces RMSE and MAE by approximately 27.6% and 25.9%, respectively, and improves DCR by 1.66% over the conventional deep neural network (DNN). These results demonstrate that incorporating correction direction into the learning process can effectively improve DEM correction accuracy and directional reliability.

A comparative framework for deriving True Tree Crown (TTC) from Pseudo Tree Crown (PTC)

Namya Kamboj¹, Frank Zhang¹, Baoxin Hu²

¹University of the Fraser Valley, Abbotsford, Canada; ²York University, Toronto, Canada

Recent advances in UAV-based remote sensing have made high-resolution 2D imagery widely available, however the extraction of 3D tree structure from such data remains a primary challenge. This paper presents a novel framework for deriving True Tree Crown (TTC) geometry from Pseudo Tree Crown (PTC) representations, through a graph-based learning model. The PTC is generated from single nadir RGB images by interpreting grayscale intensity as height. This serves as an intermediate 2.5D representation that bridges the gap between conventional imagery and full 3D structure. We establish a spatial correlation between PTC and LiDAR-derived TTC meshes using geometric feature extraction and correspondence analysis. Preliminary results on synthetic data demonstrate a strong correlation between PTC and TTC height distributions, confirming that PTC encodes meaningful structural information. To learn the mapping from PTC to TTC, we propose a Graph Neural Network architecture with three GraphConv layers (64 – 128 – 256 channels), residual connections, and a composite loss function combining Chamfer distance with Laplacian and edge regularization. This framework enables the estimation of complete 3D tree crowns from single RGB images, transforming vast historical 2D image archives into valuable 3D forest data for ecological monitoring, carbon accounting, and sustainable forest management.

Comparison Between Unmanned Aerial Vehicle (UAV) and RTK-GNSS Surveying Methods for DEM Generation in Wetlands

KEHINDE HASSAN BABALOLA, Kevin Musungu, Desmond Radamba

CAPE PENINSULA UNIVERSITY OF TECHNOLOGY, South Africa

Advancements in unmanned aerial vehicle (UAV) technology have enhanced remote sensing and photogrammetry, enabling high-resolution mapping of terrain. This study evaluated the accuracy of digital elevation models (DEMs) derived from UAV-based structure-from-motion (SfM) photogrammetry by comparing them with real-time kinematic global navigation satellite system (RTK GNSS) survey data in the Steenbras Lower Dam wetland catchment, Cape Town, South Africa. High-resolution RGB imagery was captured using a DJI Phantom 3 UAV at an altitude of 35 meters above the highest terrain point, with a ground control network shared with the GNSS survey. Pix4D software was used to reconstruct the terrain, producing digital surface models, orthophotos, and ultra-high-resolution point clouds. Accuracy was assessed using 1,502 corresponding points. Initial metrics were affected by tall vegetation in the northern and southern periphery of the wetland. After filtering out absolute differences exceeding 0.5 m, the median elevation difference decreased from 0.464 m to 0.222 m, the median difference reduced from 0.344 m to 0.217 m, and the RMSE dropped from 0.605 m to 0.260 m. These results demonstrate that UAV-derived DEMs provide reliable and precise topographic information for wetland catchment mapping.

Exploring the Potential of Non-invasive Geospatial Tools for Initial Investigations of Archaeological Sites: A Case Study of Dholavira, Gujarat

Bhanu Priya Chouhan¹, Sourav Kumar¹, Hina Pande¹, Poonam Seth Tiwari², Shashi Kumar¹, Shefali Agrawal³, Suresh Kannaujiya⁴

¹Photogrammetry and Remote Sensing Department, Indian Institute of Remote Sensing, India; ²Geoweb Services, IT & Distance Learning Department, Indian Institute of Remote Sensing, India; ³Geospatial Technology & Outreach Program, Indian Institute of Remote Sensing, India; ⁴Geosciences Department, Indian Institute of Remote Sensing, India

Dholavira, India’s second-largest Harappan site after Rakhigarhi, dating from 3000–1500 BCE, is renowned for its sophisticated water management system and has attracted significant archaeological interest since its discovery in 1968. Despite decades of conventional surveys, many structures remain unidentified, constraining spatial understanding of the site. This study develops a multi-sensor, multi-platform framework using active and passive datasets (optical, microwave, and LiDAR) from satellite, UAV, and ground-based sources to support improved documentation and analysis of archaeological features. Earth Observation (EO) datasets were processed to identify surface anomalies using multi-sensor analysis, while Synthetic Aperture Radar (SAR) data were used to delineate potential subsurface zones for subsequent GPR investigations. UAV-LiDAR data were utilized to enhance high-resolution 3D surface mapping of the site. Guided by satellite-derived anomalies, Ground Penetrating Radar (GPR) surveys were conducted at selected locations to investigate subsurface features. The GPR results revealed shallow hyperbolic reflections and stratigraphic discontinuities up to ~1.5 m depth, indicative of buried structures and disturbed ground conditions, with depth estimates derived using an assumed velocity model for dry sandy soils. Terrestrial Laser Scanning (TLS) enabled high-resolution three-dimensional reconstruction of excavated structures, showing close agreement with Archaeological Survey of India (ASI) records. The results demonstrate an effective and interpretable framework for archaeological prospection and multi-scale analysis, with future potential for integrating machine learning to advance systematic site analysis and digital heritage conservation.

Temporal Spectral Dynamics of Runway Surfaces Using Multi-Year Sentinel-2 Imagery for Infrastructure Condition Assessment

Ishtiaq Ahmed, Umesh Kumar Sharma, Pradeep Kumar Garg, Aditya Kumar Thakur

Indian Institute of Technology Roorkee, India

Runway surface deterioration poses critical challenges for aviation safety and maintenance planning. Traditional inspection techniques are often labor-intensive and localized, lacking temporal continuity for assessing long-term degradation. Previous studies have primarily focused on pavement visual distress or thermal imaging, leaving a significant gap in non-destructive, satellite-based monitoring of runway condition using multispectral data.This study addresses that gap by employing multi-year Sentinel-2 Surface Reflectance imagery (2021–2025) to evaluate surface degradation of the Deoghar Airport runway. Six spectral bands (B2, B3, B4, B8, B11, B12) were analyzed to compute four spectral indices—Aggregate Degradation Index (ADI), Composite Condition Index (CCI), Surface Reflectance Index (SRI), and Thermal Stability Index (TSI). Temporal mean composites for each January were generated and analyzed for pixel-wise trends. Results revealed from 2021 to 2025, ADI decreased from 0.0876 to 0.0789, CCI increased from -0.2069 to -0.1718, SRI rose from 1.5171 to 1.6484, and TSI improved from -0.0158 to -0.0059, indicating overall runway surface stabilization with gradual roughness increase. A mean degradation rate of 0.010 year⁻¹, with 93.5% of pixels in the moderate class, 4.3% in high, and 2.2% in critical condition. The B12 band showed the maximum mean change (289.73), while B2 exhibited the most statistically significant trends (p < 0.05 for 72.1% pixels). The findings confirm that spectral reflectance indices effectively capture physical and chemical surface transformations. This method provides a scalable, non-destructive framework for continuous monitoring of runway health and supports predictive maintenance decision-making for sustainable infrastructure management.

Forest Regeneration Assessment By Integrated Index And Remote Sensing In Semi Arid Land In The North West Of Algeria

Ahmed Zegrar, Nadjla Bentekhici, mohamed Ghabi

Centre of Spatial Techniques, Algeria

The ecological analysis of desertification requires knowledge of post fire regeneration in the mid-step, influenced by topographic conditions and climate parameters. The North West regions of Algeria are affected each summer by violent forest fires which last over several days and affects woodlands, natural forests and reforestation. Usually NDVI is used, other derived index from radiometric data in remote sensing are widely used to monitor vegetation dynamics. The aim of this study is to determine the fire severity and monitor vegetation recovery with using multitemporal spectral indices together with topographical factors, and to recognise the different regeneration patterns of each burnt area. Several variables (such as climat, lithology, slope, aspect) were considered in order to analyse their possible relationship with the recovery process. Some of these variables showed a significant effect over the regeneration time, although further analyses seem still needed. Pre-fire and post-fire Landsat images and Alsat, were obtained to assess the related fire severity with using the widely-used Normalized Vegetation Index (NDVI) and modified Soil Adjusted Vegetation Index (MSAVI); Ratio vegetation index (RVI), and the index of regeneration (RI), to determine vegetation regeneration dynamics for period (2005-2007-2009 and 2015). Analysis showed that north-facing and east-facing slopes have higher regeneration rates in compared to other aspects. In addition, analysis of NDVI and RI stratified by pre-fire vegetation conditions and post-fire burn severity estimates could also be beneficial. And in this context post fire regeneration and topographics aspects are most important to ecological analysis of desertification in semi arids areas.

Investigating the Relationship Between Urban Heat Island Effect and Its Influencing Factors: A Case Study of Perth

Kelzang Dema¹, Petra Helmholz¹, Philip Burton²

¹Spatial Sciences, School of Earth and Planetary Sciences (EPS), Curtin University, Perth; ²Open Space Design Australia (OSDA), Perth, Western Australia

Urbanisation is accelerating globally and is a defining feature of modern cities. In 2016, 55% of the global population lived in cities, projected to reach nearly 70% by 2050. Rapid urban and population growth pose major challenges for sustainable development. By 2030, global urban land cover is expected to reach 1.2 million km²—three times that of 2000. This transformation involves significant Land Use Land Cover (LULC) changes, often converting natural vegetation into impervious surfaces like buildings and roads.

Urbanisation strongly correlates with rising Land Surface Temperature (LST) and intensified Urban Heat Island (UHI). Despite global attention to UHI, few studies have examined the spatio-temporal dynamics of LST in relation to recent urbanisation trends in Perth, Australia. As the city undergoes rapid suburban expansion and faces increasingly hotter summers, it is vital to understand how new urban development affects thermal patterns.

This study aims to address this gap by:

1. Identifying and delineating the areas of new development in Perth between 2005 and 2024,

2. Analysing and comparing LST patterns between long-established older and newly developed areas

3. Investigating the relationship between LST and its contributing factors, such as building and population density, tree canopy cover, surface moisture, albedo, and proximity to rivers

To achieve these aims, the study evaluates urban expansion between 2005 and 2024 and quantifies thermal differences using multi-temporal Landsat-derived LST.

A Multimodal and Multitemporal Deep Learning Semantic Segmentation Method based on Variational Autoencoder for Multimodal Remote Sensing Image Time Series

Luca Bergamasco¹, Marius Guichard², Mauro Dalla Mura², Francesca Bovolo¹

¹Fondazione Bruno Kessler, Italy; ²Institut polytechnique de Grenoble, France

Multimodal Remote Sensing (RS) methodologies have been increasingly studied in recent years due to their capacity to analyze multimodal RS data acquired from different sensors, thereby providing improved temporal resolution and extracting richer information than single-modal RS data. Deep Learning (DL) methodologies have accelerated the study of multimodal RS methods, thanks to their ability to learn features during training automatically. Many multimodal DL methods exploit this capability to learn a shared domain across modalities. However, most of them struggle to align heterogeneous modalities in a common representation. For this reason, we propose a supervised multimodal DL method that analyzes image time series acquired by different sensors to perform semantic segmentation. The proposed DL method is based on a Variational Autoencoder (VAE) that models the spatio-temporal information of the multimodal input image time series, with encoders and decoders composed of 3D convolutional layers, and learns the probability distributions for each modality. The probability distributions are combined to derive a joint distribution used for semantic segmentation. Learning the joint probabilistic distribution is achieved by combining the probabilistic parameters across modalities using a Product of Experts (PoE) approach. The feature maps derived from the obtained latent space are processed through three decoders. Two decoders aim to reconstruct the input multimodal image time series. The third decoder performs a semantic segmentation based on the inputs. Experiments conducted on the MultiSenGE and Austria datasets, which comprise Sentinel-1 and Sentinel-2 image time series acquired in France and Austria and representing heterogeneous classes, yielded promising results.

Mapping Surface Area Changes in Three Major Reservoirs on the Island of Trinidad between 2017 and 2023 using Sentinel-1 SAR Imagery

Deanesh Ramsewak¹, Arvind Jagassar², Shiel Williams³

¹University of Portsmouth, United Kingdom; ²British Columbia Institute of Technology, 3700 Willingdon Ave, Burnaby, BC V5G 3H2, Canada.; ³The Centre for Maritime and Ocean Studies, The University of Trinidad and Tobago, Trinidad and Tobago

Rapid urbanization and climate change have the potential to negatively affect water availability in the coming decades. The Caribbean region is particularly at risk since, among other factors, large water storage facilities are not as abundant as in larger nations. It is imperative therefore, that water resources in the small island nations of this region are efficiently managed and monitored. Recent open-source, satellite earth-observation capabilities and data have presented additional tools for managers of this critical resource to better manage water and water infrastructure. In this study, we demonstrate the capacity of utilizing Synthetic Aperture Radar (SAR) data from the Sentinel-1 satellite for mapping surface area changes in three reservoirs on the island of Trinidad using a Google Earth Engine (GEE) framework. Sentinel-1 data was processed using GEE to produce average reservoir surface area calculations for each season (wet and dry) of each year for the period 2017-2023. The resultant reservoir surface area values were cross referenced against average seasonal precipitation values obtained from the CHIRPS (Climate Hazards Group Infra-Red Precipitation with Station data) database. The approach used in this study can be integrated into existing water resource monitoring frameworks to improve efficiency at little to no additional cost.

Monitoring the Spatial Dynamics of Mikania micrantha During the Flowering Season Using Multi-epoch UAV Imagery: A Case Study North of Liyu Lake, Hualien, Taiwan

Yu-Cheng Huang¹, Yi-Ta Hsieh², Ping-Hung Chiang¹, Bo-Yi Lin¹, Jan-Chang Chen¹

¹National Pingtung University of Science and Technology, Taiwan, R.O.C.; ²National Ilan University, Taiwan, R.O.C.

Mikania micrantha is one of the most aggressive invasive alien plant species in low-elevation landscapes of Taiwan. This study used fixed-wing UAV imagery to monitor its flowering-season distribution in a primary monitoring area north of Liyu Lake, Hualien County, eastern Taiwan. Rather than treating the dataset as a continuous annual time series, the analysis was based on three flowering-season observation epochs acquired on 14 January 2021, 7 December 2021, and 4 January 2024. UAV imagery was collected using an eBee X platform and processed in Pix4Dmapper Pro to generate high-resolution RGB orthomosaics with an average ground sampling distance of 3.08 cm/pixel. M. micrantha patches were delineated through manual image interpretation, and kernel density estimation (KDE) was applied to evaluate changes in spatial concentration and hotspot distribution. The interpreted infestation area decreased from 2,094.74 m² in the first epoch to 1,361.94 m² in the second, then increased to 1,799.09 m² in the third. KDE results showed a similar pattern, with persistent core infestation zones and renewed expansion in surrounding areas, including a new hotspot in the southeastern part of the monitoring area. These findings demonstrate the practical value of UAV-based monitoring for adaptive invasive plant management.

Noise-Aware Data Augmentation for Robust Road Detection in Small Satellite Imagery

Nina Krašovec^1,2, Aleš Marsetič^1,3

¹ZRC SAZU, Novi trg 2, 1000 Ljubljana, Slovenia; ²Faculty of Civil and Geodetic Engineering, University of Ljubljana, 1000 Ljubljana, Slovenia; ³SPACE-SI, Aškerčeva 12, 1000 Ljubljana, Slovenia

This presentation examines how to improve automatic road extraction from small-satellite images, where image quality is often limited by lower SNR and higher radiometric variation. The study tests whether data augmentation with noise and blur during pretraining can make deep-learning models more robust under these challenging conditions. Using a two-stage transfer-learning setup, a U-Net with a ResNet-50 encoder was first pretrained on PlanetScope RGB imagery and then fine-tuned on data from NEMO-HD, a Slovenian microsatellite mission. Several types of synthetic noise and blur were evaluated at different intensity levels.

Machine Learning for Marine Dock Detection Using LiDAR Intensity and Detectron2

Harald Steiner, Olena Pylypenko, Louise Ethier

Provincial Government of BC, Canada, Canada

The availability of high-resolution LiDAR data and advances in machine learning have opened new possibilities for automating coastal infrastructure mapping. This work presents a streamlined workflow for detecting marine docks using LiDAR intensity data and Detectron2, a state-of-the-art convolutional neural network framework. The approach integrates intensity normalization, scan-angle correction, and transfer learning to improve detection accuracy across diverse environments. Applied to LiDAR tiles from British Columbia’s Sunshine Coast, the method achieved detection rates of 70–80%, significantly reducing manual digitization effort. While recall remained high, variability in precision and segmentation accuracy highlights challenges in geometric alignment. The proposed workflow offers a scalable, data-driven solution for marine infrastructure mapping, supporting applications in coastal planning, environmental monitoring, and emergency response. Future work will explore 3D kernel point convolutions to enhance spatial accuracy and leverage elevation gradients directly from point clouds.

From Satellite to Simulation: An AI-Driven Pipeline for Rapid, Reality-Based Aeronautical Environments

Sylvain Baradat, Alexis Barot, Thomas Bonfort, Sébastien Bonopera, Sébastien Bosch, Pierre Castets, Benjamin Cepas, Emmanuel Garcia, Alexandre Mayerowitz, Jérémie Noel, David Souveton, Michael Tonon, Jeremy Yviquel

Airbus Defence & Space, France

The aerospace sector urgently requires high-fidelity, real-world simulation environments that are both current and reactive, a challenge traditional workflows fail to meet. We present a fully automated, cloud-based pipeline developed by Airbus Defence & Space to produce trustworthy, reality-based aeronautical simulation data at a global scale.

Our core innovation is the automated co-extraction of a complete object stack—including precise building footprints, vegetation, and road networks—from the same Very High Resolution (VHR) satellite imagery source. This process, leveraging a multi-model deep learning approach based on foundation model paradigms, guarantees absolute spatial and temporal coherence across all extracted features.

The extracted features are then processed to generate high-fidelity LoD 2.1 3D geometry. This is achieved using a robust geometric framework and RANSAC-based plane fitting to reconstruct complex roof structures, delivering watertight volumes and filtering out photogrammetric noise. The pipeline is fuelled by the agile Pléiades Neo constellation and will be further reinforced by the four-satellite CO3D constellation, drastically improving revisit rates and ensuring data currency.

Operational validation on a 1000 km² diverse test area confirmed the system’s scalability, achieving full Digital Twin dataset generation in under 24 hours. This workflow effectively bridges the gap between raw satellite acquisition and actionable, high-fidelity simulation environments.

Finding DEM0: A Zero-Shot Depth Maps Calibration Framework for Generating Digital Elevation Models

Riccardo Contu^1,2, Andrea Nascetti³, Roberta Ravanelli⁴, Valerio Marsocci², Mattia Giovanni Crespi^1,5

¹Department of Civil, Building and Environmental Engineering, Sapienza University of Rome, Rome (RM), Italy; ²ESA, Φ-lab, Largo Galileo Galilei 1, Frascati (RM), 00044, Italy; ³Division of Geoinformatics, KTH Royal Institute of Technology, 10044, Stockholm, Sweden; ⁴Geomatics Unit, Department of Geography, University of Li`ege, Li`ege, Belgium; ⁵Sapienza School for Advanced Studies, Sapienza University of Rome, Rome, Italy

Accurate terrain elevation information is fundamental for geospatial analysis and environmental monitoring. Traditional 3D survey

methods such as LiDAR and photogrammetry provide high accuracy, but are costly, time-consuming, and limited in temporal

coverage. This work introduces Finding DEM0, a zero-shot framework that converts monocular depth predictions from foundation

models into metrically calibrated Digital Elevation Models (DEMs) without requiring supervised training. The approach leverages

the geometric consistency of DepthAnything V2 and anchors it to global elevation references from the Copernicus DEM and GEDI

LiDAR data through a linear regression-based calibration. Experiments conducted on around 2,500 tiles throughout the French

territory show consistent improvements over resampled Copernicus DEM baselines (approximately 1.5 m in vegetated areas and

more than 2.0 m in urban regions). The framework thus enables frequent, low-cost DEM updates using only high-resolution optical

imagery, eliminating the need for repeated airborne LiDAR/photogrammetric acquisitions and facilitating continuous and precise

elevation monitoring.

A Dual-Branch Deep Learning Framework for Social-Media-Driven Wildfire Verification and Precise Location Correction

Xuewen Yang^1,2, Yonghua Qu^1,2, Dongyi Liu^1,2

¹beijing normal university, Beijing, People's Republic of China; ²State Key Laboratory of Earth Surface Processes and Disaster Risk Reduction, Faculty of Geographical Science, Beijing Normal University, Beijing,People's Republic of China

Wildfires are among the most destructive natural hazards, posing significant threats to ecosystems, infrastructure, and human life. While satellites provide objective information for burned-area assessment, their temporal resolution is insufficient for immediate response. Conversely, social media offers rapid eyewitness reports but suffers from limited reliability, vague descriptions, and spatial inaccuracy.

To bridge this gap, this study presents a hybrid verification framework that integrates social-media-derived event information with remote-sensing imagery and deep learning. The aim is to automatically confirm fire occurrence and refine coarse social-media coordinates to pixel-level accuracy. The major innovations include:

A large-scale GEE hierarchical search to locate possible burned regions.

A dual-branch deep learning model that performs change detection with pre- and post-fire Sentinel-2 patches.

A centroid regression mechanism enabling precise geolocation correction.

A Global Wind Turbine Detection Framework Using Optical-Imagery under Installation Suitability Constraints

Yongjiu Feng, Rong Wang, Zhenkun Lei

Tongji University, China, People's Republic of

With the increasing global attention to clean energy, wind turbines (WTs) play a vital role in addressing both the greenhouse gas emissions and long-term energy sustainability. Nevertheless, accurately detecting the WT installations form remote sensing images remains a challenge. Existing data sources, such as the WT points of interest (POI) from OpenStreetMap (OSM), rely primarily from volunteer contributions are often incomplete or inconsistent, limiting their reliability for scientific assessment. This study proposes a global WT detection method form high-resolution remote sensing imagery via yolov8 deep learning model. The key contribution lies in constructing a WT installation suitability map based on multi-source spatial data, which reduce the search area by 38.99%, and improve the efficiency of global WT identification. In addition, to mitigate the challenges of small-target recognition in high-resolution remote sensing images, a method incorporating projection deformation of image regions is introduced. Using this method, more than 400,000 WT targets worldwide were successfully identified. Compared with OSM records, the method achieved an accuracy of 91.67% and revealed 48,688 newly installed WTs. This work provides a valuable tool for evaluating both the current status and future potential of global wind energy development, thereby supporting sustainable energy transitions.

Global 30-m annual urban fractional green Vegetation Cover Dataset from 1984 for over 60,000 urban Areas

Haoran Cheng, Yuhong He

University of Toronto, Canada

Reliable, comparable measures of urban green cover are essential for a sustainable urban future. We construct a global, annual 30-m fractional green vegetation cover (FGVC) dataset covering over 60,000 urban areas from 1984 onward. Using Landsat imagery in a cloud environment, the workflow adapts to each image by learning local endmember spectral signatures before applying constrained spectral mixture analysis, mitigating the influence of endmember spectral variability. Accuracy against reference maps is high (r > 0.8; MAE < 10%; RMSE < 13%), and agreement with a widely used product at 500 m is strong (r > 0.7; MAE < 12%; RMSE < 15%). We will provide pixel layers, city/regional indicators, and validity metrics to support applications including SDG monitoring, climate-adaptation planning, and equity-minded urban greening.

Cloud Masking in Polar Regions with Foundation Models for Multispectral Satellite Imagery

Tianyi You^1,2, Alexander Baumann^3,4,5, Michael Greza^1,2, Benjamin Busam^1,2

¹Photogrammetry and Remote Sensing, Technical University of Munich, Munich, Germany; ²Munich Center for Machine Learning (MCML); ³Siemens AG, Munich, Germany; ⁴Heidelberg University, Heidelberg, Germany; ⁵Division of Intelligent Medical Systems, German Cancer Research Center (DKFZ), Heidelberg, Germany

Cloud masking has been a critical processing step in earth observation (EO) satellite systems. Its applicability in polar regions remains difficult due to the significant challenges in the differentiation between cloud and snow areas. Despite diverse EO satellite imagery, it lacks a general approach to leverage them jointly due to the sensor dependency of most cloud masking frameworks. Vision foundation models (VFMs) offer new perspectives in realizing towards sensor-agnostic frameworks for cloud masking, however it remains under-explored and merits further investigation.

In this contribution, we propose a solution that leverages the strong feature extraction capabilities of novel foundation models for cloud masking in polar regions, building on prior works of the developed cloud masking models and the subsequent cross-sensor transferability study. The architecture mainly utilizes the pretrained self-supervised backbone from mainstream foundation models (i.e. DINOv3) and effectively adapts to downstream tasks through fine-tuning with the adaptable decoder. It also investigates text-aligned DINOv3 by incorporating pretrained text encoders to enable multimodal understanding for additional EO applications, including text-prompted identification and object query of geographic features in satellite imagery.

Compared to the prior works on the developed transformer-based cloud masking models, the VFM-based approach offers several key contributions of model capabilities, in terms of foundational backbone, sensor-agnosticity, multimodality, etc. The VFM-based multimodal approach employs advanced spectral-spatial encoding strategies compared to vision baselines for the assessment of text-alignment strategies for improved semantic tasks, establishing foundations for emerging vision-language tasks that enable trustworthy EO applications.

AI4EO: Accelerating Earth Intelligence for All with AI-Driven Earth Observation

Yifang Ban

KTH Royal Institute of Technology, Sweden & Lead, GEO AI4EO Enabler

The rapid expansion of Earth Observation (EO) data - from multispectral/hyperspectral to SAR, LiDAR, and dense time series - offers unprecedented opportunities to understand and monitor our changing planet. Concurrently, advances in artificial intelligence (AI) are transforming how these massive, multimodal datasets can be processed, interpreted, and translated into science-based decision support. Aligned with GEO’s Earth Intelligence for All Strategy, this work presents an integrated vision for accelerating global geospatial intelligence through AI-driven EO.

The GEO AI4EO Enabler plays a central role in realizing this vision. Designed to embed AI within GEO’s broader Earth intelligence ecosystem, it brings together a global network of AI and EO experts to foster cross-disciplinary collaboration, support capacity building, and develop and disseminate reproducible, accessible AI tools. The Enabler provides a framework to standardize AI-in-EO methodologies, promote responsible and ethical AI practices, and strengthen data-driven decision-making across diverse applications. As environmental and societal pressures intensify, this coordinated approach aims to make Earth intelligence more inclusive, scalable, and impactful.

Building on this foundation, we showcase transformational AI-driven EO applications: geospatial foundation model development and benchmarking; large-scale 2D and 3D urban mapping and continuous change detection; rapid flood and wildfire monitoring using satellite time series; multi-hazard building-damage assessment; and generative AI techniques that synthesize fine-resolution observations from coarse sensors for high-frequency operational monitoring.

By coupling the GEO AI4EO Enabler’s collaborative agenda with cutting-edge AI-driven EO, this work charts a clear pathway toward democratizing Earth intelligence and enabling informed decisions for a more sustainable and resilient future.

High-Resolution Mapping of Rock Outcrop Surface Conditions for Trace Metal Pollution Assessment near the Rouyn-Noranda Copper Smelter (Quebec, Canada)

Joël Masimo Kabuanga, Nicole Fenton, Osvaldo Valeria

Université du Québec en Abitibi-Témiscamingue

The rocky outcrops around the Horne copper smelter in Rouyn-Noranda (Quebec, Canada) exhibit highly variable surface conditions due to a century of atmospheric emissions. These surfaces act as passive archives of heavy metal deposits, but they remain poorly mapped due to their small size, spectral heterogeneity, and frequent mixing with vegetation or anthropogenic materials. This study presents a deep learning approach for high-resolution mapping of rock outcrops and their surface condition using multisensor remote sensing data.

We combined 0.2 m orthophotos (Vexcel UltraCam Eagle), Sentinel-1 SAR, Sentinel-2 multispectral imagery, and 1 m LiDAR derivatives to classify seven surface cover types: vegetation-covered rock, degraded soil mixed with till, smooth black-coated rock, anthropogenic surfaces, smooth uncoated rock, eroded till, and rough bare rock. The training data was created from a systematic 5 × 5 m annotation grid and field observations.

A U-Net convolutional neural network was trained for semantic segmentation using RGB orthophotos and features derived from LiDAR (slope, roughness, relief shading). The model achieved an overall accuracy of 86%, with high separability between bare rock classes and moderate confusion between degraded soils and eroded moraines. Probability and uncertainty maps with a resolution of 0.2 m were created from the softmax outputs to facilitate spatial interpretation.

The resulting maps reveal distinct spatial patterns of black coatings induced by pollution and erosion processes around the smelter. This work demonstrates the potential of multisensor fusion and deep learning for detailed environmental mapping in contaminated industrial landscapes.

Fitness Reconstruction with Gradient Synergy: Enhancing SVM Optimization for Remote Sensing Classification

Li Sun, yonggang Wu

Huazhong University of Science and Technology, China, People's Republic of

Intelligent optimization algorithms are powerful tools for complex geospatial computing, focusing on the exploration of key regions in the solution space. A primary application is the automated identification of optimal parameters for classifiers like SVMs, which is crucial for remote sensing. Traditional penalty methods are hindered by their empirical penalty factors: overly small values cause the search to remain trapped in infeasible regions, while excessive values divert it from the true optima, particularly under equality constraints. To address this, we reconstruct the fitness function based on the Karush–Kuhn–Tucker (KKT) optimality conditions. This formulation inherently ensures convergence to the feasible region and explicitly leverages the inverse collinearity between the objective and active constraint gradients at the boundary. Consequently, infeasible solutions are guided efficiently along a composite gradient direction toward the boundary, enabling high-precision, adaptive tracking. Our approach improves convergence efficiency and substantially reduces reliance on penalty parameters.

Toward Wavelength‑Independent Urban Scattering Characterization in Polarimetric SAR Data

Yong Wang, Dingfeng Duan

University of Electronic Science and Technology of China, China, People's Republic of

Polarimetric synthetic aperture radar (PolSAR) is gaining increasing attention for monitoring and analyzing urban areas and their changes, such as area extraction (Wang et al., 2024) and mapping (Wu et al., 2021). A critical foundation for the studies is the accurate characterization of urban scattering mechanisms. This task can be accomplished using polarimetric decomposition methods (Quan et al., 2023).

PolSAR systems are undergoing rapid technological developments, aiming for fine spatial resolution, wide swath, and multiple wavelengths. The development or variation of system parameters leads to changes in both the geometric and physical interaction (mechanism) of the imaging process for a radar target in urban areas in Earth observation. Then, understanding urban backscatter is challenging. In this study, we focused on the wavelength effect on the scattering mechanisms of urban targets in PolSAR data. An alteration approach has been proposed to achieve an equivalence in the decomposition results using PolSAR data across different wavelengths. After the approach, urban targets in the decomposed results exhibit consistency across the three bands , qualitatively and quantitatively. The approach is viable in reducing the impact of radar wavelength on the PolSAR decomposition result.

UAV LiDAR remote sensing for potentially large-scale rock fall detection

Sina Fatolahzadeh, Sergio A. Sepúlveda

Department of Earth Sciences, Simon Fraser University, Burnaby, BC, Canada

This study presents an integrated approach for identifying potential large-scale rock fall areas using high-resolution UAV LiDAR data collected over the Stawamus Chief, British Columbia, Canada. The methodology couples UAV-derived morphometric and structural analysis with software-based block detection and stability evaluation to delineate unstable areas in rock masses and quantify their potential failure modes. A comparison study with terrestrial laser scanner (TLS) data was also conducted to compare different remote sensing dataset resolutions and accuracy.

Standardized SAR Processing Platform with Cross-Sensor Consistency for Operational Monitoring

Chin-Rou hsu¹, Ya-Lun, S. Tsai¹, Shu-Wei Chang¹, Chun-Chia Huang², Michelle Chang³, Irene Hsu³, Jen-Yu Han¹

¹National Taiwan University, Taiwan; ²National Ilan University, Taiwan; ³National Space Organization, Taiwan

This study presents an integrated and user-accessible framework for SAR imagery analysis that bridges SAR data processing and AI applications. The framework focuses on three objectives: (1) establishing a standardized pre-processing pipeline for harmonized cross-sensor Level-2 products, (2) enhancing usability through a streamlined interface, and (3) demonstrating practical applications through three AI modules—oil-tank detection and geometric measurement, shoreline extraction and change analysis, and ship detection. Experimental results show that the system achieves 1.5–2× faster processing compared to manual workflows and enables consistent analysis across multi-sensor SAR data, including TerraSAR-X and ICEYE. The oil-tank module achieves 86.5% detection accuracy with sub-pixel height estimation, while the ship detection module achieves up to 100% detection accuracy under high-resolution conditions and 90% overall accuracy. Shoreline analysis demonstrates consistent detection of temporal coastal changes. These results demonstrate that the proposed framework provides a practical and scalable solution for integrating multi-sensor SAR data into AI-based operational monitoring.

Estimation of feather dune movement and sand flux with multi-source remote sensing data

Mimi Peng, Yifei Li, Yinghui Quan

Xidian university, China, People's Republic of China

The Kumtag Desert in northwestern China hosts one of the world’s most extensive fields of feathered dunes, whose continuous migration poses a direct threat to downstream oases, farmland and water resources. Yet, monitoring dune mobility in this hyper-arid environment is challenging. In this study, we develop a multi-sensor remote sensing framework that combines Sentinel-2 optical imagery and Sentinel-1 SAR data with a dense optical flow algorithm to derive high-resolution, spatially continuous displacement fields for 2017–2022. Sub-pixel displacements from COSI-Corr are used as an independent benchmark, and time series of dune migration rates are reconstructed through least-squares inversion. We further couple the remotely sensed migration rates with regional wind data to estimate sand flux and invert dune heights based on sediment mass conservation. The results reveal a persistent northeast–southwest migration of feathered dunes, with typical velocities of ~5–8 m/yr and a clear negative correlation between dune height and migration rate. The proposed framework overcomes key limitations of traditional methods and provides a transferable tool for two-dimensional kinematic analysis, aeolian hazard assessment and desertification control in complex dune systems worldwide

An Open-Source Application and a Benchmarking Framework for Sentinel-2 Image Sharpening

Aristides Vaiopoulos¹, Vasileios Tsironis², Konstantinos Karantzalos²

¹Raymetrics S.A., Spartis 32, Metamorphosis, Athens, Greece; ²NTUA, Department of Topography, Remote Sensing Laboratory, Athens, Greece

Earth Observing (EO) satellites are an invaluable tool in remote sensing and have various applications. Spatial resolution is often

crucial to those applications. The current work focuses on sharpening Sentinel-2 images. Moreover, a new application/program has

been developed towards this goal. The application sharpens Sentinel-2 lower resolution bands (20m, 60m) and creates a 12-band

image in 10m resolution. To run the program, one needs to load a Sentinel-2 L2A product, select one or more pansharpening methods

and click the fuse button. This process will fuse the whole scene, but it is possible to crop areas of interest and process them instead.

To validate the process, 14 pansharpening methods were employed and tested against well-known image quality metrics. On all areas

of interest, the quality indices agree with each other. However, the indices tend to penalize methods who fail spectrally, which is

correct, but they also tend to favor images with poor performance in the spatial domain. MS-SSIM seems to rank better the algorithm

images and is closer to the visual comparison assessment. HPF is one of the best performing methods for sharpening a L2A product

of Sentinel-2. ATWT, AWLP, HCS and LMM are good alternatives according to our results. The application, S-2 Sharpy (A

Sentinel-2 Image Sharpening GUI) is made available on Github. Furthermore, its generic counterpart, PanFusion (Image

pansharpening GUI for various sensors) is also made available on the mentioned platform, since it was the application that set the

foundation for the current application and study.

Comparison of Machine Learning and Physics-Based Approaches for Thermal Infrared Simulation

Benedikt Kottler, Raphael Zipperer, Eva Strauss, Dimitri Bulatov

Fraunhofer IOSB, Germany

Thermal simulation in urban digital twins enables effective monitoring of surface urban heat islands and supports climate adaptation planning. This paper evaluates machine learning and physics based approaches for this task through a unified validation framework based on 3D point clouds applied to an urban region in Berlin. The framework enables comparison of RandLA Net for 3D point cloud processing, InfraGAN for 2D texture synthesis, and physics based simulation on triangulated mesh geometries. RandLA Net architecture is adapted for thermal prediction and tested with two feature sets: RGB only and RGB with physics derived material parameters. Deep learning methods demonstrate severe spatial overfitting: training errors are minimal (MAE less than 1 K), but test performance degrades significantly on unseen regions with MAE increasing by factors of 1.9 to 2.5. Unexpectedly, augmenting with material parameters worsens generalization, indicating inadequate feature integration. Physics based simulation maintains consistent predictions (MAE approximately 8 K) with systematic bias addressable through calibration. These results motivate hybrid approaches embedding physical constraints into neural architectures for robust urban thermal modeling.

High-Resolution Downscaling of Urban Land Surface Temperature via Machine Learning

Naeim Mijani¹, James Voogt¹, Mohammad Karimi Firozjaei², Majid Kiavarz³

¹Department of Geography and Environment, Western University, London, ON N6A 5C2, Canada; ²College of Management, University of Tehran, Tehran, Iran; ³Department of Remote Sensing and GIS, University of Tehran, Tehran, 1417964743, Iran

Land surface temperature (LST) obtained from satellite observations is a key parameter for understanding Earth surface-atmosphere energy exchange and urban thermal environments. However, the use of existing satellite-derived LST datasets for urban applications is limited by the coarse spatial resolution and the mixed-pixel problem. By integrating both two-dimensional (2D) surface properties and three-dimensional (3D) urban morphological characteristics, this study proposes a machine learning-based framework for high-resolution downscaling of satellite-based urban land surface temperature (SULST). A Random Forest model was developed to generate a 1-m downscaled SULST (DSULST) map. The model demonstrates strong performance, with a Pearson correlation coefficient of 0.89, RMSE of 1.15 K, NRMSE of 0.095, MAE of 0.56 K, and an index of agreement of 0.95. The 1-m DSULST maps reveal substantial sub-pixel thermal heterogeneity that is not captured by conventional 30-m LST data. Fine-scale spatial patterns associated with vegetation, building structures, and roads are clearly resolved in the downscaled 1-m temperature maps. These results highlight a critical limitation of satellite-derived LST in representing intra-urban thermal variability. The findings demonstrate that enhancing the spatial resolution of urban LST is essential for urban applications, including modeling surface energy fluxes, pedestrian-level heat exposure, and energy consumption, all of which benefit from higher spatial resolution.

AURORA-Track: Uncertainty-Aware Identity Prediction for Robust Multi-Object Tracking in Satellite Video

Xin Huang, Pei Mi, MaoYu Wang, XuLei Shi, Wei Qin, PengJie Tao

School of Remote Sensing and Information Engineering, Wuhan University

Multi-object tracking in satellite videos faces unique challenges including small object sizes, low spatial resolution, frequent cloud-induced occlusions, and dramatic scene variations across geographic regions. Existing trackers, predominantly designed for ground-based scenarios, struggle to maintain reliable identity associations when satellite imagery exhibits long temporal gaps, transient visibility losses, and shifting appearance distributions. To address these challenges, we develop AURORA-Track, an end-to-end tracking framework that builds upon the Multiple Object Tracking as ID Prediction (MOTIP) backbone tailored for satellite video analytics. AURORA-Track introduces three key innovations: (1) an uncertainty-aware ID prediction module that augments the MOTIP decoder with calibrated confidence estimation, enabling robust handling of ambiguous associations and reducing false re-identifications; (2) a cloud/shadow-aware trajectory model that explicitly detects visibility degradations and leverages historical motion context to sustain tracking under partial or prolonged occlusions; and (3) a cross-scene knowledge transfer branch that meta-learns priors across diverse urban, maritime, and rural environments and rapidly adapts to new regions with minimal supervision. Extensive experiments on public satellite video datasets, including SatSOT and SatVideoDT, demonstrate that AURORA-Track achieves state-of-the-art performance, improving HOTA and reducing ID switches compared to leading baselines. These results validate the effectiveness of combining the MOTIP backbone with uncertainty-centric, occlusion-robust, and scene-adaptive enhancements for reliable satellite video tracking.

Multi-Sensor Random Forest Downscaling for 10 m LST Mapping and Urban Heat Island Monitoring in a Small-Sized City

Nicola Colaninno

Politecnico di Milano, Department of Architecture and Urban Studies (DAStU), Italy

Urban heat islands (UHIs) present a critical challenge to sustainable urban development, demanding high-resolution monitoring tools for effective climate adaptation. We address this need by implementing a machine learning framework for downscaling Land Surface Temperature (LST) data, demonstrating its ability to capture fine-scale thermal variations. The methodology leverages multi-sensor remote sensing data fusion, integrating high-resolution optical observations from Sentinel-2 with thermal imagery from Landsat 9 (daytime LST reference) and ASTER (nighttime LST reference). Random Forest (RF) regression is employed, utilizing Sentinel-2 multispectral bands, derived spectral indices (e.g., NDVI, NDBI) to characterize land cover, and a Digital Elevation Model (DEM) to account for topographic effects. The RF model was rigorously trained and its hyperparameters optimized via randomized cross-validation to predict LST at a 10-meter resolution. Results demonstrate robust performance, achieving a high R2 of 0.75 (Mean Absolute Error, MAE: 1.7°C) for daytime LST and R2 of 0.50 (MAE: 0.6°C) for nighttime LST. The resulting downscaled maps delineate pronounced heat accumulation in dense built-up areas, notably its historic center and large commercial zones, contrasting sharply with cooler vegetated areas and green urban corridors. A comparative assessment against bilinear interpolation, TsHARP thermal sharpening, and linear regression confirms that the RF framework achieves the best balance between predictive accuracy, spatial coherence with the source thermal data, and meaningful sub-pixel detail, effectively preserving the critical fine-scale thermal patterns. Ultimately, this study advances UHI monitoring by enabling the precise identification of heat-vulnerable areas, thereby supporting targeted mitigation strategies even in small and medium-sized cities.

Seasonal Assessment of Land Use Impacts on Daytime and Nighttime Urban Heat Island Intensity Patterns in a Hot and Arid Region: A Case Study of Ahvaz, Iran

Mohammad Karimi Firozjaei¹, Naeim Mijani², Majid Kiavarz³, Solmaz Fathololoumi⁴, Mohan Dev Joshi²

¹College of Management, University of Tehran, Tehran, Iran; ²Department of Geography and Environment, Western University, London, ON N6A 5C2, Canada; ³Department of Remote Sensing and GIS, University of Tehran, Tehran, 1417964743, Iran; ⁴School of Environmental Sciences, University of Guelph, Canada

This study aims to assess the seasonal impact of land use on daytime normalized urban heat island (DNUHI) and nighttime normalized UHI (NNUHI) in Ahvaz, one of the hottest cities in Iran. To this end, 63 corrected Landsat images acquired in 2024 were used, and daytime land surface temperature (DLST) and nighttime land surface temperature (NLST) were derived for the four seasons. Thereafter, DNUHI, NNUHI, and normalized UHI (NUHI) indices were derived by normalizing the temperature differences between urban and non-urban areas. A land use layer consisting of 14 classes was overlaid with the thermal data to investigate the role of land use type in controlling thermal patterns. The results showed that the highest DNUHI values were observed in industrial (0.14-0.20) and oil (0.12-0.19) areas, which generated the highest daytime heating. At night, the highest NNUHI values were recorded in industrial (0.12-0.24), military (0.07-0.20), and oil (0.08-0.18) land uses, indicating the strong heat storage capacity of these areas. In contrast, green spaces, orchards, and agricultural lands showed the lowest DNUHI and NNUHI values (about 0.01-0.06). These findings can inform the design of sustainable climate strategies, the development of green spaces, and land use management to reduce urban heating.

Johannesburg’s Urban Heat Island dynamics: Socio-economic and thermal patterns

KEVIN MUSUNGU, SINDANE MPENDULO

Cape Peninsula University of Technology, South Africa

Urbanisation in Johannesburg is significantly altering local climate conditions, yet long-term, satellite-based analyses of the Urban Heat Island (UHI) effect remain limited. This study addresses this gap through a ten-year (2014–2024) spatio-temporal assessment of Land Surface Temperature (LST) patterns and their socio-economic drivers. Landsat 8 imagery processed in Google Earth Engine (GEE) provided high-resolution LST data, which were integrated with regional socio-economic indicators, including population density and poverty metrics, and analysed using Ordinary Least Squares regression to examine their statistical relationships.

Findings indicate an apparent intensification of the UHI effect, with Johannesburg’s average LST in 2024 0.79°C higher than in 2014, and a 28% increase in population. Spatial analysis identified Regions D and G as persistent heat islands. At the same time, Region B consistently remained a cool zone, reflecting the significant role of land use and land cover in shaping intra-urban temperature variations. Poverty consistently correlated with higher surface temperatures, whereas population density showed a weak or negative relationship, suggesting that factors such as vegetation cover, construction materials, and surface permeability exert a greater influence on local temperatures than population density alone. Comparative analysis with other South African cities indicates that these patterns are systemic and socio-economically driven, highlighting broader issues of environmental inequality.

The study concludes that Johannesburg’s UHI effect is intensifying and raising urgent environmental justice concerns. It recommends targeted, socially equitable interventions, including urban greening programmes, cool roofing and paving materials, and thermal resilience strategies in informal settlements, to promote climate-adaptive and inclusive urban development.

Integrating Multi-Source Temperature Data and Explainable Deep Learning for Urban Microclimate Analysis

Shiqi Tu^1,2, Qingming Zhan^1,2, Yinghui Xiao^1,2, Ruihan Qiu^1,2, Zhihua Liu¹

¹School of Urban Design, Wuhan University, Wuhan 430072, China; ²Research Center for Digital City, Wuhan University, Wuhan 430072, China

Understanding the relationship between land surface temperature (LST) and near-surface air temperature is critical for urban microclimate research, especially for fine-scale thermal assessment in heterogeneous urban environments. This study investigates the spatial and temporal coupling between satellite-derived LST and in-situ air temperature during the summer of 2024 (June–August) on a university campus characterized by mixed building forms, surface materials, vegetation, and water bodies. High-resolution LST data were derived from Landsat-8 imagery, while near-surface air temperature was measured using a dense IoT-based monitoring network consisting of 19 observation sites.

Instead of treating LST as a direct proxy for air temperature, the analysis focuses on comparing spatial rankings, diurnal variations, and surface–air temperature differences across monitoring sites to identify systematic patterns of thermal consistency and divergence. The results show that LST presents stronger spatial differentiation than near-surface air temperature, whereas air temperature exhibits smoother spatial patterns and clear nighttime convergence. Surface–air temperature differences vary systematically across environmental settings, indicating heterogeneous coupling relationships between surface and atmospheric thermal conditions.

To further examine spatial correspondences, a convolutional neural network combined with Gradient-weighted Class Activation Mapping (Grad-CAM) was employed to evaluate whether spatially reweighted LST information better explains observed air temperature variability. The results indicate that emphasizing thermally relevant surface regions improves the consistency between satellite-derived thermal signals and in-situ air temperature observations.

Overall, this study provides an interpretable framework for analyzing surface–air temperature relationships at the micro-scale and supports more reliable urban thermal environment assessment by integrating satellite observations with ground-based measurements.

Machine Learning for recognition and mapping of rare earths in Brazil using reflectance spectroscopy and hyperspectral satellite imagery

Matheus Gomes¹, Ruy Castro², Gustavo Vieira²

¹Aeronautics Institute of Technology, Brazil; ²Institute for Advanced Studies, Brazil

This work presents a Machine Learning approach for the automatic recognition and mapping of rare earths in Brazil. While the country holds the world's second-largest reserves, identifying these valuable elements remains a challenge. By combining reflectance spectroscopy measured in the laboratory with open-access hyperspectral satellite imagery, a specific rare earths dataset is compiled. This dataset is used to train, validate and test neural networks to correctly classifiy rare earths by their spectral signatures.This method provides a novel and efficient tool for mineral prospecting and supports the geological community in assessing the national potential of these critical resources.

High-resolution LiDAR and thermal UAV data for 3D analysis of urban vegetation structure and its cooling effect in San Nicolás, Mexico

Kevin David Rodríguez González¹, Aylet Vega-Aguilar^1,2, Fabiola Doracely Yépez-Rincón¹, Jacinto Treviño Carreón³, Andrés Velastegui-Montoya⁴, David Freire-Granda⁴, Isabel Adriana Chuizaca Espinoza⁴

¹Universidad Autónoma de Nuevo León, Faculty of Civil Engineering, Department of Geomatics, San Nicolás de los Garza, Nuevo León, México; ²Departament of Geography and Regional Planning, Institute for Research in Environmental Sciences of Aragon (IUCA), Universidad de Zaragoza, España; ³Faculty of Engineering and Sciences, Universidad Autónoma de Tamaulipas, Ciudad Victoria, Tamaulipas, México; ⁴Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University, Guayaquil, Ecuador

Urban vegetation is essential for mitigating the Urban Heat Island effect, yet its cooling performance depends on its three-dimensional structure. This study combines high-resolution Unmanned Aerial Vehicle - based LiDAR (Zenmuse L2) and thermal imaging (Zenmuse H20) to analyze vegetation structure and surface temperature across 4 urban parks in San Nicolás de los Garza, Mexico. LiDAR data were processed to generate Digital Terrain Model, Digital Surface Model and Canopy Height Model models, enabling the segmentation of individual trees and extraction of structural metrics such as canopy height, crown area and point density. Thermal orthomosaics were co-registered with LiDAR models to quantify temperature contrasts between vegetated and impervious areas. Results reveal consistent cooling effects in all parks, with vegetated zones showing 8–15 °C lower surface temperatures depending on canopy density and maturity. Larger parks with continuous canopies displayed the strongest thermal regulation. This integrated LiDAR–thermal approach provides a precise and scalable framework for assessing microclimatic benefits of urban vegetation, supporting climate-resilient planning in rapidly urbanizing regions.

Trend analysis and temperature prediction using MODIS time series Images in the Metropolitan Regions of Campinas and Piracicaba, Brazil

Matheus França Duarte da Silva, Danilo Marques de Magalhães, Carlos Eduardo Nascimento dos Santos, Aline Rocha Gonçalves, Manuela Saran Rando Barros

Dept. of Geography and Environmental Planning, São Paulo State University, Rio Claro, Brazil

This study examines land surface temperature (LST) trends and future projections in the Metropolitan Regions of Campinas and Piracicaba, São Paulo, Brazil, from 2002 to 2022. A time series of 15,091 MODIS LST images (MOD11A1 and MYD11A1 products, v6.1) was processed using Google Earth Engine to generate monthly composites, which were subsequently analyzed in ArcGIS Pro. Harmonic regression modeling identified seasonal and interannual temperature trends and simulated monthly temperatures through 2033. Eight municipalities, grouped by urban density, were selected for detailed comparison. The results indicate persistently higher LST values in highly urbanized areas, while municipalities with initially lower urbanization levels exhibited steeper warming trends over time. Projected January temperature increases between 2023 and 2033 range from 0.4°C to 1.0°C, with the most pronounced changes occurring in areas experiencing rapid land-use transformation. These findings are consistent with broader patterns of urban heat island intensification, emphasizing the combined effects of vegetation loss, impervious surface expansion, and urban densification. While the projections are statistical estimates based on historical trends, they provide valuable guidance for climate adaptation strategies and urban planning. This study demonstrates the utility of MODIS time series and multidimensional GIS analysis for monitoring and forecasting thermal dynamics in rapidly urbanizing regions.

Interpolation methodologies comparison for Heat Index Assessment

Ivone G. Zapata-Wah, Dariela A. Vazquez-Rodriguez, Fabiola D. Yepez-Rincon, Andrea N. Escobedo-Tamez

Autonomous University of Nuevo Leon, Civil Engineering Institute, Geomatics Department, Mexico

Urban development is often accompanied by anthropogenic activities, changes in land morphology and serious damage to natural areas. Consequently, the urban climate is also affected, as temperatures are higher in urban centers and because of the presence of the urban heat island phenomenon, which poses a health threat to local citizens. The Monterrey Metropolitan Area (MMA) is the second-largest urban area in Mexico and is characterized for rapid urbanization and industrialization processes, steep climate conditions and the presence of urban heat islands. This combination makes living conditions rough for its inhabitants, especially for vulnerable groups. In order to quantify and compare heat vulnerability in urban areas, metrics such as the Heat Index measure the heat exposure and its effects on the human body. This study interpolated both relative humidity and temperature information from 15 local climate monitoring stations to determine the Heat Index for the six hottest weeks of the 2023 summer in the Monterrey Metropolitan Area. The interpolation methodologies used (IDW, Kriging and Spline) were later compared in order to cross-validate the results and define the most accurate performance base on both MAE and RMSE statistical analysis.

Multispectral Anomaly Detection: Comparison of sensor bands in conventional and machine learning approaches

Eva Strauss¹, Dimitri Bulatov¹, Edwin Deisling¹, Arno Schmidt², Peter Solbrig², Daniel Wagner², Alexander Gatter², Martin Kolloch²

¹Fraunhofer IOSB, Germany; ²Rheinmetall Electronics GmbH, Germany

Operational monitoring increasingly depends on UAV imagery for safety, environmental, and infrastructure applications. Yet detecting unexpected objects remains challenging when targets blend into the background or operations extend to low-light and night conditions. Modern UAV platforms with integrated sensors now make high-resolution RGB, co-registered multispectral, and longwave infrared data more and more readily available, motivating methods that exploit complementary reflectance and thermal cues. In this paper, we address the detection of camouflaged objects by multispectral anomaly detection. We study 15 different three-channel stacks deviated from several image modalities, including real imagery and simulated longwave infrared images that encode the expected scene. This allows us to recast anomaly definition as reality–simulation discrepancy, as alternative to the conventional anomaly definition. We separately apply four detectors of differing categories to these image stacks: the classical Reed–Xiaoli Detector, a Region-of-Interest extractor, the Isolation Forest as convenctional machine learning approach, and a finetuned deep learning model. Evaluation is based on well-established metrics including precision, recall, and the F1-Score. Results reveal that combinations of near- and longwave infrared offers the best accuracy, longwave infrared alone is competitive, and simulated infrared imagery generally reduces performance, most likely due to a rather significant reality–simulation gap. We conclude that combining reflectance and thermal channels is critical for robust anomaly detection and that compact deep models currently provide the best trade-off for operational deployment.

Spaceborne spectral and thermal datasets for REE mapping using machine learning techniques: A case study on Siwana Ring Complex, Rajasthan, India

Ronak Jain

Banasthali Vidyapith, India

The Siwana Ring Complex (SRC), located in western Rajasthan, India, is a distinctive geological formation characterized by its elliptical configuration. It primarily consists of rocks from the Neoproterozoic-era Malani Igneous Suite, reflecting its ancient volcanic origins. Peralkaline granitic rocks attract attention due to their potential to host valuable mineral deposits, particularly rare earth elements (REEs) and niobium (Nb). This study explores the potential of spaceborne imaging spectrometer (EMIT) and multispectral (Sentinel-2 MSI and ASTER Thermal) datasets for demarcation of REEs-bearing peralkaline granites, along with the potential sites of REEs. Silica and feldspar mapping was performed through the ASTER TIR dataset for targeting the potential sites of alteration zones within the peralkaline granites. Support Vector Machine (SVM) and Random Forest (RF) machine learning algorithms were applied on the EMIT and Sentinel-2 datasets for targeting the peralkaline granites of the region, which are the host rock for the REEs. The accuracy achieved through the EMIT and Sentinel-2 classified image varies. SVM and RF accuracies for EMIT are 93% and 96% respectively, while for Sentinel-2 are 95% and 99% respectively. Integrating the results from ASTER TIR with Sentinel-2 and EMIT highlighted the REEs-enriched zones within the peralkaline granites. This study demonstrated the potential of synergic use of thermal with spectral datasets for REEs delineation.

A novel wavelet-based destriper with spatial progressive attention for infrared images

Wei You¹, Zhiqiang Bian², Xinbo Zhou², Yi Xu¹, Kaimin Sun¹, Min Zhang³

¹Wuhan University, China, People's Republic of; ²Shanghai Institute of Satellite Engineering, China, People's Republic of; ³North Automatic Control Technology Institute, China, People's Republic of

information from intra-strip, inter-strip, to global levels. Specifically, a strip attention unit is proposed to harvest the contextual information for each pixel from its adjacent pixels in

the same row or column, while row attention and global attention are combined with their wide-ranging feature representation. This multi-scale attention mechanism address

local stripe artifacts and progressively incorporate broader image context in challenging conditions and provides more reliable results for practical applications. Experimental evaluations on multiple datasets demonstrate significant improvements compared to state-of-the-art methods. The method is designed to be computationally efficient and suitable for real-world deployment in fields.

Spatiotemporal Transformer Networks for Reconstructing Historical Landsat Time Series

Masoud Babadi Ataabadi¹, Darren Pouliot², Dongmei Chen¹, Temitope Seun Oluwadare¹

¹Laboratory of Geographic Information and Spatial Analysis, Department of Geography and Planning, Queen's University, Kingston, ON K7L 3N6, Canada; ²Landscape Science and Technology Division, Environment and Climate Change Canada, Ottawa, ON K1A0H3, Canada

The Landsat program provides over five decades of moderate-resolution satellite imagery, offering an invaluable record for monitoring land cover and land use changes. Despite its consistent calibration and open-access policy, Landsat’s low temporal resolution and frequent cloud contamination lead to sparse and irregular time series, limiting its usefulness for temporally continuous analyses. Reconstructing these missing observations is essential to improve temporal consistency and enable more accurate environmental monitoring. Previous studies, including our earlier work with the closed-form continuous-depth neural network (CFC-mmRNN), have shown promising results in modelling irregular Landsat time series. While the CFC-mmRNN achieved higher accuracy and lower computational cost than traditional methods such as continuous change detection (CCD), its performance declined under extremely sparse conditions, highlighting the need for more robust approaches. To address these limitations, this study introduces two transformer-based models for reconstructing very sparse historical Landsat time series: a one-dimensional Transformer and an enhanced three-dimensional variant that integrates a convolutional neural network (ResNet) with the Transformer architecture. The 1D Transformer processes individual sparse time series as input, whereas the 3D Transformer employs image patches (spatiotemporal cubes) to capture both spatial and temporal dependencies. Both models were applied to Landsat data (1985–2023) across the Canadian Prairies and evaluated against the CFC-mmRNN under varying spectral bands, data densities, and seasonal conditions. The results demonstrate that the Transformer-based models consistently outperform CFC-mmRNN, providing more accurate and temporally consistent reconstructions, particularly under extremely sparse observation scenarios.

Deep Learning Benchmarks for short-term Arctic Sea Ice Forecasting

Bo-Ram Kim¹, Junhwa Chi^1,2

¹Department of Data Engineering, Pukyong National University, Republic of (South Korea); ²Major of Big Data Convergence, Division of Data Information Sciences, Pukyong National University, Republic of (South Korea)

Rapid Arctic warming has accelerated sea ice decline, intensifying interest in the Northern Sea Route (NSR) and the demand for reliable short-term forecasts. This study benchmarks non-recurrent deep learning models for daily sea ice concentration (SIC) forecasting over the NSR using the NSIDC-0051 SIC record (1988–2023). For each forecast, models ingest the previous 30 days of SIC on a 64 × 128 grid and predict the subsequent 10 days. Models are trained and validated with a five-fold walk-forward scheme over 1988–2020 and tested on 2021–2023. Two deployable architecture families are evaluated: CNN-based and Transformer-based backbones. To align with NSR operations, evaluation focuses on navigation-centric metrics. SIC fields are thresholded at 15% to define ice masks, and forecast skill for 1–10-day leads is assessed using Integrated Ice Edge Error, Mean Boundary Error, Intersection over Union and Anomaly Correlation Coefficient. CNN-based backbones consistently outperform Transformer-based backbones for boundary and overlap metrics across all lead times, with PoolFormer achieving the lowest errors and highest overlaps and leading short-term anomaly skill. However, the family-mean boundary error for the CNN group exceeds 30 km at a 7-day lead and 35 km at a 10-day lead, indicating that the practical utility of current models for NSR route planning is limited beyond about one week. These findings support modern CNN-based architectures for operational short-term Arctic sea ice forecasting and highlight the need for hybrid designs that preserve strong spatial feature encoding while better representing multivariate temporal dependencies.

Snow water equivalent trends in North America through the lens of passive microwave remote sensing and deep learning models

Krystin Kys, Isteyak Isteyak, Karim Malik, Yusriyah Rahman, Karanveer Sidhu, Hala Al Daker

University of Windsor, Canada

Over the past decades, snow cover trends in North America have been analyzed, providing vital information to the Global Climate Observing System and other stakeholders about the looming signals of climate-driven snow declines. Detecting daily changes in snow parameters (e.g., snow depth, snow cover extent, and snow water equivalent) is, however, fraught with challenges, including internal variability unrelated to climate signals. We used GlobSnow's passive microwave remote sensing data and a Siamese U-Net model to compare patterns of daily changes in snow water equivalent (SWE) over the mid- and high-latitude regions of North America. The model detected changes in SWE with an F1-score of 94.8% and 100.0% in locations where it was not trained, and 99.3% at the location where it was primarily trained; this suggests the model's generalization potential to different climatologies and geographic locations. Using the model, we computed a similarity vector to compare SWE trends. We found that although lake-effect snowfall may be prevalent in the Great Lakes Basin during the winter months, the region consistently records the highest frequency of daily changes in SWE. Alaska, Yukon, and the Northwest Territories tended to have minimal daily changes in SWE, suggesting that latitudinal gradients may dominate changes in the snow regime and cryosphere's processes in the warming climate scenarios.

OPTIG: Open-source Python Tool for Ice Thickness and Glacier volume.

Anurag Sharma¹, Avtar Singh Jasrotia^1,2

¹Department of Remote Sensing and GIS, University of Jammu, Jammu 180006, Jammu and Kashmir, India; ²Department of Geology, University of Jammu, Jammu 180006, Jammu and Kashmir, India

This contribution introduces OPTIG, an open-source Python tool for modeling glacier ice thickness and volume using Glen's Flow Law. The tool integrates geospatial inputs including DEMs, surface velocity raster, and flowline data to perform subglacial bed inversion and identify potential glacial lake outburst flood (GLOF) hazard sites. Validation against GPR measurements demonstrates ±22% uncertainty ranges. OPTIG empowers data-scarce regions with accessible, high-fidelity glaciological analysis for climate adaptation and hazard resilience.

AI-assisted physical modeling of sun glint to improve inter-sensor consistency of remote sensing reflectance in coastal waters

Milad Niroumand-Jadidi, Lorenzo Mentaschi, Sonia Silvestri

University of Bologna, Italy

The remote sensing of biophysical parameters in aquatic systems, such as water constituent concentrations, depends strongly on the quality of the spectral data. Sun glint, specular reflection from the water surface, is a major artifact that can substantially contaminate the remote sensing reflectance (Rrs). Accurate modeling of glint is essential, particularly in multi sensor analyses, to ensure seamless Rrs and water constituent products. We build upon the recently developed WASI AI model to mitigate sun glint effects. WASI AI is an AI assisted physical inversion framework that offers key advantages over traditional physics only approaches, including improved handling of spectral ambiguities and significantly faster inversions. We evaluate the effectiveness of WASI AI’s glint correction capability through an inter sensor consistency analysis between Landsat 9 and Sentinel 2. The analysis uses near simultaneous acquisitions over optically complex coastal waters of the Adriatic Sea. The two overpasses are only a few minutes apart, which allows to assume stable bio-optical conditions. However, sun glint can vary rapidly because it is sensitive to viewing and illumination geometry as well as wind driven surface roughness and currents. These factors may affect the data from the two sensors differently. Our results show that the WASI-AI glint correction identifies substantial differences in magnitude and spatial patterns of glint between the near simultaneous Landsat 9 and Sentinel 2 acquisitions. The Rrs consistency analysis demonstrates that, after glint correction, agreement between corresponding bands of the two sensors improves on average by 6% in R^2 and by 5% in NRMSD.

Near Real-Time Flood Mapping from Sentinel Data Using Machine Learning Techniques

Matija Gerčer, Tanja Grabrijan, Krištof Oštir

University of Ljubljana, Slovenia

This study presents a near-real-time flood-mapping approach that integrates satellite-based Earth observation (EO) data, digital elevation models (DEMs), and machine-learning (ML) techniques. Several publicly available flood datasets were evaluated; however, none fully met the requirements for spatial coverage, data quality, and thematic diversity needed for robust model development. To address these limitations, a dedicated training dataset was constructed using Copernicus Emergency Management Service (EMS) Rapid Mapping products, comprising 38 flood events from 2022 to 2025. A modular workflow was developed to generate ML-ready datasets from satellite imagery, including data acquisition, advanced preprocessing, flood mask generation, and image tiling. Additional steps, such as co-registration, rescaling, data fusion, and masking irrelevant regions, were implemented to ensure spatial and temporal consistency across heterogeneous inputs. The developed model demonstrates reliable performance in delineating flood extents, achieving an average IoU of 0.70 on the validation dataset. Although the system remains under active development, the results indicate strong potential for operational deployment in near-real-time flood monitoring.

Automated 3D extraction of hydromorphological metrics from LiDAR data

Alexandre Rétat^1,2, Nathalie Thommeret^1,2, Frédéric Gob^2,3, Jean-Stéphane Bailly⁴, Laurent Lespez^1,2, Karl Kreutzenberger⁵, Thomas Depret³

¹Université Paris-Est Créteil, France; ²Laboratoire de Géographie Physique, CNRS UMR 8591, Thiais, France; ³Université Paris 1 Panthéon-Sorbonne, France; ⁴LISAH, Univ. Montpellier, AgroParisTech, INRAE, Institut Agro, IRD, Montpellier, France; ⁵Office français de la biodiversité, Direction générale, Service Eau et Milieux Aquatiques, France

Rivers play a key role in the functioning of ecosystems, and their hydromorphological condition is essential for environmental assessments and water management. In France, field measurements used to evaluate channel geometry, such as bankfull width and slope, remain limited in spatial coverage due to logistical constraints. However, with the nationwide availability of high-density LiDAR data (>10 points per square metre), new opportunities have emerged for the large-scale, reproducible and automated characterisation of river morphology.

This paper introduces Bf3D, a fully automated 3D workflow designed to extract hydromorphological metrics from pre-classified LiDAR point clouds. Unlike traditional approaches based on manually placed cross-sections or 2D analyses, Bf3D relies on a continuous 3D representation of channel topography. The workflow includes automated river delineation, irregular digital terrain model (DTM) reconstruction, detrending, and a volumetric adaptation of the hydraulic-depth method to estimate bankfull stage and width.

Bf3D has been applied to over 1,400 river reaches across France. The results demonstrate accurate centreline delineation and bankfull width estimates that are close to field measurements. This approach removes user-dependent biases and enables rapid processing at a national scale. This approach introduces a new paradigm for hydromorphological monitoring by enabling the consistent, automated computation of key indicators across extensive river networks.

GRACE/GRACE FO: On Accurate estimation of Groundwater Storage Change from Satellite Gravimetry and beyond

Saurabh Choubey¹, Shard Chander², Rina Kumari¹

¹Central University of Gujarat, Vadodara, India; ²Space Applications Centre, ISRO, Ahmedabad, India

The present work focused on the synergistic utilization of Gravity Recovery and Climate Experiment (GRACE) and GRACE-Follow On (FO) derived Ensemble Terrestrial Water Storage Anomaly (TWSA), downscaled (GOU) TWSA and a gap filled JPL TWSA data and other hydrological variables to assess variability in groundwater storage (GWS) change in the last two decades (2002-2022) over Indus and Ganga river basins. The Indus basin witnessed a significant decline in groundwater with a rate of change between -2.11 to -3.0 cm/yr. Ganga basin also witnessed a significant decline in GWSA with ensemble dataset indicating a decline of -1.27 cm/yr and gap-filled JPL dataset indicating a decline of -1.88 cm/yr after removing soil moisture estimates respectively. Groundwater Storage Anomaly (GWSA) obtained from downscaled TWSA also indicated a significant negative trend. However, the magnitude of trend was considerably lower (0.9 cm/yr) than the ensemble (-1.37 cm/yr) and gap filled (-1.88 cm/yr) datasets. Ground observations also indicated a decline in GWSA in Ganga basin with a rate of change -0.26 cm/yr. GWSA computed from downscaled TWSA and satellite derived soil moisture showed highest positive corelation (R = 0.78) and least RMSE (17.5 cm) with in-situ GWSA in the Indus basin during 2002-2022. Similar results were observed for Ganga basin where downscaled TWSA (R=0.87) showed satisfactory corelation and low RMSE (8.11 cm) indicating that GOU TWSA and European Space Agency (ESA) soil moisture derived GWSA was able to capture the localized groundwater storage change effectively.

Assessment of spatio-temporal rainfall variability over high altitude Himalayan catchment

Omkar Pal¹, Amit Kumar Dubey², Shard Chander², Sudhakar Shukla¹, Saurabh Choubey³, Parth Dave⁴, Praveen Kumar Gupta²

¹Remote Sensing Application Centre, Lucknow, Uttar Pradesh, India; ²Space Application Centre, ISRO, Ahmedabad, Gujarat, India-380015; ³School of Environment and Sustainable Development, Central University of Gujarat, Vadodara, India; ⁴LDRP Institute of Technology and Research, Gandhinagar, India

The Indus River Basin, a major Himalayan river system, has complex topography limiting availability of in situ measurements, which obstruct reliable characterization of precipitation patterns, thereby negatively affecting climate impact assessments and water resource management strategies. Understanding hydrological processes and efficiently managing water resources and dangers in Himalayan river basins depends on accurate high-altitude precipitation estimation. In this study, we have used the satellite-based precipitation reanalysis dataset (ERA5) and gauge-based data of IMD to overcome this issue. To conduct the analysis, we have used statistical methodologies, which include correlation analysis, root mean square error, probability of detection, and critical success ratio. We assessed the performance and detection of precipitation from ERA5 in comparison to IMD for the high altitudes. The performance evaluation of ERA5 precipitation against IMD bservations indicates a reasonably good agreement between IMD and ERA5 datasets in representing precipitation patterns over the study region, with R² = 0.793 and RMSE = 47.831 mm. The POD = 0.9686 and CSI = 0.7507. These results suggest that ERA5 provides a reliable representation of rainfall variability over the study area and can be effectively used for regional climate and hydrological applications. Further, we evaluated the performance of a gauge-merged precipitation dataset (GSMaP_ISRO) to highlight the significance of gauge merging over the study area. It was observed that the dataset outperformed in all the statistical indices. This study affirms the reliability of satellite-based precipitation datasets in high-altitude Himalayan regions and provides critical insights for sustainable water resource management in the face of evolving climatic conditions.

Study of Physical and Chemical Parameters of Indus River Water

Jitendrabhai Khasiya, Subrat Sharma, M.S Mondal

University of Ladakh, India

This study focuses on assessing the physical and chemical parameters of Indus River water

collected from a single sampling location, with special emphasis on seasonal variations and

sample preparation for ICP–MS analysis. The objective is to evaluate how water quality and

sediment inflow vary across different seasons and to determine the concentration of dissolved

and particulate matter in the river system. Water samples were collected regularly from the same

site of the Indus River during the summer, monsoon, and winter seasons. The analyzed physical

parameters include temperature, pH, oxidation-reduction potential (ORP), dissolved oxygen (DO

in mg/L and % saturation), electrical conductivity (EC), total dissolved solids (TDS), and

salinity. These parameters help in understanding the physicochemical condition of the river and

its environmental status. Temperature and DO show seasonal dependency due to changing flow

and temperature conditions, while EC, TDS, and salinity indicate variations in ionic

concentration and evaporation rate.

Spectrometry) analysis to estimate trace and heavy metal concentrations. In addition to field

observations, Remote Sensing and GIS techniques were used to analyze spatial variations in land

use, vegetation cover, and watershed characteristics influencing the Indus River. Satellite data

(Landsat and Sentinel) were processed in QGIS and Google Earth Engine to detect seasonal

changes in turbidity, surface temperature and land cover. The study concludes that the Indus

River water exhibits clear seasonal variations in its physical parameters and sediment load.

Spectral signature analysis of snow contamination in Himachal Pradesh: a multi-analytical approach for cryosphere monitoring

Balla Vivek, Pradeep Kumar Garg

Indian Institute of Technology Roorkee, India

The cryosphere is essential for maintaining the balance of Earth's climate; however, it faces growing threats from increasing anthropogenic activities, including industrial emissions, biomass burning, and vehicular pollution, which have led to significant deposition of pollutants like ash on snow surfaces. These pollutants, originating from local industries, forest fires, and traditional wood-burning practices in the region, are altering the natural snow properties and accelerating disasters, snowmelt processes, potentially affecting climate, water resources, and local ecosystems. This research examines the effects of ash contamination on snow reflectance in the Himachal mountainous region of India, utilizing hyperspectral data collected through an XHR 1024i spectro-radiometer. The analysis involved a detailed examination of prominent absorption features, first derivative assessments, calculations of relative absorption strength, albedo evaluations, and the application of Principal Component Analysis (PCA) to thoroughly investigate the spectral alterations resulting from ash deposition. The need for this study arises from the growing concern over the accelerated melting of snow and glaciers due to reduced albedo caused by impurities like ash. The analysis indicates that the absorption feature at 1025 nm exhibits a pronounced sensitivity to ash contamination, demonstrating a reliable decline in relative absorption strength as ash concentration increases. The first derivative analysis highlighted rapid changes in reflectance, aiding in the identification of absorption features, while principal component analysis indicated that more than 99% of the spectral variance can be attributed to ash concentration. Albedo analysis supported the observed spectral alterations by confirming a notable decrease in snow reflectance.

Estimating Long-Term Groundwater Storage Change in the Chad Basin, Nigeria, using GRACE/GRACE-FO and GLDAS Terrestrial Water Storage Anomalies

Ubong Udoka, Karel Pavelka

Czech Technical University, Faculty of Civil Engineering, Thákurova 7, 16629, Prague 6,

The Chad Basin is a major water source for more than 30million people across four countries in the arid Sahel. Understanding long-term groundwater changes in the Chad Basin is necessary for water security, abstraction management and transboundary cooperation. In this study, we employed GRACE satellite and GRACE-FO satellite data (Total Water Storage Anomaly, TWSA) along with GLDAS land surface modeling to determine Groundwater Storage Anomaly (GWSA) trend between year period 2002 and 2024. The findings reveal water hydrological paradox as the basin shows a significant TWSA increasing trend of +5.91 mm/year (R² = 0.70). But, the gain is decoupled from replenishable reserves which are declining for the Surface Water/Soil Moisture (-1.04 mm/year) and near GWSA stagnation (+0.24mm/year, R² = 0.02). The rainfall shows a weak association (+1.65 mm per year trend) with GWSA (r = -0176). From this, it appears increasing rainfall is ineffective for recharging the deep aquifer. The excessive use of humans contributes to the localized depletion of the severe GWSA in the western margins, primarily in northeastern Nigeria. The present findings indicate that rather than climate variability, it is the failure of governance. That water scarcity is due to our unsustainable human activities and the inefficient water recharge pathways. In order to implement spatially-explicit abstraction quotas and prioritise effective high efficiency Managed aquifer recharge schemes, the data is essential for LCBC.

Hydromorphological Monitoring and Navigation Assessment on Alluvial River Sections Using Sentinel-2 and Water Gauge Data

Michał Śmiarowski

MILITARY UNIVERSITY OF TECHNOLOGY, Poland

Monitoring dynamic alluvial rivers is essential for safe inland navigation, yet traditional bathymetric surveys are often costly and infrequent. This study presents an automated, cost-effective methodology for detecting and monitoring migrating sandbars by integrating Sentinel-2 satellite imagery with daily water gauge data.

Implemented within Google Earth Engine (GEE), the algorithm matches specific river water levels with cloud-optimized satellite scenes. It utilizes the Sentinel Water Mask (SWM) index to separate water from sediments, applying a 30-meter internal channel buffer to mitigate mixed-pixel errors along the shorelines.

The automated extraction was validated against high-resolution (3-meter) PlanetScope imagery. The results demonstrated high geometric agreement (mean Intersection over Union = 0.71) and a strong area correlation (R² = 0.97). While the 10-meter spatial resolution of Sentinel-2 introduces a systematic 26% overestimation of the sandbar areas , this over-segmentation serves as a beneficial safety margin in a navigational context, preventing the underestimation of submerged obstacles.

By correlating specific gauge levels with the emergence of sandbars, this method provides a vital 2D spatial baseline that enables the estimation of available water columns over specific bottlenecks. Ultimately, this approach supports the continuous generation of spatial databases, offering a practical foundation for dynamic relative depth mapping within River Information Services (RIS).

Satellite-based analysis of snow cover trends and transitions in Nepal

Harshita Tiwari, Aditya Kumar Thakur, Rahul Dev Garg

Indian Institute of Technology Roorkee, Haridwar, India

Snow cover plays a critical role in the hydrology and climate of the Himalayas, serving as a vital water reservoir for millions of people. Most previous studies often placed limited emphasis on recent country-scale assessments along with detailed snow variability. This study assessed the spatio-temporal dynamics of snow cover in Nepal during 2024 using 8-day MODIS snow cover products at 500 m resolution. Monthly maximum snow composites were generated to quantify snow cover fraction, seasonal trends, persistence, and variability. Results show distinct seasonal variation, with mean snow extent highest in winter (42.97%) and lowest in autumn (26.55%). Monthly snow cover peaked in April (50.01%) and reached a minimum in November (22.31%), reflecting strong intra-annual variability. Snow persistence mapping revealed that 32.32% of Nepal experienced no snow throughout the year, whereas 6.91% remained snow-covered year-round, corresponding to high-altitude permanent snow regions. The snow status change analysis highlighted dynamic snow behavior, with over 60% of pixels experiencing one or more transitions, underscoring the sensitivity of transitional snow zones. These findings improve understanding of snow variability in complex terrain and provide a scientific basis for hydrological modeling, water resource planning, and climate change adaptation in Nepal, where snowmelt-driven runoff is a key contributor to river discharge.

Glacial Lake Outburst Flood Hazard and Risk Assessment of GYA Lake in the Upper Indus Basin of Ladakh Himalaya using Hydrodynamic Modelling

Jigmet Angmo, Subrat Sharma, Md. Surabuddin Mondal

1 Dept. of Remote Sensing & GIS, Centre for Space Sciences & Allied Subjects (CSS& AS), University of Ladakh, Leh, India

Due to global warming, Himalayan glaciers are retreating rapidly by several metres annually leading to the expansion of glacial lakes and increased risk of glacial lake outburst floods (GLOFs). These changes pose serious threats to downstream communities, highlighting the urgent need for climate adaptation and disaster preparedness. Gya Glacier, in particular, forms a moraine-dammed lake that experienced a significant outburst in 2014. The lake’s area expanded 1.25% in between 2018 to 2024, indicating a gradual increase and sustained hazard potential. To assess this risk, an integrated approach was employed using remote sensing, geographic information systems (GIS), and two-dimensional dam-break modelling with HEC-RAS. Multi-temporal satellite data from Sentinel-2 and High-resolution images were used to monitor changes in lake area, volume, and surrounding land use/land cover. High-resolution topographic data supported hydrodynamic modelling, allowing simulation of flood propagation and identification of vulnerable zones. The simulation revealed that a sudden lake breach could inundate approximately 1.71 ha of agricultural land, 1.28 ha of built-up area, 1.04 ha of fallow land, and 0.06 ha of a national highway. The greatest flood depths and velocities were recorded in the upper reaches due to steep gradients, with major damage concentrated downstream. To mitigate such risks, establishing an early warning system is crucial. This can include installation of Wireless Remote Terminal Units (WRTUs), Automatic Weather Stations (AWS), and GLOF detection systems at the lake site. Key sensors may include radar level sensors for monitoring water levels, and meteorological sensors to track climatic and hydrological changes in real time.

Shallow Water Depth Inversion in Beibu gulf Based on Optical Remote Sensing and Electronic Nautical Charts

Ertao Gao², Zijin Huang¹, Guoqing Zhou², Yanling Lu², Xiang Zhou², Liu Jun², Xia Wang²

¹Guilin university of technology, China,; ²Guangxi Key Laboratory of Spatial Information and Geomatics, China

Rapid and accurate acquisition of the bathymetry of large-scale nearshore shallow sea is of great significance for coastal economic development, safe navigation of ships and coastal ecological protection. Beihai and Fangchenggang of the Beibu Gulf of Guangxi as the research area. Three inverse algorithms are firstly using for the bathymetric inversion experiments, which are one-band model, two-band-ratio model and multi-band-combination model, based on Landsat-9 images and electronic chart data. After that these three inverse algorithms of water depth are compared and then analyse the accuracy of the bathymetric inversion between the unzoned and zoned ones. The results of the experimental results that the multi-band-combination model exhibit the highest inversion accuracies in both experimental areas among the MAE and RMSE are 1.3843 m and 1.7611 m in Beihai and that of Fangchenggang is 1.8609 m and 2.4599m; following the bathymetric stratification, the average weighted errors of water depths are reduced, which mean MAE and RMSE reduced in the Beihai region by 0.6414 m and 0.8031 m and the mean MAE and the RMSE decreased by 1.6788 m and 1.9163 m The multiband combined regression model had a superior effect after the bathymetric layered inversion.

Global Assessment of Total Water Storage Variability and Trends (2002–2025) Using Multi-Source GRACE Data and Uncertainty Analysis

farzam Fatolazadeh¹, kalifa Goïta¹, Shusen Wang²

¹CARTEL, Département de Géomantique appliquée, Université de Sherbrooke, Sherbrooke, Québec J1K 2R1, Canada; ²Canada Centre for Remote Sensing, Natural Resources Canada, Ottawa K1A 0E4, Canada

Monitoring global water storage dynamics is essential for understanding the impacts of climate change on hydrological systems. The Gravity Recovery and Climate Experiment (GRACE) and its successor GRACE Follow-On (GRACE-FO) missions have provided a unique opportunity to quantify Terrestrial Water Storage (TWS) variations at large spatial and temporal scales. However, differences among GRACE solutions from various processing centers, such as CSR, JPL, and GFZ, can lead to uncertainties that must be carefully assessed for reliable interpretations (Wang and Li, 2016).

This study aims to provide a comprehensive analysis of global TWS changes from 2002 to 2025 by integrating multiple GRACE-derived TWS products. Spatial trends of TWS were calculated to identify regions and countries experiencing significant water gain or depletion. Furthermore, monthly TWS variations were extracted to construct time series for individual countries, enabling the detection of long-term hydrological patterns and seasonal fluctuations. An uncertainty assessment was also performed to evaluate the robustness of the estimated trends and temporal variations.

Integrating Remote-Sensing driven SWAT Modelling and Community Perceptions to Assess Water Availability Across Elevation Gradients of Mount Kilimanjaro

Ehsan Khalefa, Nicholas Pepin, Richard Teeuw

University of Portsmouth, United Kingdom

Mount Kilimanjaro, an East African water tower, is undergoing hydro-climatic and land use changes with uncertain impacts on water availability along its elevation gradient. This ongoing study integrates satellite remote sensing, physically based hydrological modelling, and community knowledge to characterise spatial patterns of water availability and compare them with local experiences. Land use and land cover (LULC) are mapped using the European Space Agency (ESA) WorldCover 10-m product; vegetation dynamics are analysed with leaf area index (LAI); and climate forcings are derived from Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS) precipitation and ECMWF Reanalysis v5 (ERA5) temperature. We implement the Soil and Water Assessment Tool (SWAT) to simulate water yield by elevation band, in the absence of streamflow, model evaluation uses independent remotely sensed constraints from the Global Land Evaporation Amsterdam Model (GLEAM) evapotranspiration (ET) and ESA Climate Change Initiative (CCI) soil moisture. Semi-structured interviews and surveys across three elevation zones capture perceived change and adaptation strategies. Preliminary analyses indicate heterogeneous trends, with the largest declines in lowland catchments and more variable responses at mid- and high elevations. Ongoing work will quantify uncertainties (forcings/LULC/parameters) and translate findings into elevation-specific measures for climate-resilient water planning.

Using the SWOT KaRIn Sensor to Retrieve Lake Ice and Overlying Snow

Hayley Kavanagh¹, Grant Gunn¹, Claude Duguay²

¹University of Waterloo; ²H2O Geomatics

This research focuses on exploring the capabilities of the SWOT satellite mission’s Ka-band Interferometric Radar (KaRIn) sensor for retrieving lake ice and overlying snow properties. SWOT KaRIn Version D Pixel Cloud Data Products are compared to in-situ snow and ice measurements on Łù'àn Män (Kluane Lake) during the Calibration and Validation phase that took place over a three month period in 2023. The Snow Microwave Radiative Transfer (SMRT) model is used to simulate backscatter for varying snow and ice scenarios to better understand variances in observed backscatter across the lake. Optical satellite acquisitions are also utilized to extract and compare backscatter to surface reflectance to analyze seasonal lake ice phenology trends. Preliminary results indicate that KaRIn-retrieved heights are inconsistent during the winter season. Additionally, the contrast in backscatter for ice and open water allow for effective ice cover mapping. During the winter season, backscatter values exhibit a general negative pattern, with SMRT simulations indicating a correlation to snow cover variability.

Applicability of Landsat Products for Estimation of Water Clarity in Finger Lakes, New York

Rabia Munsaf Khan, Bahram Salehi

State University of New York, College of Environmental Science and Forestry, United States of America

This study investigates the use of Landsat data for monitoring water clarity, expressed as Secchi Disk Depth (SDD), across the Finger Lakes region in New York. SDD, a long-established indicator of water clarity, is measured using a Secchi disk and widely applied in limnological research. Recent advances have enabled remote sensing-based estimation of SDD, with Landsat imagery frequently used alongside band ratios to mitigate atmospheric effects. Cloud-computing platforms such as Google Earth Engine (GEE) further support large-scale water clarity assessments by providing accessible Top-of-Atmosphere (TOA) and Surface Reflectance (SR) products.

The study uses citizen-science SDD measurements from the NY-DEC CSLAP program (2017–2023) across all 11 Finger Lakes. Corresponding Landsat 8 TOA and SR reflectance values are extracted from GEE using a 3×3 mean around sampling points and filtered for clouds and shadows. A Random Forest model is trained using both original bands and band ratios to estimate SDD under multiple evaluation schemes, including 80:20 train–test splits and 5-fold cross-validation with both random and stratified sampling.

Results show that stratified sampling yields more reliable predictions due to variability among lakes, and TOA performs slightly better than SR in this case. Feature-importance analysis indicates consistent influential band ratios across products. The study provides the first Landsat-based assessment of water clarity for all Finger Lakes and supports improved understanding of water quality trends in these socioeconomically important freshwater systems.

Spaceborne bathymetry using SAR and water level data

Francescopaolo Sica, Maria Paula Bustos Moreno, Michael Schmitt

University of the Bundeswehr Munich, Germany

This work presented a data-driven and scalable approach for performing inland water bathymetry by integrating SAR-derived shoreline dynamics with water-level observations. The method leverages the high temporal resolution of Sentinel-1 imagery and diverse water-level data sources to infer relative elevation and uncertainty estimates.

By exploiting non-uniform sampling theory and regression-based interpolation, the method establishes a foundation for automated, reproducible bathymetry using globally accessible data. Future work will address error modeling and validation against high-resolution reference datasets.

Three-Decadal Sea Level Rise in the East China Sea: the Facts and Causes

Fengwei Wang, Yunzhong Shen

Tongji University, People's Republic of China

Based on the integration of multisource satellite observations, including GRACE/GRACE-FO gravimetry, altimetry, steric, and sediment datasets, this study provides a comprehensive analysis of sea level changes and their driving mechanisms in the East China Sea (ECS) over the periods 1993–2022 and 2002–2022. The findings reveal that the regional mean sea level rise is predominantly driven by manometric changes (mass addition), contributing approximately 87% (3.06 mm/yr during 2002–2022), while steric effects account for only about 12.6%. A pivotal discovery is the critical role of substantial sediment deposition from major rivers like the Yangtze. This deposition introduces a net bias of –0.35 mm/year in GRACE-derived mass trends, and correcting for this "sediment effect" is proven essential for accurately closing the regional sea level budget. Decadal analysis further reveals significant variability: the ECS sea level rise rate was notably high at 6.51 mm/year (1993–2002), sharply decreased to 2.45 mm/year (2003–2012) primarily due to a strong negative thermosteric contribution (–1.53 mm/year), and subsequently recovered to 4.19 mm/year (2013–2022). At the seasonal scale, annual variations are dominated by steric effects, whereas semiannual signals are primarily controlled by manometric changes. This study successfully demonstrates that the ECS sea level budget can be closed within uncertainty when sediment corrections are applied, providing a robust methodological framework that is highly applicable to other sediment-rich coastal regions globally for improved sea level budget assessment.

Deep Learning-based Feature Importance Evaluation for Pan-Arctic Sea Ice Concentration Mapping

Mabel Grace Heffring, Lincoln Linlin Xu

Department of Geomatics Engineering, University of Calgary, Alberta, Canada

Accurate, timely, and explainable Pan-Arctic sea ice concentration (SIC) maps are essential for climate change studies, Arctic sea route navigation, and climate adaptation of Northern communities. Every day, a large amount of active and passive microwave satellite imagery are collected by remote sensing systems over the Pan-Arctic region, including Synthetic Aperture Radar (SAR) from the RADARSAT Constellation Mission (RCM) and Sentinel-1, and Passive Microwave (PM) radiometry from the Advanced Microwave Scanning Radiometer 2 (AMSR2). While advanced DL-based data fusion models leverage extensive SAR and PM imagery to produce high-resolution SIC estimates, their decision making process is opaque and difficult to interpret. This study provides the first feature importance evaluation of SAR and PM inputs to improve the efficiency and transparency of using an advanced Transformer architecture for Pan-Arctic SIC mapping during the melting season.

Assessment of deep learning segmentation algorithms for lake ice cover retrieval from dual polarization SAR imagery

Yuhao Wu¹, Claude R. Duguay^1,2, K. Andrea Scott³, Justin Murfitt², Bin Li⁴, Samuel Johnston²

¹Department of Geography and Environmental Management, University of Waterloo, Canada; ²H2O Geomatics Inc., Kitchener, Canada; ³Department of Mechanical and Mechatronics Engineering, University of Waterloo, Canada; ⁴School of Environmental and Spatial Informatics, China University of Mining and Technology, Xuzhou, China

This study evaluates the performance of five deep learning (DL) segmentation algorithms for retrieving lake ice cover from dual-polarization Sentinel-1 SAR imagery. Lake Hazen, located in the Canadian High Arctic, was selected as a representative site due to its strong climate sensitivity and variable ice conditions. A six-year dataset (2015–2021) comprising over 1,100 dual-polarization EW-mode SAR images was used to train and validate U-Net, U-Net++, SegFormer, DeepLab v3+, and PSPNet models. Binary ice–water labels were manually annotated to support model development. Temporal cross-validation using independent test years (2015, 2018, and 2021) was conducted to assess model generalization across different ice phenology periods, including ice-on, break-up, ice-free, and freeze-up phases. Results show that all models achieved high accuracy (>98% overall accuracy) during stable ice and open-water periods, while segmentation performance decreased during freeze-up due to mixed ice-water backscatter signatures. Visual analysis confirmed that each architecture successfully captured the spatial distribution of lake ice, though some misclassifications were observed in noisy or low-backscatter regions. The findings demonstrate the potential of segmentation-based DL models for automated lake ice monitoring and highlight the need for further model refinement to improve performance during transitional periods. Future work will extend the framework to additional lakes and multi-year datasets to enhance operational monitoring of lake ice.

Evaluating the Surface Water and Ocean Topography Mission for Inland Water Monitoring: A SWOT Framework Review

Saba Hosseini Tabesh¹, Shusen Wang², Dongmei Chen³

¹Queen's University, Canada; ²Natural Resources Canada; ³Queen's University, Canada

The Surface Water and Ocean Topography (SWOT) mission

represents a major advance in Earth observation by providing the

first global two-dimensional measurements of surface water

extent and elevation. Its potential for hydrology, climate

monitoring, and water resource management is widely

recognized; however, recent studies indicate that its performance

varies across hydrological contexts.

This study presents a review of SWOT’s capabilities for inland

water monitoring based on a synthesis of published validation

studies, simulation experiments, and case applications. To

support a structured interpretation of these findings, a Strategic

Assessment Framework (SAF) is applied. The SAF is an

analytical framework that organizes the evaluation across four

components: strengths, limitations, opportunities, and risks,

enabling a systematic comparison of SWOT performance under

different environmental and observational conditions (Figure 1).

For large rivers and lakes (≥1 km²), SWOT meets its design

accuracy targets (Bazzi et al., 2025). However, in fragmented

wetlands and narrow channels, retrieval errors increase

significantly, with reported RMSE values of 30–70 cm in

simulation studies (Bergeron et al., 2020). Environmental

heterogeneity, including shoreline complexity and wind-induced

surface roughness, further increases uncertainty in elevation

retrieval (Bergeron et al., 2020), while vegetation and turbidity

reduce water–land separability and limit effective pixel

availability (Frasson et al., 2021).

The SAF highlights performance variability and identifies the

role of multi-sensor integration (Sentinel-1/2, Landsat, Planet

Scope) in improving the reliability of SWOT-based inland

water monitoring

Comparative Analysis of Spatiotemporal Trends in Arctic SST and SIC from Two Reanalysis Datasets

Chaeyun Kim¹, Sungwoo Park¹, Jongho Woo¹, Seungwon Kim¹, Seungkyoo Lee¹, Huiji Yu¹, Kyung-Soo Han²

¹Division of Earth Environmental System Science (Major of Spatial Information Engineering), Pukyong National University, Korea, Republic of (South Korea); ²Professor, Pukyong National University, Korea, Republic of (South Korea)

Accurate monitoring of the Arctic Marginal Ice Zone (MIZ) is critical due to rapid Arctic Amplification. This study evaluates discrepancies between two widely used Level-4 reanalysis datasets—NOAA OISST and CMEMS L4 Arctic Ocean—over the Arctic (>58°N) from 1988 to 2022, specifically focusing on the MIZ (SIC 0–50%). After spatial reprojection to a common 0.25° grid, the comparison revealed significant discrepancies, particularly in the transition zone (SIC 15–50%). While both datasets exhibit long-term warming, CMEMS-L4 shows a much stronger warming trend (+1.173°C/decade) compared to OISST (+0.215°C/decade). This divergence is primarily attributed to algorithmic disparities: CMEMS-L4 incorporates Ice Surface Temperature (IST), resulting in higher variability, whereas OISST relies on proxy SST estimates. Crucially, a distinct temporal discontinuity was identified in OISST around 2005, coinciding with a change in its sea ice input source from NASA to NCEP. This structural break caused abrupt shifts in SIC values and even resulted in contradictory cooling trends in parts of the Greenland Sea, whereas CMEMS-L4 indicated widespread warming. These findings highlight that data processing methodologies induce non-negligible uncertainties. We recommend caution when utilizing OISST for long-term analysis in the MIZ due to its 2005 discontinuity.

Using Pre- and Post-fire Airborne Laser Scanning Data to Determine Biomass Loss due to Combustion during the 2022 Chetamon Fire in Jasper National Park, Alberta, Canada

Natalie Krizan¹, Laura Chasmer², Raphaël Chavardès³, Richard Fournier⁴, Christopher Watson⁵, Christopher Hopkinson²

¹University of Lethbridge, Canada; ²Western University; ³Canadian Forest Service - Natural Resources Canada; ⁴Université de Sherbrooke; ⁵Parks Canada

Decades of fire suppression and exclusion in Jasper National Park (JNP), Alberta, Canada have altered forest conditions. Previous plot-level fire history analyses indicate a mixed-severity fire regime was disrupted after 1915 (Chavardès et al., 2018). Biomass (fuel) has accumulated, and stand connectivity and homogeneity have increased (Chavardès & Daniels, 2016). Furthermore, a mountain pine beetle epidemic has killed a significant portion of lodgepole pine within the park, shifting biomass distribution from the canopy as needles and branches drop (Talucci & Krawchuk, 2019) Under these conditions, fires can burn more intensely, with more high severity impacts, including substantial biomass loss (Hagmann et al., 2021; Harris & Taylor, 2015; Kreider et al., 2024). Understanding how altered fuel structures correspond to biomass loss is important for predicting future fire impacts, and informing forest management decisions (Schoennagel et al., 2004).

The 2022 Chetamon Fire in JNP provides an opportunity to study biomass loss using available pre- and post-fire airborne laser scanning (ALS) data. Fuel structures are determined following LidarForFuel protocol (Martin-Ducup et al., 2024). Pre- and post-fire outputs are differenced to determine spatial variability of biomass loss. Pre-fire ALS is further used to map pre-fire environmental conditions that influence fire intensity, and thus, biomass loss. This includes topography characteristics, and forest metrics such as density (Kane et al., 2007; Parks et al., 2012). These factors are analyzed as predictor variables of biomass loss in Random Forest analyses.

Evaluating fuel structure modeling from high- and low-density airborne lidar in northern boreal forests

Emliy Jones¹, Laura Chasmer², Chris Hopkinson², Richard Fournier³

¹University Of Lethbridge, Canada; ²University of Western Ontario, Canada; ³Université de Sherbrooke, Canada

Warming air temperatures and prolonged periods of drought have increased fuel availability and fire activity across northern boreal forest regions. Modelling fuel structures, such as canopy fuel load, vertical distribution and spatial connectivity, is important for providing inputs in fire behavior models, as well as furthering our understanding of the environment. The overall aim of the project was to determine the efficacy and accuracy of three standard fuel modelling methodologies at high- (>30 pt/m2) and low- (<10 pt/m2) point densities and resolutions (5m, 10m, 20m, and 30m) in a dense forested environment near Fort Simpson, Northwest Territories. All metrics are compared to fuels measured in situ. This study highlights both the potential and limitations of scalable lidar-based fuel mapping and can help inform management practices, fire behavior applications, and future operational fuel hazard-mapping and risk-mitigation strategies.

Improving Geospatial Data Quality Through Errors Propagation in Survey and Mapping Processes

Qassim Abdullah

Woolpert, inc., United States of America

A precise evaluation of positional uncertainty is crucial to maintaining the reliability of geospatial data, as well as supporting high-quality outcomes in professional surveying and mapping projects. This paper thoroughly examines the origins of error and the statistical and geodetic principles underlying accuracy assessment for technologies such as photogrammetry, airborne LiDAR, and mobile mapping systems. Building on these foundations, the study outlines a robust, methodical framework that enables practitioners to rigorously quantify the positional accuracy of their geospatial products. The approach is aligned with the most recent edition of the ASPRS Positional Accuracy Standards for Digital Geospatial Data, ensuring compliance with current industry benchmarks.

Integrating High Resolution Aerial Imagery and Digital Elevation Models for Vertical Stratification of Rooftop Vegetation

Frances Leung, Yuhong He, Adam Martin, Marney Isaac

University of Toronto, Canada

Urban green spaces including green roofs, parks, urban forests, community gardens and private green spaces are integral to city landscapes, offering ecosystem services and enhancing urban aesthetics. By leveraging data captured from satellite or aerial imagery, spectral analysis using indices such as Normalized Difference Vegetation Index (NDVI) enables effective mapping of vegetated surfaces in such urban green spaces. However, topographic views alone present certain limitations in this context, particularly for applications requiring the differentiation of vegetation based on vertical stratification. This study presents a novel approach that enables two-dimensional (2D) and three-dimensional (3D) visualization of rooftop vegetation using a combination of multispectral and digital elevation data.

An Evaluation of Methods for using LiDAR to obtain Depth of Burn Measurements from Wildland Fires in the Boreal Forest

Anna L. Magistrale¹, Mark C. De Jong², Koreen Millard¹

¹Carleton University; ²Natural Resources Canada

Canada's boreal forest accounts for 28% of the world's boreal forest ecosystem and is a large carbon sink. Under climate change, the severity and frequency of wildland fires in this area is increasing. This is resulting in large amounts of carbon being released in to the atmosphere, affecting the rate at which climate change occurs. LiDAR is being used more frequently for studying wildland fires and has shown some success in measuring fuel consumption, providing insight into the amount of carbon emitted. This research aims to refine the methods used to process LiDAR data collected before and after a fire in the boreal forest. Different ground point filtering algorithms, methods of spatial alignment, downsampling values and DTM resolutions are explored. Findings demonstrate how the choice in data processing can influence how well LiDAR-based DoB estimates agree with field-based observations and highlight considerations to be accounted for in similar future work.

On the importance of ground validation and methodology for wetland mapping in Canada

Armand LaRocque¹, Brigitte Leblon¹, Alan Hanson²

¹Lakehead University, Canada; ²Canadian Wildlife Service, ECCC, Canada

In this study, we compared existing national wetland maps with ground-truth polygons in four areas of interest located in Eastern Canada. By comparing the methods used for each map, we identified important elements to consider when producing a wetland map using remotely sensed data: 1) the five Canadian Wetland Classification System (CWCS) classes (bog, fen, swamp, marsh, shallow water) are broad and can create spectral confusion. It is preferable to use wetland subclasses and then merge them into the broad classes. 2) It is important to add SAR imagery to the classification, given that this imagery can detect many wetland characteristics related to the site's wetness and vegetation structure. 3) Ancillary data such as DEM, topographic metrics, and canopy height model are a valuable addition to the classification. 4) It is recommended to use multi-seasonal images to consider the seasonal and temporal variation in the vegetation phenology and in both surface and groundwater levels. 5) Images used should have a spatial resolution small enough to have a minimum mapping unit to be able to detect small landscape features; and 6) it is recommended to have a dense network of ground-truth sites representative of the AOI. Our study showed that mapping wetlands at the scale of Canada is very challenging, due in part to the diversity of wetland types, which complicates the definition of standardised wetland classes, as well as to the logistical challenges related to obtaining data at the Canadian scale.

Using the Sentinel Missions to Build a Validated Iceberg Database

Joseph Chamberland, Paris W. Vachon, Michael A. Stott

AstroCom Associates Inc, Canada

This presentation will review past and recent progress in iceberg detection from space and motivate the development of a large iceberg database for future testing and comparison of the new detection techniques. The presentation also review work done to leverage ESAs Sentinel missions to build such a database.

Monitoring Crop Phenology and Harvest Timing Using High-Resolution X-Band SAR Imagery in Western Canada Agricultural Systems

Keshav Singh

AGR.GC.CA

test

Multiscale Estimation of Crop Nitrogen Using Integrated UAV and Satellite Multispectral Imaging

Keshav Singh

AGR.GC.CA

test

Accurate and cost-effective forest terrain mapping by integrated SLAM and CLAS positioning

Soki Nishiwaki¹, Yusuke Iuchi¹, Takeshi Hori², Fumio Takeuchi³, Mika Takiya³, Takanori Emaru⁴

¹Graduate School of Engineering, Hokkaido University; ²Industrial Research Institute, Hokkaido Research Organization; ³Forestry Research Institute, Hokkaido Research Organization; ⁴Faculty of Engineering, Hokkaido University

This contribution presents a practical workflow for accurate and cost-effective forest terrain mapping in Japanese forests using a UAV equipped with low-cost LiDAR and GNSS. Instead of relying on a local reference station, we exploit the Centimeter-Level Augmentation Service (CLAS) of the Quasi-Zenith Satellite System "Michibiki" and integrate it with LiDAR-based SLAM to obtain dense terrain information with absolute coordinates.

In the proposed pipeline, LiDAR odometry estimated by FAST-LIO is aligned with the CLAS-based GNSS trajectory and fused in a pose graph on SE(3). The resulting optimization problem is solved in GTSAM using prior, odometry, and GNSS position constraints to compensate for the drift that accumulates when SLAM is used alone during large-scale flights.

Field experiments were conducted in real forest environments on multiple days and flight routes using a UAV-LiDAR system. Ground control points measured by post-processed kinematic GNSS were used as references to evaluate mapping accuracy. The results show that the integrated optimization reduces horizontal drift and improves terrain reconstruction to sub-metre accuracy, while keeping the system setup simple and low cost. The proposed approach is a promising option for operational forest surveys and other environmental applications that require frequent, wide-area terrain monitoring.

Comparative Assessment of Low-Cost SLAM-Based Scanners for Indoor Surveying Applications

Luca Viola, Marica Franzini, Vittorio Casella

University of Study of Pavia, DICAr, Laboratory of Geomatics, Italy

This abstract, authored by researchers from the University of Study of Pavia, DICAr, Laboratory of Geomatics, presents a comparative analysis of the geometric quality and cloud noise of four SLAM scanners. The study compares systems from different price points

Geo-Visual Fusion: An Enhanced Strategy for Drone Object Detection Based on High-Definition Map Context

Shenman Zhang, Qingshan Peng, Mengmeng Duan, Sheng He, Wei He, Yongsheng Yu, Xiaojun Fu, Wei Jiang, Jie Song, Ziliang Zhang, Lin Wen, Xiao Liu

Wuhan Geomatics Institute, China, People's Republic of

Current deep learning models for UAV object detection often suffer from "context-blindness," leading to high false positives (logical fallacies, like misidentifying building features as vehicles) and low-confidence false negatives for occluded objects. To address this, this paper proposes the innovative Geo-Visual Fusion (GVF) enhancement strategy, which leverages the rich, deterministic geo-spatial prior knowledge embedded within High-Definition (HD) city maps.

The GVF approach is implemented as a lightweight, plug-and-play framework featuring a Geo-spatial Contextual Reasoning (GCR) Module. First, a Real-time Geo-spatial Registration module accurately projects initial 2D detections onto the city's unified geographic coordinate system using UAV GPS/IMU data and camera parameters. The GCR Module then performs two key functions: Logical Error Elimination, which uses a Semantic Compatibility Matrix to suppress detections that violate real-world spatial constraints (e.g., vehicles detected on building facades); and Low-Confidence Boosting, which employs a Bayesian approach to significantly raise the confidence scores of reasonable detections located in compatible geo-spatial contexts (e.g., partially occluded vehicles on a road).

Validated on a high-resolution urban dataset, the proposed framework (Baseline + GCR) consistently demonstrates improved mean Average Precision (mAP), successfully eliminating geographically implausible false positives and enhancing the True Positive Rate for low-confidence targets. This method offers a practical solution to transition from purely data-driven feature matching to context-aware semantic understanding in urban aerial perception.

Evaluating Gaussian Splatting Maps for Absolute Visual Localization of UAVs

Felix Rocco Matzke, Hans-Gerd Maas, Anette Eltner

Institute of Photogrammetry and Remote Sensing, TUD Dresden University of Technology, Germany

Localization within a global reference frame is critical for the safe operation of UAVs. It is typically realized through GNSS measurements, however when signals are jammed, spoofed, occluded or reflected, this approach can lead to errors or fail. As most UAVs are equipped with cameras, absolute visual localization using georeferenced map representations offers a promising alternative. The recent invention of Gaussian Splatting introduces new opportunities for this task, leveraging real-time rendering from novel views to establish 2D-3D correspondences for pose estimation. In this work, we investigate the use of Gaussian Splatting maps for absolute visual localization of UAVs with a particular focus on geometric accuracy and its impact on the accuracy of position estimation. Through experiments with real-world data, we show that an initialization with dense Structure from Motion point clouds does not improve geometric accuracy compared to sparse initialization under the current training scheme. Additionally, constraining the position optimization of Gaussian Splats shows potential for improved pose estimation but introduces challenges during training. Despite these limitations, our results demonstrate the feasibility of Gaussian Splatting-based absolute visual localization for UAVs.

Multispectral Drone-in-a-box System – Geometric System Calibration and Validation

Niko Koivumäki, Juha Suomalainen, Teemu Hakala, Eija Honkavaara

Finnish Geospatial Research Institute, Finland

Uncrewed Aerial Systems (UAS, drones) are rapidly evolving technologies, with growing expectations for fully autonomous operations, enabling flights without onsite human control and Beyond Visual Line of Sight (BVLOS). A recent innovation is technology of ‘Drone-in-a-Box’ (DiaB) a.k.a. drone docks. DiaB systems provide an automated solution that integrates robust drones hosted in weather-resistant docks with typically also with cloud integration to data processing. Such connectivity enables utilization of real-time data products using both onboard and cloud processing workflows. This combination of robotics, AI, and data management holds the potential to deliver significant breakthroughs across diverse application scenarios. Objective of this study is to calibrate and assess the geometric performance of a novel multispectral (MS) DiaB system for environmental monitoring applications. The results indicated that the MS DiaB system delivers reliable performance without ground control points. For applications requiring cm-level accuracy, the post-processed georeferencing workflow was essential, whereas the direct georeferencing approach provided adequate accuracy for many operational scenarios. Our future work will extend this methodology to environmental applications.

Enhancing Vision-Based Perception in Autonomous Driving: YOLO11–DETR Integration with Selection Model

Ahmed M. Reda^1,2, Naser El sheimy¹, Adel Moussa^1,3

¹Dept. of Geomatics Engineering, University of Calgary, Canada; ²Dept. of Geomatics Engineering, Benha University, Benha, Egypt; ³Dept. of Electrical and Computer Engineering, Port-Said University, Port-Said, Egypt

This study investigates cross-domain generalization, adaptation behavior, robustness under visual degradation, and adaptive model selection for image-based object detection in autonomous driving scenarios. Two state-of-the-art detectors, YOLO11 and RT-DETR, are analyzed due to their complementary architectural paradigms, representing convolutional and transformer-based approaches, respectively.

The proposed framework consists of four stages: (1) zero-shot evaluation of COCO-pretrained models on the KITTI dataset to assess domain shift, (2) fine-tuning under short and extended training regimes to analyze adaptation dynamics, (3) robustness evaluation using synthetically degraded images simulating real-world perception challenges, and (4) the development of an image-based selection model for adaptive detector arbitration.

Experimental results show that YOLO11 demonstrates stronger zero-shot generalization and faster early adaptation, while RT-DETR achieves higher performance after extended fine-tuning, indicating superior long-term representation capacity. Under visual degradations, model performance varies depending on distortion type and training regime, confirming that no single detector consistently outperforms the other.

To address this, a lightweight selection model based on image quality features (brightness, blur, entropy, and edge density) is proposed to select the most suitable detector per image. The results demonstrate consistent performance improvements over individual models, achieving higher mAP without increasing computational cost.

This work highlights the effectiveness of adaptive, context-aware perception pipelines and demonstrates that exploiting model complementarity is a practical strategy for improving robustness in real-world autonomous driving systems.

From Image Space to Geospatial Space: A Camera Calibration Methodology for Video-Based Traffic Monitoring

Danesh Shokri^1,2, Christian Larouche^1,2, Saeid Homayouni³

¹Laval University, Canada; ²Centre de Recherche en Données et Intelligence Géospatiales (CRDIG), Université Laval, Québec, Canada; ³Centre Eau Terre Environnement, Institut National de la Recherche Scientifique, Québec, Canada

This paper presents a novel methodological framework for georeferenced traffic monitoring that bridges the gap between image-based vehicle detection and geospatial analysis. Traditional video-based traffic monitoring systems operate exclusively in image space, limiting their utility for applications requiring physical measurements and integration with geospatial datasets. We address this limitation by developing a comprehensive camera calibration approach that leverages readily available geospatial data including smartphone video, drone-derived orthophotos, and 3D point cloud data.

The methodology establishes precise mathematical relationships between image coordinates and real-world geographic coordinates through a hierarchical calibration algorithm for camera parameter estimation. Ground control points are strategically selected from orthophoto and point cloud data, emphasizing features that are precisely identifiable and geometrically advantageous for calibration. The framework enables transformation of image-space vehicle detections to geographic coordinates, facilitating physical measurements, spatial analysis, and direct comparison with simulated traffic data.

Experimental results demonstrate the effectiveness of our approach, achieving a mean reprojection error of 2.94 pixels across calibration points. A case study of multi-lane traffic monitoring showcases the practical utility, where vehicle detections are successfully transformed from image to geographic coordinates, enabling lane-specific traffic analysis and potential integration with traffic simulation models. The proposed methodology offers a robust workflow for urban planning by connecting conventional video surveillance with geographic information systems, using only commonly available data sources and equipment, making it accessible for widespread implementation in intelligent transportation systems.

Evaluation of ICP variants for point cloud/BIM alignment enabling Scan-vs-BIM comparison: Application to maritime construction tolerance verification

Florie Becker¹, Hélène Macher¹, Tania Landes¹, Yvon Kila²

¹Université de Strasbourg, CNRS, INSA Strasbourg, ICube Laboratory UMR 7357, Photogrammetry and Geomatics Group, 67000, France; ²Ferrcad, 450 Rue Baden Powell, 34 000 Montpellier, France

Reliable geometric verification is essential in the construction industry, particularly for large-scale maritime infrastructures where deviations can critically affect functionality and safety. The emerging Scan-vs-BIM approach enables automated quality assessment by comparing as-built point clouds with as-designed BIM models. It allows evaluation of the entire structure, rather than just specific points, but relies heavily on accurate spatial registration. This paper presents an evaluation of several Iterative Closest Point (ICP) variants for fine registration within a Scan-vs-BIM framework dedicated to construction tolerance verification. Three ICP variants are compared in terms of convergence behavior, robustness to noise, and stability using synthetic point clouds derived from maritime structures. The methods are then tested on real datasets, each acquired under different conditions, leading to varying data quality. Based on the results, a hybrid method is proposed to improve registration reliability. The results show that the proposed approach improves the inlier rate by 8–9% while reducing the mean deviation by approximately 1 cm on the noisiest datasets, compared to the classical point-to-plane ICP.

Automatic Generation of LoD3 Building Models for High-Density Cities: A Case Study of Hong Kong using Multi-Source Data and an Adaptive Strategy

Guoxin Si¹, Wei Yao^2,3

The automatic generation of detailed Level of Detail 3 (LOD3) building models (Gröger et al., 2012), featuring wall surface features such as windows, doors, and balconies, remains a significant challenge within urban 3D modeling. This challenge is particularly pronounced in high-density urban environments like Hong Kong, where complex building geometries, severe data occlusion from dense high-rise structures, and diverse architectural styles collectively create exceptionally difficult conditions for automated processes. In response to these challenges, this study proposes and develops a novel, adaptive workflow designed to efficiently generate semantically rich and geometrically accurate LOD3 models. Our methodology leverages multi-source data, including a large-scale repository of existing LOD2 models, Airborne Laser Scanning (ALS) data, and Mobile Laser Scanning (MLS) data, to overcome the limitations of any single data source.

Towards Automated 3D BIM Reconstruction of Existing Industrial Buildings from Point Cloud Data

Patricia González Cabaleiro, Muataz S.A. Albadri, Antonio Fernández, Lucía Díaz-Vilariño

CINTECX, Universidade de Vigo, GeoTECH Group, 36310, Vigo, Spain

This paper presents a methodology for automated semantic segmentation and 3D reconstruction of industrial building elements from unstructured point clouds. It addresses components such as roof panels, floors, rafters, purlins, and columns by combining orientation-based filtering, projection onto characteristic planes, morphological analysis, and optimization-based I-profile fitting. The workflow includes preprocessing with axis alignment and outlier removal, surface-orientation-based subdivision, contour extraction from binary projections, and automatic estimation of roof slopes and panel inclinations to guide structural reconstruction. The approach provides a systematic framework for precise digital modeling of industrial buildings, enabling efficient structural analysis, documentation, and planning.

Foundation Model-Based Pipeline for 3D Damage Localization in Built Infrastructure

Zhiya Yang, Roberto de Lima-Hernandez, Maarten Vergauwen

KU Leuven, Belgium

Accurate damage localization is essential for infrastructure inspection, but conventional segmentation methods rely on dense pixel-level annotations that are costly to obtain and difficult to scale. This paper presents a foundation model-based pipeline for data-efficient damage localization in built infrastructure. The proposed workflow combines DINOv3 features for image-level classification, Grad-CAM for weak localization, and the Segment Anything Model (SAM) for prompt-guided pixel-level segmentation. The resulting masks are further transferred into 3D space for spatially contextualized visualization.

The pipeline is evaluated on two case studies. On a subset of Sewer-ML, three representative sewer defect classes are used to compare pretrained backbones and to qualitatively assess downstream localization. The DINOv3-based classifier achieves a higher average F2-score than a Google ViT baseline, reaching about 0.72 versus 0.64. On a custom historic masonry dataset, the method is quantitatively evaluated for material-loss segmentation using manually annotated test masks. The proposed heatmap-guided prompting strategy achieves a mean Dice score of 0.69 and a mean IoU of 0.53, while the classification stage reaches an F2-score of 0.99. A proof-of-concept experiment further demonstrates that segmented damage regions can be visualized within a larger local 3D scene. Overall, the results show that the proposed foundation-model based pipeline can support data-efficient and spatially meaningful damage localization across different infrastructure domains.

3D Point Cloud from Close-Range Photogrammetry for Defect Characterization of Rubberized Concrete

Jiacheng Liu¹, Mohammed Alnahhal¹, Ailar Hajimohammadi¹, Sara Gonizzi Barsanti², Jinling Wang¹, Mohsen Kalantari¹

¹UNSW Sydney, Australia; ²Università degli Studi della Campania Luigi Vanvitelli, Italy

3D point clouds have been widely used in civil engineering, providing comprehensive geometric data for structural health monitoring, scene understanding, surface defect assessment, and more. However, the mainstream point cloud data acquisition sources, i.e., TLS and MLS, are superior for large-scale scene understanding and analysis but challenging for fine-scale analysis, particularly in laboratory testing, due to their low resolution.

This study proposes a close-range photogrammetry-based workflow for the 3D reconstruction and visual inspection of rubberised concrete (RuC) beams in an indoor-lab environment. High-resolution image sets were captured with both a Canon 5D Mark IV DSLR camera and an iPhone 14 Pro Max, and 3D models were generated in Agisoft Metashape. The comparison between reconstructed models revealed that the DSLR-based reconstruction achieved sub-millimetre resolution and texture, demonstrating satisfactory performance for fine-scale surface monitoring. An RGB-guided crack extraction method was developed to enhance the identification of surface defects to isolate the potential crack area from the background. The extracted crack regions were visually distinguishable and provided a well-structured geometrical representation of defect morphology. Furthermore, a before-and-after deformation analysis was conducted, which provides a sub-millimetre level comparison in different stages. The results confirm that the proposed workflow based on close-range photogrammetry is a flexible, intuitive, and high-resolution alternative to LiDAR-based methods for surface inspection and deformation monitoring in laboratory environmental concrete specimens. This workflow provides another aspect of structural assessment and establishes a foundation for future high-accuracy 3D feature characterisation, which can be integrated with material design and mechanical performance evaluation.

Distributed Scan vs BIM Processing for Automated Geometric Quality Monitoring

Sungha Ju¹, Jiwan Hong², Sehoon Oh², Joon Heo²

¹Conworth, Inc.; ²Yonsei University, Korea, Republic of (South Korea)

This contribution presents a Scan vs BIM–based framework for geometric quality monitoring that integrates large-scale site-acquired point clouds with design BIM models in a distributed processing environment. The approach targets both vertical structural components and complex mechanical, electrical, and plumbing (MEP) systems on active building sites. Large point clouds from terrestrial laser scanners are indexed using an octree structure, while structural columns and MEP objects are extracted from IFC-based BIM and converted into mesh representations that serve as analysis units. For each component, nearby scan points are clipped, filtered, and locally registered to the corresponding BIM mesh to compute horizontal deviations, verticality, and installation discrepancies without assuming specific cross-sectional shapes or component types. The workflow is parallelized across multiple nodes and threads so that the same procedure can be consistently applied to thousands of objects in project-scale datasets. By automating component extraction, point-cloud preprocessing, and deviation calculation, the framework enables quantitative tolerance checks and systematic identification of elements requiring inspection or rework during construction.

6:30pm - 9:30pm

Congress Dinner
Location: Art Gallery of Ontario

Awards Ceremony:

ISPRS Fellow
The Schermerhorn Award
The Schwidefsky Medal
The Eduard Doležal Award
ISPRS Student Consortium Service Award

Date: Thursday, 09-July-2026

8:30am - 10:00am

WG III/1E: Remote Sensing Data Processing and Understanding
Location: 713A

8:30am - 8:45am

Directional Total Least Square for FullWaveform Aerial LiDAR Smoothing

Yining Feng, Debra Laefer, Ivan Selesnick

Tandon School of Engineering, New York University, United States of America

Smoothing aerial LiDAR point clouds is challenging, because they are often noisy, irregularly sampled, and sparse, as well as their inherent high degrees of freedom. Classic methods struggle on such datasets as they were designed for regularly sampled, dense datasets with moderate noise. To address the challenge, this paper proposes a constrained point cloud model with one degree of freedom. The point cloud model incorporates the sensing directions stored in the full waveform LiDAR datasets, and has theoretical advantages in terms of the statistical error bound for normal estimation. Based on the point cloud model, the directional total least square is formulated as a regularized convex optimization problem for points estimation on a tangent plane. Moreover, a non-convex regularizer along with the non-convex regularized directional total least square is proposed to improve the estimation

quality. To solve the proposed optimization problems, an accelerated Douglas-Rachford splitting algorithm is introduced. The proposed methods demonstrate better performances on simulated two-dimensional point clouds in terms of improved root-mean square-

error. For three-dimensional aerial LiDAR point clouds, implemented under the Savitzky-Golay filter framework with local

smoothness prior, the proposed methods demonstrate more smoothing power and robustness than the classic method.

8:45am - 9:00am

Improving Urban Point Cloud Classification Using Dynamic Local Context-Based Point Confidence

Jayati Vijaywargiya, Anandakumar M Ramiya

Indian Institute of Space Science and Technology

Urban mapping for planning and monitoring requires high-resolution spatial data, especially in areas with high landcover diversity. Airborne LiDAR Scanning (ALS) provides accurate 3D point cloud data, but its classification remains challenging due to computational complexity, irregular point distribution, noise, mislabeling and outliers in the dataset. These challenges are amplified in dense urban environments with mixed vegetation and infrastructure. Existing local context-based classification methods consider all points equally, overlooking the impact of their spatial position of the point in the dataset. To address this, we propose a dynamic local context-based point confidence-based optimization that improves classification accuracy by leveraging the spatial context of each point. This approach selects points based on confidence levels derived from position indices in training data and predicted by binary classifiers in test data to enhance robustness of classifier. We evaluated the proposed approach using boosting-based machine learning classifiers on two datasets: Thiruvananthapuram Aerial LiDAR Dataset (TALD) from India and the ISPRS 3D semantic labeling dataset from Vaihingen, Germany. The results showed 90.3% accuracy on TALD and 90.0% on Vaihingen, achieving a 2-4% improvement over conventional local context-based classification.

9:00am - 9:15am

Refinenet: a confidence-aware deep online learning framework to refine real-world point cloud semantic segmentation

Sharath Chandra Madanu¹, Shenglan Du¹, Jantien Stoter¹, Daan van der Heide^1,2

¹3D Geoinformation group, Delft University of Technology, Delft, NL; ²Rijkswaterstaat, Delft, NL

Accurate interpretation and segmentation of 3D point clouds in real-world urban environments is a critical challenge in geospatial analysis, particularly due to the complexity of real-world scenes, inevitable data uncertainties, and potential annotation errors. This paper proposes a confidence-aware deep learning framework to refine the segmentation accuracy of real-world point cloud data. By incorporating multi-source information, such as aerial imagery, and embedding geospatial prior knowledge, this framework models data uncertainty through point-wise confidence scores. Besides, we design an iterative online learning strategy, allowing the network to improve both its predictions and the quality of training labels. Extensive experiments on large-scale airborne laser-scanned data demonstrate that our framework effectively enhances training data by reducing label noise and improving annotation quality, which leads to more robust, generalizable model performance. Our source code is publicly available at https://github.com/AutumnMoon00/RefineNet.

9:15am - 9:30am

A Structured Query Language Approach for processing Smartphone-based LiDAR of Understory Vegetation

Rory Pittman, Baoxin Hu

York University, Canada

LiDAR sensors incorporated within modern smartphone and tablet devices enable relatively quick and inexpensive collection of ground-based LiDAR data applicable for ground truth mapping as needed for modelling understory vegetation. However, this LiDAR data often requires conversion and processing prior to research use. This study presents a workflow with algorithms utilizing structured query language (SQL) to efficiently process detailed rasterized features from LiDAR data collected by an iPhone Pro Max via the ForestScanner app. After transformation of the LiDAR data, SQL has been employed to voxelize the LiDAR data from which rasterized features have been derived. Various cell sizes for voxels and subsequent pixels have been investigated, leading to a recommended spatial resolution of 0.05 m for cell size dimension. SQL provides precise control for advanced querying to process ground-based LiDAR data for vegetational modelling applications.

9:30am - 9:45am

AI Indexing of Aerial LiDAR Point Cloud for Efficient Query

Mohana M, Anandakumar M Ramiya

Indian Institute of Space Science and Technology, Trivandrum, India

In the era of information revolution, with data being the fuel of AI and analytics, efficient information extraction from LiDAR point clouds becomes indispensable for solving real-world problems and aiding decision-making in geospatial domain. Despite having geometric richness, the massive LiDAR point clouds are not only computationally demanding but also lack inherent semantics. The lack of semantics in LiDAR constrains effective data analysis. This paper presents a novel workflow by incorporating Deep Learning derived embeddings as attributes in the geospatial database for the spatio-semantic querying on Aerial LiDAR point clouds. This work leverages AI-based indexing, such as IVFFlat(Inverted File Index with Flat Quantization) on LiDAR point clouds for fast retrieval of queries. The pgPointCloud and pgVector extensions of PostgreSQL aid in importing point clouds into the database and performing similarity-based query retrieval on the embedding space of the point clouds. The methodology developed in this paper explores how semantic embeddings can handle inadequate semantics of point clouds by enabling direct and complex 3D intelligent queries within the database environment, thereby overcoming the limitations of traditional LiDAR representations. Few queries presented in this paper highlight the applications of this proposed framework in individual tree detection, tree species identification, utility management, urban planning and anomaly detection.

9:45am - 10:00am

Intelligent Extraction Method for Geographic Information Feature Based on Human-Machine Collaboration

Hongwei Zhang¹, Jiage Chen², Chenchen Wu²

¹Chinese Academy of Surveying and Mapping, China, People's Republic of; ²National Geomatics Center of China, China, People's Republic of

The development of global geographic information resource products involves massive information processing of PB-level multimodal spatiotemporal data, and faces technical challenges brought by the global scale. In response to the challenges, we have made technological innovations to break through the key technologies for the development of global geographic information data products. With the main themes of "intelligent interpretation of typical elements, multi-source geographic data mining, and intelligent hybrid compilation", we have conducted and completed the overall technical research on the construction of global geographic information resources, formed an autonomous construction capability. Firstly, through crowd-sourced data mining and fusion technology to achieve content information extraction and knowledge fusion; Secondly, using multiple source data features, fast automatic extraction and integration of elements based on deep learning models was processed, and produce digital line graph data based on intelligent hybrid compilation. Based on the automatic feature extraction technology of deep learning, the production of digital line graph data products has been updated, and the accuracy evaluation has reached over 85%.

8:30am - 10:00am

ICWG II/Ia: Autonomous Sensing Systems and their Applications
Location: 713B

8:30am - 8:45am

GCP Deployment and Recognition System based on Light-Marker UAV

Ruikai Li, Yi Wan, Dong Wei, Yongjun Zhang, Ziqian Huang, Xiaotong Li, Changjun Zhu

Wuhan University,China

This paper addresses the heavy reliance on manual operations in control point acquisition for UAV photogrammetry and proposes an encoded control point deployment and recognition method based on a Light-Marker UAV (LMUAV). Conventional approaches rely on manual placement of control points and manual identification and measurement in images for aerial triangulation, resulting in low efficiency. To address this limitation, an LMUAV equipped with an LED array actively broadcasts its positional information as quaternary optical signals. The observing UAV performs coarse localization of the target region by integrating communication priors with the imaging model, followed by light spot segmentation and graph construction within the region of interest (ROI). Node correspondences are then recovered by constructing a template graph and an observation graph and applying Reweighted Random Walks (RRWM) graph matching. The matching robustness is further enhanced by incorporating directional point constraints and RANSAC-based geometric filtering. Based on the recovered correspondences, the encoded information is decoded through color recognition and validation, enabling automatic control point recovery. Experimental results in a cross-flight-line scenario with a single target UAV demonstrate that the proposed method achieves stable node matching and encoding–decoding, with a sequence-level accuracy of 76.32%, and a final effective decoding rate of 71.05%, while maintaining centimeter-level positioning accuracy, thereby validating its effectiveness for automatic control point acquisition in UAV mapping.

8:45am - 9:00am

6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data

Saptarshi Neil Sinha¹, Paul Julius Kühn¹, Mika Silvan Goschke¹, Michael Weinmann²

¹Fraunhofer IGD, Germany; ²Delft University of Technology, Netherlands

Automated and selective harvesting of fruits is increasingly vital due to high costs and seasonal labor shortages in advanced economies. This paper explores 6D pose estimation of strawberries using synthetic data generated through a procedural pipeline for photorealistic rendering. We utilize the YOLOX-6D-Pose algorithm, a single-shot method leveraging the YOLOX backbone, known for its balance of speed and accuracy and its suitability for edge inference. To counter the lack of training data, we develop a robust and flexible pipeline for generating synthetic strawberry data from various 3D models in Blender, focusing on enhancing realism compared to prior efforts, thus providing a valuable resource for training pose estimation algorithms.

Quantitative evaluations show that our models achieve comparable accuracy on both the NVIDIA RTX 3090 and Jetson Orin Nano across several ADD-S metrics, with the RTX 3090 offering superior processing speed. However, the Jetson Orin Nano is particularly effective for resource-constrained environments, making it ideal for deployment in agricultural robotics. Qualitative assessments further validate the model's performance, demonstrating accurate pose inference for ripe and partially ripe strawberries, although challenges remain in detecting unripe specimens. This highlights opportunities for future enhancements, particularly in improving detection for unripe strawberries by exploring color variations. Moreover, the presented methodology can be easily adapted for other fruits, such as apples, peaches, and plums, broadening its applicability in agricultural automation.

9:00am - 9:15am

A Comparison of Multi-View Stereo Methods for Photogrammetric 3D Reconstruction: From Traditional to Learning-Based Approaches

Yawen Li, George Vosselman, Francesco Nex

Faculty of Geo-Information Science and Earth Observation (ITC), University of Twente, Enschede, The Netherlands

Photogrammetric 3D reconstruction has long relied on traditional Structure-from-Motion (SfM) and Multi-View Stereo (MVS) methods, which provide high accuracy but face challenges in speed and scalability. Recently, learning-based MVS methods have emerged, aiming for faster and more efficient reconstruction. This work presents a comparative evaluation between a representative traditional MVS pipeline (COLMAP) and state-of-the-art learning-based approaches, including geometry-guided methods (MVSNet, PatchmatchNet, MVSAnywhere, MVSFormer++) and end-to-end frameworks (Stereo4D, FoundationStereo, DUSt3R, MASt3R, Fast3R, VGGT). Two experiments were conducted on different aerial scenarios. The first experiment used the MARS-LVIG dataset, where ground-truth 3D reconstruction was provided by LiDAR point clouds. The second experiment used a public scene from the Pix4D official website, with ground truth generated by Pix4Dmapper. We evaluated accuracy, coverage, and runtime across all methods. Experimental results show that although COLMAP can provide reliable and geometrically consistent reconstruction results, it requires more computation time. In cases where traditional methods fail in image registration, learning-based approaches exhibit stronger feature-matching capability and greater robustness. Geometry-guided methods usually require careful dataset preparation and often depend on camera pose or depth priors generated by COLMAP. End-to-end methods such as DUSt3R and VGGT achieve competitive accuracy and reasonable coverage while offering substantially faster reconstruction. However, they exhibit relatively large residuals in 3D reconstruction, particularly in challenging scenarios.

9:15am - 9:30am

Automatic detection models for building exterior wall cracks in drone imagery based on CNN and Transformer

Yaoling Shang¹, Ying Ge², Yuqing Ma³, Yingying Zhang³, Shilin Lv³

¹National Quality Inspection and Testing Center for Surveying and Mapping Products, China, People's Republic of; ²Hohai University, China, People's Republic of; ³State Grid Zhejiang Electric Power Co.,Ltd. Logistics Service Company, China, People's Republic of

This study presents a comprehensive evaluation of six deep learning models for building exterior crack detection using UAV imagery. Our framework systematically compares Standard U-Net, ResNet34-UNet, UNet-Attention, UNet-Residual, HybridUNet, and TransUNet through rigorous ablation experiments. The models were trained on dedicated drone-captured crack imagery and evaluated using multiple loss functions and performance metrics.

Results show that TransUNet achieves optimal performance (87.66% F1 Score, 90.43% Precision, 89.99% Recall) by leveraging Transformer-based global context modeling. Notably, the performance gap among all six models remains minimal (<0.5% F1 Score difference), suggesting limited returns from increased architectural complexity alone. F1 Loss demonstrates the most balanced performance across architectures, while Focal-Dice-Loss offers superior optimization stability.

The study provides practical guidance for model selection: TransUNet with F1 Loss suits high-accuracy requirements, while simpler attention-enhanced U-Net variants offer cost-effective solutions for large-scale applications. These findings advance intelligent crack detection methodologies and emphasize balancing accuracy with computational efficiency for real-world structural health monitoring.

9:30am - 9:45am

Towards real-time UAV path replanning based on photogrammetry and learning-based approaches

Débora Paula Simões^1,2, Henrique Candido de Oliveira¹

¹University of Campinas, Brazil; ²IFSULDEMINAS, Brazil

Unmanned Aerial Vehicles (UAVs) have contributed to a wide range of applications, becoming faster and more sustainable nowadays. However, given the significant increase in the number of UAVs, concerns regarding operational safety have grown.

Autonomous UAV path planning must ensure compliance with safety requirements. This study proposes a real-time path replanning method focused on ensuring compliance with regulations governing UAV operations. Considering no-fly zones (NFZs) defined by both static (buildings) and dynamic (people) obstacles, a low-cost and replicable solution was implemented in four main steps: 3D offline path planning using the A* algorithm and Digital Elevation Models; human detection in UAV imagery using the YOLO11m model; estimation of the person’s 3D coordinates using Monoplotting; and experiments of real-time path replanning. During flight

execution, imagery acquired by the UAV is transmitted to a server and, if a person is detected, path replanning is performed. The replanned route is then sent to the UAV controller to be executed via an SDK-based application. For flights at reduced speeds, the proposed method demonstrated feasibility in a computational environment (replanning time of 2.79 s). Simulated flight execution using the DJI Mobile SDK was successful. However, when relying on data transmission over Wi-Fi, the replanning duration on a local server (17.96 s) remained unsuitable for real-time operations. As future work, alternative solutions should be explored to ensure real-time processing. Despite the challenges, this study contributes by validating the open and free DJI MSDK application for path execution in a simulated environment, integrated with a listener application.

9:45am - 10:00am

PC2Model: ISPRS benchmark on 3D point cloud to model registration

Mehdi Maboudi¹, Said Harb¹, Jackson Ferrao¹, Kourosh Khoshelham², Yelda Turkan³, Karam Mawas¹

¹Technische Universität Braunschweig; Institute of Geodesy and Photogrammetry, Germany; ²Department of Infrastructure Engineering, University of Melbourne, Australia; ³Civil & Construction Engineering, Oregon State University, USA

Point cloud registration involves aligning one point cloud with another or with a three-dimensional (3D) model, enabling the integration of multimodal data into a unified representation. This is essential in applications such as construction monitoring, autonomous driving, robotics, and virtual or augmented reality (VR/AR).With the increasing accessibility of point cloud acquisition technologies, such as Light Detection and Ranging (LiDAR) and structured light scanning, along with recent advances in deep learning, the research focus has increasingly shifted towards downstream tasks, particularly point cloud-to-model (PC2Model) registration. While data-driven methods aim to automate this process, they struggle with sparsity, noise, clutter, and occlusions in real-world scans, which limit their performance. To address these challenges, this paper introduces the PC2Model benchmark, a publicly available dataset designed to support the training and evaluation of both classical and data-driven methods. Developed under the leadership of ICWG II/Ib, the PC2Model benchmark adopts a hybrid design that combines simulated point clouds with, in some cases, real-world scans and their corresponding 3D models. Simulated data provide precise ground truth and controlled conditions, while real-world data introduce sensor and environmental artefacts. This design supports robust training and evaluation across domains and enables the systematic analysis of model transferability from simulated to real-world scenarios. The dataset is publicly accessible at: https://zenodo.org/records/17581812.

8:30am - 10:00am

WG III/7A: Remote Sensing of the Hydrosphere and Cryosphere
Location: 714A

8:30am - 8:45am

Mass Balance Estimation of Gangotri Glacier, India, through Ice Thickness changes using Sentinel-1 SAR data

Shubham Bhattacharjee¹, Rahul Dev Garg¹, Arvind Chandra Pandey²

¹Indian Institute of Technology Roorkee, Roorkee, India; ²Central University of Jharkhand, Ranchi, India

The cryosphere responds to variations in the climate. Monitoring glaciers requires research into their dynamics. The surface velocity of the Gangotri glacier was obtained in this study using the Sentinel-1 dataset. Modifying the laminar flow model improved estimates of ice thickness. Moreover, the glacier mass balance has been calculated using changes in ice thickness between 2017 and 2022. An average velocity of 0.09 m/day was observed with stretches from 0.12 to 0.23 m/day in the central trunk. A mean thickness of 189 ± 17.01 m was determined for the glacial ice. The thickest areas, with the least drag, were measured to be 587 ± 52.83 m in the middle part. Negative mass rates of -1.3 to -0.5 m.w.e./year were observed for the glacier system (with thickness changes of -3 to -0.6 m/year) due to the glacier's decreased thickness throughout time.

8:45am - 9:00am

Three-Quarters of a Century of Glacier Mass Loss and Lake Emergence in the Beas Basin, Western Himalaya

Roja Asharaf, Anil V Kulkarni, Ravi S Nanjundiah, Ramananda Chakrabarti

Indian Institute of Science, India

The Himalayan region hosts the largest reservoir of snow and ice outside the polar regions. However, ongoing climate change has resulted in widespread glacier retreat, heightening the frequency and magnitude of extreme events, including flashfloods, landslides, and Glacier Lake Outburst Floods. The Beas Basin in the northwestern Himalaya exemplifies this vulnerability, where cryospheric transformations directly threaten downstream communities, hydropower systems, and infrastructure. Despite its critical importance, long-term basin-scale records remain limited. Therefore, this study investigates the long-term cryospheric evolution of the Beas Basin and identifies emerging glacial lakes using an integrated remote-sensing and modelling-approach. Glacier mass balance from 1951 to 2024 was estimated using an Improved Accumulation-Area-Ratio method, incorporating equilibrium-line-altitudes derived from ASTER-DEM and meteorological data, alongside glacier extents from Landsat and Sentinel imagery. Current glacier ice reserves were quantified using laminar-flow and volume–area scaling methods, with surface velocities derived from sub-pixel Landsat image correlation, and slope from DEMs. Future glacial lake formation was assessed using the HIGTHIM tool, which integrates ice thickness, bed topography, and moraines. Results indicate a mean area-weighted mass balance of –0.46±0.26m.w.e.a⁻¹, corresponding to 17.75Gt cumulative ice loss (~48% of glacier-stored mass) since 1951 and a current ice reserve of 19.60±3.5 Gt. Sixty-three potential glacial lake sites were identified, with four existing lakes projected to expand, totalling 122±22 million-m³of water. These findings reveal extensive cryospheric reorganisation, with significant implications for hydrology, water security, and hazard management. The study demonstrates the value of combining satellite observations with process-based modelling for monitoring Himalayan glacier dynamics in data-sparse regions.

9:00am - 9:15am

Basal Melting and Potential Warm Water Intrusion Beneath Antarctic Ice Shelves

Shuang Wu^1,2, Tian Chang^1,2, Lu An^1,2, Rongxing Li^1,2

¹Center for Spatial Information Science and Sustainable Development Applications, Tongji University, 1239 Siping Road, Shanghai 200092, China; ²College of Surveying and Geo-Informatics, Tongji University, 1239 Siping Road, Shanghai, China

The intrusion of relatively warm ocean waters beneath Antarctic ice shelves is a key driver of basal melting and strongly influences ice-shelf stability. However, previous studies investigating warm-water pathways have largely relied on single-source datasets, such as ship-based Conductivity–Temperature–Depth (CTD) measurements, which are spatially sparse and limited to a few well-surveyed regions. Recent advances in multi-source remote sensing datasets provide new opportunities to address these limitations. In this study, a multi-source remote sensing–based framework is developed to identify potential pathways of relatively warm water intrusion beneath Antarctic ice shelves and to quantify the associated basal melting. The Moscow University Ice Shelf (MUIS) is used as a case study. Across the continental shelf, CTD observations, sub-ice-shelf bathymetry, and modeled ocean circulation are integrated to infer potential intrusion routes. At the ice-shelf front and base, EN4 reanalysis data are used to characterize seawater properties, while satellite-derived basal melt products are applied to analyze spatial and vertical patterns of basal melting. Results indicate that relatively warm water is mainly concentrated at depths of 300–500 m, coinciding with bathymetric depressions that facilitate its intrusion beneath MUIS. Enhanced basal melting occurs near the ice front and grounding line, primarily within the upper 0–500 m of the ice-shelf draft, with an average melt rate of ~6 m yr⁻¹. The proposed framework provides a transferable approach for investigating ocean-driven melting beneath Antarctic ice shelves.

9:15am - 9:30am

Impact of Flux Gate Location on Antarctic Mass Balance via Input-Output Method

Yixiang Tian^1,2, Shuwei Jia^1,2

¹College of Surveying and Geo-Informatics, Tongji University, China, People's Republic of; ²Center for Spatial Information Science and Sustainable Development Applications, Tongji University,China, People's Republic of

The Antarctic Ice Sheet (AIS), the largest terrestrial ice mass on Earth, contains approximately 90% of the planet's total ice volume. This study quantifies ice discharge and associated uncertainties in AIS estimates through Input-Output method, evaluating the impact of flux gate locations on discharge magnitude and measurement uncertainty. Through analysis of key factors contributing to discharge uncertainty, we propose a gate positioning strategy that optimizes the balance between proximity to the grounding line and uncertainty minimization.

9:30am - 9:45am

Spatiotemporal Accuracy Assessment and Application of ICESat-2 Satellite Observations over the Antarctic Ice Sheet

Gang Qiao^1,2, Youquan He^1,2, Hongwei Li^1,2

¹Center for Spatial Information Science and Sustainable Development Applications, Tongji University, China; ²College of Surveying and Geo-Informatics, Tongji University, China

NASA’s ICESat-2, a single-photon lidar satellite launched in 2018, has for six years delivered pole-wide elevation data with <0.4 cm/yr precision. To verify and exploit these data over Antarctica, we built a “space-air-ground” calibration chain. (1) A cross-track array of corner-cube retro-reflectors (CCRs) was installed at Kunlun, Taishan and Zhongshan stations; one deployment captures both ascending and descending passes, doubling efficiency. GNSS-PPP/RTK solutions overcome the absence of fixed reference points and position CCRs to within 1 cm; comparison with ICESat-2 tracks shows sub-4 cm vertical accuracy, confirming stable on-orbit performance. (2) UAV photogrammetry during the 36th CHINARE expedition produced 5 cm-resolution DEMs of crevassed ice margins at Zhongshan/Prydz Bay. Fused with RTK ground control, these reveal ICESat-2 planimetric offsets of 2–5 m and serve as “truth” for a new Photon-Cloud algorithm that corrects slope-induced positioning errors and extends the mission’s utility in rugged terrain. (3) Whole-continent cross-over analysis of repeat tracks shows millimetre-level consistency between ascending and descending orbits; an improved cross-track model extracts robust elevation-change time series for stable ice interiors. The integrated framework provides ICESat-2 Antarctic accuracy metrics, refined processing tools and a transferable protocol for future polar photon-counting altimetry missions.

9:45am - 10:00am

Enhancing existing Remote-Sensing Datasets with weakly supervised Deep Learning: A Case Study on Antarctic Rock Outcrops

Felix Dahle, Roderik Lindenbergh, Bert Wouters

TU Delft, The Netherlands, Dept. of Geoscience & Remote Sensing

Accurate mapping of exposed rock is fundamental for cryospheric and geospatial analyses in Antarctica, yet existing products are of limited resolution and tend to underestimate true rock exposure. We present a weakly supervised deep-learning framework that refines existing rock masks by combining Sentinel-2 multispectral imagery with elevation and slope data from the Reference Elevation Model of Antarctica (REMA). A U-Net with eight input channels (six spectral bands, elevation, slope) is trained using imperfect Landsat- and GeoMap based labels. Trained on data from the Antarctic Peninsula, the model produces a 10~m rock mask that delineates small and shaded outcrops more effectively than existing datasets. While quantitative evaluation is constrained by imperfect reference data, qualitative inspection indicates improved rock–snow separation. The workflow is fully automated, requires no manual annotation, and scales efficiently to all rock-hosting regions of the continent reachable by Sentinel-2 multispectral coverage. Beyond rock mapping, the framework is transferable to other scenarios with incomplete or uncertain reference data, such as vegetation, snow, or water mapping. The resulting rock mask for complete Antarctica, together with the trained model and preprocessing scripts, will be released to support reproducible large-scale mapping and future cryospheric research.

8:30am - 10:00am

WG III/9: Geospatial Environment and Health Analytics
Location: 714B

8:30am - 8:45am

Urban Livability Analysis Based on Multi-Source Remote Sensing Data

Hailun Dai^1,2, Yu Liu¹, Tao Zhang¹, Guanghui Wang¹, Lijuan Zheng¹, Lei Miao¹, Ting Liu¹, Yunjia Zou¹, Wei Zhang^1,3, Jin Liu^1,2

¹Land Satellite Remote Sensing Application Center, Ministry of Natural Resources, China, People's Republic of; ²Beijing Satlmage Information Technology Co. Ltd, China; ³China University of Geosciences, Beijing, 100083, P. R. China

Under the background of city physical examination and assessment in territorial spatial planning, urban livability has become a focus of interest. Urban livability reflects residents' overall satisfaction with their living environment. Previous studies have been constrained by issues such as low data precision, coarse spatial scales, and limited practical applicability. To address these limitations, this study developed a refined livability evaluation framework by multi-source remote sensing data, with a primary emphasis on high-resolution domestic satellite imagery, including Gaofen (GF-1) and Ziyuan (ZY-3). Integrated with Suomi NPP night-time light data and socio-economic datasets, the research assessed four key dimensions, which were safety and resilience, residential comfort, recreation convenience, and quality and vitality in the city of Wuhan and Yibin at a detailed kilometer-grid scale. Results revealed distinct spatial patterns of urban livability of the two cities: Wuhan's central urban areas exhibited higher, more clustered livability, driven largely by quality and vitality, whereas Yibin showed a more fragmented pattern with strengths in recreation convenience but relative weaknesses in residential comfort and urban vitality. This study underscores the significant value of high-resolution, multi-source remote sensing data in enabling precise, spatially explicit livability analysis, thereby providing a scientific basis for targeted spatial planning and urban quality enhancement.

8:45am - 9:00am

Integrated Remote Sensing and GIS-Based Assessment of Urban Morphology, Waterlogging, and Dengue Hotspots in Chennai (2021–2023)

Swetha Sureshkumar, Sulochana Shekhar

Central University of Tamil Nadu, India

Dengue transmission in rapidly urbanising tropical cities is shaped by the combined influence of climate variability, urban morphology, and short-term surface water dynamics. This study develops a remote sensing and GIS-based framework to investigate the interaction between built-up density, waterlogging, and dengue incidence in Chennai from 2021 to 2023. Multi-source datasets, including Sentinel-2 imagery, NICFI high-resolution LULC, NDVI, and NDWI indices, Google Open Buildings footprints, IMD daily climate variables, and geocoded dengue case records, were integrated into a harmonised spatial grid for systematic analysis. Waterlogging-prone zones were delineated using a Sentinel-2 water-frequency method to capture the post-rainfall surface water accumulation rather than only persistent water bodies. Spatial clustering of dengue cases was examined using kernel density estimation, Global and Local Moran’s I, and Getis-Ord Gi*, revealing strong spatial autocorrelation and persistent hotspots in older, densely built neighbourhoods such as Kodambakkam, Adyar, Guindy, Saidapet, and Velachery, where compact built-up patterns and drainage limitations facilitate vector breeding. Peripheral areas showed weaker clustering and lower disease intensity. To assess the climatic influences, a Distributed Lag Non-linear Model (DLNM) was employed to quantify the delayed and non-linear effects of rainfall, maximum temperature, and minimum temperature on dengue incidence. Results showed notable lagged responses, with rainfall and minimum temperature exhibiting strong delayed associations aligned with mosquito development and viral incubation cycles. By integrating climatic, hydrological, and urban structural metrics, this study provides a replicable geospatial workflow for identifying micro-scale dengue-risk environments, supporting evidence-based vector-control strategies and climate-resilient urban planning in tropical cities.

9:00am - 9:15am

From Pixels to Pathogens: Multi-Scale Environmental Modeling of Tick-Borne Disease Risk

Kirsten Noltie, Dongmei Chen

Queen's University, Canada

Ticks are key vectors of human and animal disease, with Borrelia burgdorferi sensu stricto, the causative agent of Lyme disease, posing the greatest risk in North America. In Canada, Lyme disease cases are rising as the blacklegged tick (Ixodes scapularis) expands northward, driven by climate change, land cover shifts, and host movement. The Kingston, Frontenac, Lennox and Addington (KFL&A) region is a well-established hotspot, highlighting the importance of mechanistic models that realistically represent heterogeneous environmental drivers of transmission.

This study integrates multi-sensor Earth observation (MODIS, GEDI, Landsat) with climate, habitat, and ecological data to improve mechanistic tick phenology models. A hierarchical framework incorporates microclimate, landscape, and regional variables, enabling assessment of how sensor type, spatial resolution, and environmental gradients influence seasonal tick activity predictions. Model calibration and validation use field-collected tick and pathogen data, supplemented by citizen science observations.

By systematically linking EO to disease modeling, this approach improves the representation of environmental drivers, enhances predictive performance, and supports public health planning. The framework is transferable to other vector-borne diseases, advancing the integration of remote sensing into epidemiological forecasting at regional to national scales.

9:15am - 9:30am

Detection of Illegal Landfills on Satellite Imagery Using a Multi-agent Framework

Yehor Lytvynov¹, Viktoriia Hnatushenko^1,2, Volodymyr Hnatushenko³, Christian Heipke²

¹Ukrainian State University of Science and Technologies; ²Leibniz University Hannover, Germany; ³Dnipro University of Technology

Illegal waste disposal sites pose significant ecological and public-health risks yet remain difficult to track with traditional field inspections. We propose a multi-agent detection framework that fuses textural, spectral, and contextual cues from medium-resolution satellite imagery for this work. Three specialised agents - Waste-Pile, Road, and Industry detectors - are implemented as YOLO (You Only Look Once) convolutional models that generate partial hypotheses, which are then hierarchically aggregated through rule weights learned from expert-labelled samples. The system provides an interpretable set of object relations, allowing regulators to trace how individual cues contribute to the final decision. The method was validated on an independent test area near Taromske (Dnipropetrovsk region, Ukraine) and corroborated by ground surveys. Joint aggregation raised the posterior probability of the primary target cluster from 0.27 (single-detector confidence) to 0.91, while maintaining robustness to label noise and heterogeneous sensor characteristics. Compared with conventional CNN baselines, the proposed approach delivers three key advantages: explicit explainability of outputs, transferability to 10 m spatial resolution without extensive retraining, and seamless integration of heterogeneous evidence sources. The proposed framework can serve as a cost-effective backbone for regional and national waste-monitoring systems. Future work will focus on near-real-time processing of Sentinel-2 time series, incorporation of hyperspectral and thermal methane indicators to assess remediation stages, and extension of the array of features to other anthropogenic disturbances such as open-pit mining and construction debris.

9:30am - 9:45am

Building Deformation Monitoring and Safety Risk Assessment Based on PSI Technology

Naiyi Li^1,2, Feng Zhao^1,2, Wenqiang Yao^1,2

¹Shanghai Surveying And Mapping Institute, China; ²Shanghai Natural Resources Satellite Application Technology Center,China

Based on traditional PS-InSAR technology, this study proposes a building elevation estimation method based on long and short baseline iteration. It utilizes long-temporal SAR images for multiple iterations to calculate building heights, which are used as prior information. Combined with the Interferometric Point Target Analysis (IPTA) method, it inverts building deformation information. The K-means clustering method is employed for PS point clustering analysis, classifying PS points with similar deformation trends and mapping them to buildings. A building safety risk assessment system is established, which comprehensively evaluates the cumulative deformation amount and deformation rate of both the building structure and its foundation. In this paper, the feasibility of the above method is verified by an example. The deformation of 9442 buildings is extracted in the study area, of which 245 buildings are in a high security risk state, and 2 buildings are in a high security risk state. Through this study, it can provide comprehensive auxiliary decision-making reference data covering macro wide-area and micro single buildings for urban construction management departments.

8:30am - 10:00am

WG III/8J: Remote Sensing for Agricultural and Natural Ecosystems
Location: 715A

8:30am - 8:45am

Estimating grassland dry mass in forage mixes using UAV imagery and PCR

Matheus Modesto Silva¹, Rebeca Campos Emiliano da Silva¹, Antonio Maria Garcia Tommaselli², Nilton Nobuhiro Imai²

¹Graduate Program of Cartographic Sciences, Faculty of Sciences and Technology, São Paulo State University (UNESP) at Presidente Prudente; ²Department of Cartography, São Paulo State University (UNESP) at Presidente Prudente

Beef cattle farming is a significant activity in Brazil, and forage quality has a direct impact on animal performance. However, traditional methods for estimating dry mass, which involve cutting, drying and weighing plant material, are slow and labor-intensive. UAVs equipped with multispectral sensors, such as the DJI Mavic 3M, offer a faster and more scalable alternative for monitoring mixed-forage pastures. This study estimates the dry mass of forage mixtures using multispectral UAV data in two scenarios: (i) using only spectral information and (ii) combining spectral data with canopy height measured in the field. Model performance was evaluated using R², RMSE, and percentage error. The multispectral-only model explained 55% of dry mass variability (720.56 kg/ha; 23.67%), while adding canopy height improved performance to 80% and reduced the error to 589.41 kg/ha (19.36%). Results show that canopy height enhances the accuracy and operational potential of UAV-based methods for estimating dry mass in mixed-forage areas.

8:45am - 9:00am

Predicting Plant Diversity in Revegetated Grasslands with Sentinel-2: Comparing Performance of Spatio-Temporal Features with Input Time Series

Pinja Lindgrén¹, Matthieu Molinier¹, Alexander Kießling², Astrid Tischler²

¹VTT Technical Research Centre of Finland Ltd, Finland; ²Bonatica

Mining companies are continuously looking for cost efficient methods to monitor the success of their rehabilitation efforts. Although open access satellite imagery is available at regular temporal intervals, its usefulness for grassland biodiversity monitoring has been questioned due to its coarse spatial resolution with respect to the species size. To compensate for the low spatial resolution, previous studies have successfully explored the benefits of using a multitemporal set of Sentinel-2 (S2) images. However, unless the temporal patterns are studied as a whole, some of the phenological information such as growth rates are lost, and delayed snow cover may spread events like growth onset over multiple dates between plots. This study aims to explore the added value of temporal fitting of Sentinel-2 time series (ts) over existing baseline models applied using the full time series as such. Our set of temporal features included functional components, harmonic decomposition, frequency decomposition, and phenological metrics. Out of the compared models, the Random Forest regression model using a set of fitted temporal features achieved the highest holdout prediction accuracy (R2 = 0.36, RMSE = 3.87, relative RMSE = 0.20) and cross-validation accuracy similar to the baseline models. However, all the compared regression models underestimated extreme plant diversity to some extent. Future studies should account for varying vegetation cover and terrain features by incorporating auxiliary data.

9:00am - 9:15am

Mapping Shrub and Tree Encroachment in Canadian Prairies using Stacking Ensemble and Sentinel-1/2 Imagery

Yihan Pu, Irini Soubry, Xulin Guo

Department of Geography and Planning, University of Saskatchewan, Canada

Woody plant encroachment (WPE) threatens grassland ecosystems across the Canadian Prairies, causing grassland biodiversity loss with substantial economic impacts due to reduced forage production. While remote sensing offers scalable monitoring capabilities, existing approaches lack frameworks for distinguishing shrub and tree encroachment and often require extensive ground truth data. This study developed an ensemble machine learning framework integrating Sentinel-1 SAR and Sentinel-2 optical imagery with UAV-derived training data to map fractional shrub and tree cover across Saskatchewan's Aspen Parkland and Moist Mixed Grassland ecoregions, SK. A stacking ensemble combining Random Forest, Support Vector Machine, XGBoost, and Artificial Neural Network models with Ridge regression meta-learning outperformed individual algorithms, achieving mean R² values of 0.65 for shrub and 0.68 for tree cover prediction. Multi-scale training incorporating features at 10, 30, 50, and 70 m resolution improved performance by 15% for shrub and 24% for tree compared to single-scale approaches. Feature importance analysis revealed that shrub detection relied primarily on red-edge bands and moisture indices, while tree detection depended heavily on SAR backscatters. Quantile histogram matching enabled successful model transfer from Foam Lake Community pasture to Aberdeen Community Pasture, with resulting maps indicating that total WPE exceeded 50% in both study areas, with shrubs occupying 23.7% (Foam Lake) and 18.5% (Aberdeen) of both regions at rates higher than 5% shrub cover. The present framework provides a scalable, cost-effective approach for operational woody encroachment monitoring, enabling early detection and targeted functional management interventions to preserve grassland ecosystems.

9:15am - 9:30am

Integrating Earth observations and machine learning for large-scale fractional vegetation cover mapping of wood bison habitat

Michael Merchant, Jennifer Hird, Jenet Dooley, Victor Tran, Cynthia McClain, Mir Mustafizur Rahman

Alberta Biodiversity Monitoring Institute

Fractional vegetation cover (FVC) is a key land surface parameter describing vegetation abundance and structure, defined as the fraction of the ground area occupied by vegetation when viewed from nadir. FVC provides essential insights into ecosystem condition, productivity, and disturbance, making it a critical variable for biodiversity monitoring and habitat assessment. However, generating accurate and repeatable FVC estimates remains challenging due to scale effects, spatial resolution constraints, and inconsistencies in available validation data across time and space. This research develops a machine learning (ML) framework for large-scale FVC estimation that addresses these challenges by combining multi-sensor Earth observation data and Active Learning (AL) model refinement techniques. The ML framework is applied within key wood bison habitat in northern Alberta, focusing on mapping six vegetation components: spruce, pine, deciduous, shrub, herbaceous, and moss. The approach integrates Sentinel-1, Sentinel-2, Landsat-9, and GLO-30 data, optimized through feature selection and ensemble-based Random Forest modeling. The resulting FVC maps achieved strong predictive performance (R² = 0.50–0.88) and capture fine-scale spatial variability in vegetation composition. The ML pipeline provides a scalable and adaptive framework for FVC estimation that supports provincial landcover updates, improves understanding of wood bison habitat features, and contributes to ongoing ecosystem monitoring and conservation planning across boreal Alberta.

9:30am - 9:45am

DINOKey: Transformer-Based Keypoint Detection for Wildlife Monitoring in Aerial Imagery

Jayden Hsiao¹, Muhammed Patel¹, David A. Clausi¹, Linlin Xu²

¹University of Waterloo, Canada; ²University of Calgary, Canada

Wildlife monitoring from aerial imagery often requires precise animal localization under practical constraints where only object counts are needed. Traditional detection methods rely on bounding-box annotations, introducing unnecessary cognitive load for small objects spanning only a few dozen pixels. This work introduces DINOKey, a modified DINO transformer-based detector adapted to operate natively on point annotations rather than bounding boxes. Key contributions include: (1) architectural modifications to the DINO decoder, detection head, and denoising queries to directly predict 2D keypoints; (2) a combined loss function integrating L1 regression, focal loss, and average Hausdorff distance, with ablations validating each component; (3) open-source implementation within an existing detection framework; and (4) demonstration of improved small-object localization and reduced false positives on an aerial elephant dataset compared to box-supervised baselines. Ablation studies show that the Hausdorff distance term provides the largest accuracy gain by effectively reducing false positives, while focal loss improves stability in densely clustered regions. The proposed method achieves 0.786 mAP and accurately localizes animal centers across diverse environmental conditions, offering a practical solution for conservation practitioners working under tight logistical constraints.

9:45am - 10:00am

Testing a novel UAV SWIR imaging system for estimating absolute water content in Tillandsia landbeckii

Fabian Reddig¹, Christoph Hütt¹, Leon Vehlken¹, Nora Tilly¹, Sebastián Yassir Espinoza Guzmán³, Jan Wolf¹, Annika Klee¹, Marcus Koch^4,5,6, Georg Bareth¹, Alexander Jenal²

¹GIS & RS Group, Institute of Geography, University of Cologne, Germany; ²Application Center for Machine Learning and Sensor Technology (AMLS), University of Applied Sciences Koblenz, Germany; ³Departamento de Ciencias Geológicas, Universidad Católica del Norte, Chile; ⁴Center for Organismal Studies, Biodiversity and Plant Systematics, Heidelberg University, Germany; ⁵Cluster of Excellence GreenRobust, Heidelberg University, 69120 Heidelberg, Germany; ⁶Heidelberg Center for the Environment, Heidelberg University, 69120 Heidelberg, Germany

Fog-dependent ecosystems in the Atacama Desert host highly specialized vegetation, yet monitoring their functional traits remains challenging due to remoteness and limited spectral detectability. The bromeliad Tillandsia landbeckii exhibits extremely low reflectance in the VIS/NIR range, rendering conventional multispectral approaches ineffective.

This study evaluates the potential of a novel UAV-based VNIR/SWIR multi-camera system (camSWIR) for estimating canopy water content (CWC) in Tillandsia landbeckii. A UAV survey conducted in northern Chile acquired high-resolution (≈3 cm GSD) SWIR imagery across four operational bands (1100–1650 nm). Field-based destructive sampling (n = 20) provided reference CWC measurements, and a statistically rigorous workflow was applied to mitigate overfitting in a high-dimensional predictor space.

Results show that the spectral slope between 1200 and 1510 nm is the most informative predictor of CWC, with cross-validated performance indicating moderate predictive skill (LOOCV R² ≈ 0.52), but reduced stability under nested validation. The repeated selection of predictors within this wavelength region confirms a physically meaningful relationship with liquid water absorption.

Despite limitations due to a small sample size and species-specific optical properties, particularly the dense trichome layer that affects light interactions, the study demonstrates the feasibility of SWIR-based, non-destructive CWC estimation in hyper-arid ecosystems. These findings provide a proof of concept for future upscaling, highlighting the need for larger calibration datasets and improved modelling to enable reliable spatial mapping of plant water status.

10:00am - 10:15am

Adapting Deep Anomaly Detection for Automated Aerial Caribou Monitoring in Alaska

Ghazaleh Serati^1,2, Jérôme Théau^1,2, Samuel Foucher¹

¹Université de Sherbrooke, Canada; ²Quebec Centre for Biodiversity Science (QCBS)

Aerial imagery provides a powerful avenue for monitoring wildlife populations, yet automated detection remains challenging. Animals typically occupy only a tiny fraction of large-scale aerial imagery, may be partially obscured, and appear against highly diverse Arctic and sub-Arctic backgrounds. Suppervised deep-learning detectors also depend on large, fully annotated datasets, making broad ecological surveys labor-intensive and slow to scale.

This study explores an alternative perspective: viewing wildlife as rare events within mostly background imagery. Instead of training on annotated animal samples, an anomaly-detection framework learns the visual patterns of normal landscapes and identifies deviations from these patterns as potential animal locations. To guide the model without costly labels, simple animal-like shapes are inserted into background patches during training, encouraging the network to recognise features associated with real targets while avoiding the need for detailed masks or bounding boxes.

The approach generates two outputs: patch-level predictions distinguishing empty from potentially occupied areas, and pixel-level anomaly maps highlighting likely target locations. When evaluated on a highly varied Arctic dataset, the method remains reliable despite major shifts in terrain, surface texture, animal distributions and postures, and pronounced class imbalance that often degrade supervised models. Unlike distribution-based anomaly approaches that rely on stable normal-feature statistics and frequently misinterpret natural texture variability as anomalies, this method handles heterogeneous environments more effectively.

Overall, the study shows that anomaly-oriented frameworks, typically used in industrial and medical settings, have strong potential to ease annotation demands and support scalable, automated wildlife detection in complex remote-sensing environments.

8:30am - 10:00am

WG II/3F: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

8:30am - 8:45am

Beyond Photorealism: Gaussian Splatting for the Precise Reconstruction of Complex Geometries In Underwater Photogrammetry

Andrea Cimatoribus¹, Davide Antonio Cucci¹, Fabio Menna², Erica Nocerino³, Christoph Strecha¹

¹PIX4D SA, Route de Renens 24 1008 Prilly, Switzerland; ²Department of Chemical, Physical, Mathematical and Natural Sciences, University of Sassari, Sassari, Italy; ³Department of Humanities and Social Sciences, University of Sassari, Sassari, Italy

This study examines PIX4D’s implementation of Gaussian Splatting for reconstructing complex geometries, with a focus on underwater photogrammetry for coral reef mapping. Unlike standard Gaussian Splatting pipelines that emphasize photorealistic rendering, our approach prioritizes high-precision geometric reconstruction, especially for thin structures and heavily occluded regions. We compare the method against conventional multi-view stereo techniques using both real underwater imagery collected in Moorea (French Polynesia) and synthetic datasets generated with the POSER underwater simulation framework.

8:45am - 9:00am

Merchantable Tree Stem Volume Estimation using Mobile Backpack LiDAR

Sangyoon Park¹, Hazem Hanafy¹, Chunxi Zhao¹, Jinyuan Shao², Songlin Fei², Ayman Habib¹

¹Lyles School of Civil and Construction Engineering, Purdue university, United States of America; ²Department of Forestry and Natural Resources, Purdue university, United States of America

Stand-level merchantable tree stem volume estimation in temperate forests is critical for data-driven forest management decision-making. Mobile laser scanning (MLS) has greatly improved data-collection efficiency for forest biometrics; however, automated analysis of massive, structurally complex MLS point clouds remains limited. This study presents an automated framework to estimate stand-level merchantable stem volume from backpack mobile Light Detection and Ranging (LiDAR) data. The framework comprises three stages: (1) point cloud reconstruction using the Integrated-Scan Simultaneous Trajectory Enhancement and Mapping (IS²-TEAM) method; (2) individual tree segmentation via a multistage geometric pipeline; and (3) merchantable stem volume estimation based on skeletonization-derived stem modeling. The proposed approach is evaluated on a forest-scale dataset collected in temperate natural forests in the United States. Results demonstrate operational feasibility at scale, with practical processing times and robust geometric consistency. Validation against destructively measured reference volumes shows that the proposed approach outperforms baseline quantitative structure modeling (QSM) methods, achieving a coefficient of determination (R²) of 0.97, a bias of −0.06 m³, and a root mean square error (RMSE) of 0.21 m³. The proposed framework enables reliable, automated estimation of merchantable stem volume from MLS data and supports deployment from individual-tree to forest scales with minimal manual intervention.

9:00am - 9:15am

TRACE: Instance-Level Open-Vocabulary Inventory Generation for 3D Forensic Evidence Reconstruction

Florian Johannes Eichinger^1,2, Michael Greza^1,2, Benjamin Busam^1,2

¹Technical University of Munich, Germany; ²Munich Center for Machine Learning, Germany

TRACE is a training-free framework for instance-level open-vocabulary inventory generation in 3D forensic evidence reconstruction.

Starting from multiview RGB imagery, prompt-based 2D object masks are extracted using SAM3 and associated across views via geometry-aware and appearance-aware multiview instancing. Based on COLMAP geometry and DINOv2/v3 descriptors, the proposed framework establishes globally consistent same-class object identities across the scene. The resulting global instances are then encoded with SigLIP2 to obtain language-aligned instance descriptors and subsequently lifted into a 3D Gaussian Splat representation by assigning instance-level semantics to geometrically supported Gaussian subsets. This yields an enriched 3D scene representation that jointly preserves spatial structure, object-level identity, and language-accessible semantics, thereby enabling instance-aware open-vocabulary querying in 3D.

9:15am - 9:30am

Surface Water 3-D Mapping With Point Cloud Data of Single Return Airborne LiDAR

Cihan Altuntas

Konya Technical University, Turkiye

The purpose of this study is to automatically classify water and land areas with LiDAR point clouds. After determining the average water level, the water and land surfaces were classified. Previous studies have focused on supervised classification based on land sampling or deep learning techniques using photographs. However, these classification techniques are expensive and require long calculation times. In this study, a method is proposed for the automatic classification of water and land areas without land surveys using the coordinate and reflection values of LiDAR point clouds. The bounding box method was used to detect water surface levels. The correlations between the min-box level, mean box height, and mean box reflection values of the LiDAR point data were used to determine the water surface level. The results show that the method is suitable for the fast classification of water surfaces from LiDAR point clouds. Thus, shoreline changes in large areas can be detected automatically without the need for land surveying. The proposed bounding box classification method can be applied independently of LiDAR point cloud density. The extended version of this method can also be used to detect vehicles and objects on a water surface.

9:30am - 9:45am

Enhancing underground environment rendering with lightweight 3D gaussian splatting

Roberto de Lima-Hernandez, Zhiya Yang, Maarten Vergauwen

KU Leuven, Belgium

Underground environments such as sewer networks are critical infrastructure whose condition directly affects public health, environmental protection, and maintenance costs. Conventional inspection workflows largely rely on monocular CCTV systems and manual video review, providing limited 3D understanding and often missing subtle or spatially complex defects. At the same time, sewer environments are characterised by challenging imaging conditions, including low illumination, specular surfaces, water films and occlusions, which further complicate reliable assessment.

In this extended abstract, we present a real-time inspection concept that combines (i) stereo camera-based SLAM for geometric mapping and pose estimation, (ii) Vision Transformer (ViT) based anomaly detection trained on the public SewerML dataset, and (iii) lightweight Gaussian Splatting modules that create local high-resolution 3D reconstructions only in the vicinity of detected defects. The system is targeted at embedded hardware, specifically an NVIDIA Jetson Nano, and is designed for deployment and evaluation in real sewer environments. The overall goal is to provide inspectors and asset managers with spatially anchored 3D visualisations of anomalies that can be integrated into digital-twin workflows for decision support and long-term monitoring.

9:45am - 10:00am

Robust Cross-Modal Matching between LiDAR Point Clouds and Multi-Camera Images in Tunnel Environments via Surface Parameterization

Ying Jiang¹, Feng Liu², Han Hu^1,3, Yulin Ding^1,3, Chong Wang^3,4, Ping Wen^3,4, Qing Zhu^1,3

¹Faculty of Geosciences and Engineering, Southwest Jiaotong University; ²CRSC Communication & Information Group Co., Ltd.; ³Yunnan Engineering Research Center of 3D Real Scene; ⁴Kunming Engineering Corporation Limited

This paper proposes a robust cross-modal matching framework for tunnel inspection, specifically designed to address the unique challenges posed by low-texture environments often encountered in tunnel linings. Traditional image-based matching techniques struggle in these environments due to the lack of distinctive surface features and limited texture variation. To overcome these challenges, the proposed method leverages the global prior knowledge of tunnel geometry. By jointly projecting LiDAR point clouds and multi-camera images onto a shared parameterized cylindrical surface, the method constructs a unified geometric space that facilitates accurate 3D–2D correspondences. This dual-projection strategy significantly improves the alignment of structural features such as segment joints, line grooves, and equipment brackets, which are critical for defect detection in tunnel inspection. The enhanced matching ability allows for more reliable multi-sensor data fusion, thereby supporting the automated analysis of tunnel defects. This framework lays a solid foundation for intelligent tunnel inspection systems, offering a powerful solution for real-time monitoring and analysis of tunnel infrastructure.

8:30am - 10:00am

IvS5: Next-Generation Flood Mapping: Integrating AI, Remote Sensing, and Evolving Landscapes
Location: 716A

8:30am - 8:45am

Spatiotemporal Flood Susceptibility Mapping using a Hybrid CNN-ConvLSTM Architecture

Karen Elaine Dunbar^1,2, Heather McGrath², Usman Khan¹

¹York University, Canada; ²Natural Resources Canada

Flood susceptibility mapping (FSM) is a crucial component of flood risk assessment; however, traditional statistical and machine learning methods for FSM are limited in their predictive capabilities. FSM approaches typically use static inputs, relying solely on geospatial factors, and fail to consider the spatiotemporal aspects (antecedent conditions) that trigger flood events. This study addresses this gap by developing a hybrid model that combines static geospatial features with dynamic temporal meteorological data, which is often excluded in FSM. The proposed hybrid model consists of two branches: (1) a 2D Convolutional Neural Network (CNN) to extract the features from geospatial inputs (i.e., slope and surficial geology) and (2) a Convolutional Long Short-Term Memory (ConvLSTM2D) network to learn the temporal antecedent conditions from Daymet precipitation, temperature and snow-water equivalent. This model was trained and tested in the Saint John River basin, New Brunswick, Canada — a region that has experienced significant historical flooding. Three hyperparameters were investigated: temporal sequence length (1–4-month timesteps), resampling ratio (0.1-0.7), and positive class weight (1.5 or 2.0). The optimal model was achieved with a 3-month timestep, a 0.2 resampling ratio, and a 1.5 positive class weight, resulting in an F1 score of 0.89. The model performance was highest when using a 3-month timestep, which captured the full snowmelt-to-rain spring cycle, outperforming models that used timesteps of 1, 2, or 4 months. The proposed 2D CNN-ConvLSTM2D architecture is effective in simultaneously learning the static geospatial features and temporal meteorological sequences, highlighting the importance of seasonal antecedent conditions in FSM.

8:45am - 9:00am

Risk-guided Flood Segmentation from Optical Satellite Imagery using NDWI Threshold Optimization and Segment Anything Model.

Andrea Reid¹, Shabnam Jabari¹, Heather McGrath²

¹University of New Brunswick, Canada; ²Natural Resources Canada, Government of Canada, Ottawa, ON

Optical satellite sensors are widely used for rapid flood mapping due to their global coverage and free availability. Thresholding spectral indices, such as the Normalized Difference Water Index (NDWI), can detect water pixels rapidly and with good precision. However, small shifts in threshold values can lead to large differences in flood area and data-driven approaches for threshold selection remain a challenge. At the same time, new foundation segmentation models, such as the Segment Anything Model (SAM), can extract object boundaries from images without task-specific training, though it lacks flood-specific contextual awareness. To address these limitations, we propose a risk-guided segmentation framework that combines risk-weighted optimization of NDWI thresholding, and further refinement of the NDWI mask using SAM. The goal is to improve flood delineation by incorporating information on where a flood is more likely to occur (flood hazard maps) and how flood boundaries appear visually (SAM). We evaluate the method on the 2018 spring flood along the Saint John (Wolastoq) River in New Brunswick, Canada, across five study regions for both Sentinel-2 and Landsat-8 scenes using imagery captured on May 2, 2018 (peak flood for the study regions). We show that a higher risk score corresponds to a higher segmentation accuracy, demonstrating that flood hazard maps can help guide NDWI threshold selection. Moreover, refinement with SAM improves segmentation quality compared to the baseline NDWI masks, demonstrating that the use of risk-guided spectral thresholding with foundation models can improve flood delineation in optical satellite imagery.

9:00am - 9:15am

Integration of Remote Sensing Indices and Ensemble Machine Learning with Independent HEC-RAS 2D Simulations for Improved Flood Hazard Assessment in the Ottawa River Watershed.

Temitope Oluwadare¹, Heather McGrath², Dongmei Chen¹

¹Queen's University, Canada; ²National Resource Canada

Floods remain among the most damaging natural hazards worldwide. In Canada’s capital region—Ottawa and its surrounding areas—flood prediction is crucial, most especially in flood-prone zones, to mitigate recurring events such as the 2017 and 2019 Ottawa floods, which caused extensive damage to homes and infrastructure. This study integrates 18 flood conditioning factors with remote sensing indices and ensemble machine learning to improve flood susceptibility mapping in the Ottawa River watershed. A complementary HEC-RAS 2D hydraulic model simulated flow depth and velocity under a 100-year flood scenario. The ensemble model achieved strong predictive performance (Kappa, F1-score, and AUC > 0.979) and demonstrated high transferability across sub-regions (Kappa > 0.85; F1-score > 0.92; AUC > 0.99). HEC-RAS results indicated spatial variability in flood depth (up to 15 m) and velocity (up to 15 m/s). SHAP analysis identified Elevation, HAND, MNDWI, NDWI, and Aspect as the dominant flood-driving factors. The integrated framework enhances flood susceptibility assessment and supports Natural Resources Canada’s efforts to strengthen flood risk management and resilience in the Ottawa River watershed and similar regions.

9:15am - 9:30am

Multi-Event Machine Learning for Annual Flood Susceptibility Prediction at a National Scale

Heather McGrath

Natural Resources Canada, Canada

Machine learning for flood susceptibility mapping (FSM) has traditionally relied on narrowly scoped events and temporally constrained datasets, limiting the generalizability and long-term utility of predictive models. We present a multi-event, multi-temporal modelling framework that leverages discrete flood occurrences from 2005 to 2023 to train a unified model capable of inference across an extended temporal horizon. Each flood event was treated as a spatio-temporal marker, enabling the model to learn evolving driver–event relationships and underlying temporal trends. Dynamic inputs (e.g., climate data, land use/land cover) are integrated with static geophysical features (e.g., digital terrain model and derivatives) to capture both transient and persistent influences on flood susceptibility. An XGBoost model was trained, tested, and validated using a 70/15/15 split, achieving an overall accuracy of 0.945, with true positive and true negative rates of 0.95 and 0.94, respectively. Precision scores for wet (flood-prone) and dry (non-flood-prone) classes are 0.94 and 0.95. Generated yearly national FSM maps from 2000 to 2023 were evaluated against published flood event datasets. Validation using national flood records, climate variability bulletins, and spatio-temporal analyses of year-to-year raster correlations confirms that years with elevated predicted susceptibility correspond to observed flood events. In addition, a weighted wetness score identified the years with both widespread and extreme flood-prone conditions, highlighting the model’s ability to capture multi-scale temporal dynamics. These results demonstrate that multi-event, multi-temporal modelling enhances the temporal reach and robustness of geospatial flood prediction, providing a foundation for long-term monitoring, trend analysis, and policy-relevant scenario planning.

9:30am - 9:45am

Geomorphometric analysis of urban fluvial terraces using UAV LiDAR: a case study from the La Silla River, Mexico

Mariana Pérez Martínez, Fabiola Doracely Yépez Rincón, Luis Eduardo Arista Cázares, Yadira Zulema Antonio Durán, Kevin David Rodriguez González, Hector de León Gómez, Adrian Leonardo Ferriño Fierro, Victor Hugo Guerra Cobian

Autonomus university of Nuevo León, Mexico

This study presents a high-resolution geomorphological analysis of river terraces along the urban corridor of the La Silla River (Monterrey Metropolitan Area, Mexico) using UAV-based LiDAR and photogrammetry, with a DJI Matrice 350 RTK equipped with a Zenmuse L2 sensor, generating dense point clouds, DEMs, and orthomosaics. These products allowed for the precise identification of three terrace levels (T1-T3), their geomorphometric attributes, and their lithological composition. The results reveal contrasting degrees of anthropogenic modification: while terrace 1 retains its natural morphology, terraces 2 and 3 show substantial alterations due to residential expansion, public infrastructure, and road construction, which alter the original geomorphological surfaces. Temporal satellite images also show the sensitivity of terrace geomorphology to extreme hydrometeorological phenomena, with cyclones such as Hanna (2020) and Alberto (2024) causing vegetation loss, surface restructuring, and local modification of terraces. Overall, UAV-LiDAR proved to be very effective for mapping terraces in restricted urban environments, providing essential details for monitoring, risk assessment, and sustainable management of urban rivers.

8:30am - 10:00am

Forum4A: Hybrid Intelligent Geospatial Computing
Location: 716B

8:30am - 10:00am

WG I/4: LiDAR, Laser Altimetry and Sensor Integration
Location: 717A

8:30am - 8:45am

Automated Station Planning for Terrestrial Laser Scanning in Complex Forest Environments

Wenbo Zhang¹, Zhenyang Hui¹, Yuanping Xia¹, Penggen Cheng¹, Jialin Fu¹, Zhilu Zhang¹, Ting Hui²

¹East China University of Technology, China, People's Republic of; ²College of Management, Guangdong AIB Polytechnic

Terrestrial laser scanning technology can efficiently acquire high-precision three-dimensional spatial information in complex forest environments, making it an important technical means for detailed analysis of forest structure and resource monitoring. However, traditional terrestrial laser scanners planning methods are prone to coverage gaps and data redundancy due to factors such as tree obstructions, terrain undulations, and canopy overlap, making it difficult to simultaneously balance observation completeness and scanner station deployment cost. To address this, this paper proposes an intelligent survey station planning for terrestrial laser scanners in complex forest environments. The method first uses airborne LiDAR data to build a prior forest model, which is then used to quantitatively evaluate forest visibility features by calculating the cumulative visible central angle through visibility analysis. Finally, an integer linear programming model is further introduced to achieve global optimization of the station set based on an initial feasible coverage solution obtained using a greedy algorithm. To test the performance of the proposed method, this paper applies the proposed method to the forest plot located in Lushan city, Jiangxi province, China. Experimental results indicate that the proposed method achieves an overall coverage rate of 94.55% with only seven stations, reducing the number of stations by approximately 30% and 22% compared with the greedy algorithm and genetic algorithm, respectively. The results demonstrate the effectiveness and superiority of this method for station planning in complex forest areas and provide efficient and precise technical support for forest structure monitoring and spatial information acquisition.

8:45am - 9:00am

Improved reflectance calculation in full-waveform LiDAR considering the angle of incidence

Jan Rhomberg-Kauert¹, Florian Pöppl^1,2, Helena Laasch³, Martin Pfennigbauer⁴, Gottfried Mandlburger¹

¹Department of Geodesy and Geoinformation, TU Wien, 1040 Vienna, Austria; ²Laser Measurement Systems GmbH; ³Institute of Geodesy and Photogrammetry, ETH Zurich, 8093 Zurich, Switzerland; ⁴Research and Defense GmbH

Reflectance is a widely used feature for all types laser scanning data. Thus, the accuracy and improvement of the reflectance parameter is a persistent topic of research. For short laser pulses with medium-sized footprints, previous work has investigated the effects of inclined targets on the recorded waveform of full-waveform LiDAR systems. In this work, a new methods to extract incidence angle from only a single waveform can be leveraged to improve reflectance values through recalculation based on the laser-radar equation and correcting for angle of incidence artifacts. The results of the proposed method are evaluated with two datasets based on two different topo-bathymtric laser scanners. For both systems, we calculated the relative biconical reflectance and relative averaged bidirectional reflectance distribution function (rBRDF) and evaluated them on homogeneous roof faces. The two reflectance measures are then compared to the initial reflectance values of the laser scanners used in the study. Both measures showed improvements compared to the standard values. The biconical reflectance shows the best overall mean score for all surveyed roofs with an MAD improvement of 0.80 dB to 62 dB for Sensor I and 0.61 dB to 0.56 dB for Sensor II, in addition the rBRDF also displays an improvement with varying results depending on the deployed system. These results highlight the advantages of the proposed reflectance measures and the potential improvement of the widely used LiDAR attribute.

9:00am - 9:15am

Multi-branch deep Learning Architecture for bathymetric LiDAR Point Cloud Classification

Lida Asgharian Pournodrati¹, Gottfried Mandlburger², Uwe Sörgel¹

¹Institute for Photogrammetry and Geoinformatics, University of Stuttgart, Germany; ²Department of Geodesy and Geoinformation, TU Wien, Vienna, Austria

Accurate classification of topo-bathymetric LiDAR data remains challenging due to the heterogeneous nature of land-water transitional environments, where terrestrial, water surface, and submerged features must be distinguished simultaneously. This study presents a multi-branch deep learning architecture for classifying bathymetric LiDAR data into different classes: soil ground, trees and vegetation, water surface, seabed, aquatic plants and other underwater objects (dead wood, coral reef). The proposed framework employs three parallel feature extraction branches, while the first branch captures spatial structure by focusing on three-dimensional geometric coordinates (XYZ), the other two branches use two independent 1D U-Net architectures to extract signal-based features from RGB spectral reflectance and waveform-derived attributes (intensity, return number, number of returns). The discrete LiDAR attributes, though represented as point-wise numerical values, preserve signal characteristics derived from full-waveform analysis. The encoder-decoder of 1D U-Net architecture with skip connections effectively captures sequential patterns and multi-return patterns in different classes especially in vegetation canopies. The three feature streams are fused through fully-connected layers before final classification. Evaluation using different metrics demonstrates the capability of the framework to simultaneously classify diverse coastal zone and inland waters contexts spanning terrestrial and submerged domains within a unified processing pipeline, eliminating the need for separate terrestrial and bathymetric classification workflows.

9:15am - 9:30am

Low-cost Terrestrial Laser Scanners for Permanent Monitoring of Beach-Dune Systems

Daan Cornelis Hulskemper¹, Hannah Weiser^2,3, Ronald Tabernig^2,3, Bernhard Höfle^2,3, Thomas de Jong¹, Roderik Lindenbergh¹

¹Dept. of Geoscience and Remote Sensing, Delft University of Technology, The Netherlands; ²3DGeo Research Group, Institute of Geography, Heidelberg University, Germany; ³Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Germany

Permanent laser scanning (PLS) is an effective tool for near-continuous monitoring of topographical changes in beach-dune systems. While PLS systems were traditionally costly, the emergence of affordable LiDAR sensors enables larger-scale setups with multiple scanners or sites. However, the different characteristics compared to high-end devices, create challenges for one-on-one replacement. To assess how low-cost sensors can replace high-end sensors, we compare the performance of a setup with several low-cost Livox AVIA sensors to a single high-end RIEGL VZ-2000i sensor in its ability to capture an embryonic dune field with large variation in topography. This is evaluated using HELIOS++ virtual laser scanning (VLS). To also assess the representativeness of the simulations, we further compare the VLS to real-world measurements with the Livox AVIA. Based on a VLS setup with six AVIAs mounted on tripods at 2 m above ground, a coverage of 52% can be obtained, which is similar to the coverage of a single RIEGL VZ-2000i on a tower 8 m high. The real-world experiments confirm the VLS results with a slightly lower point cloud coverage of 42%. Furthermore, the effective range of the Livox AVIA in a beach-dune system lies around 100-150 m. At larger ranges, only pulses at high incidence angles (angle between surface and incoming beam, >20°) are registered at the scanner. The variations in coverage between the VLS and real-world scans highlight the need for careful consideration of the occlusion potential of different representations of the topography, beam divergence shapes, and the moisture conditions.

9:30am - 9:45am

Assessing Trajectory Accuracy of the CHCNAV RS10 Handheld Laser Scanner

Stefan Binapfl, Carolin Rünger, Gunnar Lelle-Neumann, Anette Eltner, Ferdinand Maiwald

TUD Dresden University of Technology, Germany

The aim of this abstract is to assess the accuracy of the trajectory of the handheld laser scanner CHCNAV RS10. The trajectory data of this PLS device is compared with a simultaneously measured total station measurement.

9:45am - 10:00am

LiDAR, green-wavelength, 3D point cloud, under water, refractive index.

Yu Lan¹, Jiale Wang¹, Ji Yang¹, Mario Kolling¹, Alexander Dorndorf¹, Christoph Werner², Alexander Reiterer^2,3, Jens-André Paffenholz¹

¹Institute of Geotechnology and Mineral Resources – Geomatics, Clausthal University of Technology; ²Fraunhofer Institute for Physical Measurement Techniques IPM; ³Institute for Sustainable Systems Engineering (INATECH), University Freiburg

Green-wavelength LiDAR systems enable high-resolution 3D sensing in underwater environments, but the geometric evaluation of measurements across the waterline remains difficult. A main challenge is that traceable reference instruments usually operate only in air, while refraction at the air-water interface systematically affects both the reconstructed 3D point cloud and the geometry of partially submerged objects. To address this problem, this study presents a controlled experimental framework for evaluating waterline-induced effects in an Underwater LiDAR (ULi) system, using the Z+F IMAGER 5016A as an in-air reference. A rigid reference frame (RRF) spanning the waterline was deployed in a swimming pool. The RRF was first scanned by the IMAGER in air to establish the reference geometry and was then measured by the ULi system under waterline conditions. The analysis considered the above-water, cross-waterline, and underwater parts of the RRF. The evaluation was based not only on overall geometric deviations but also on rigid-body-invariant internal quantities, especially pairwise distances that are independent of the pose of the RRF. In addition, the sensitivity of the reconstructed geometry to the refractive-index setting used in processing was assessed by perturbing the refractive index and quantifying the resulting changes. The proposed workflow provides a practical and traceable basis for isolating and evaluating waterline-related refraction effects in controlled ULi experiments.

8:30am - 10:00am

ByA1: ISPRS Best Young Author Award Papers
Location: 717B

Comparative practices in 3-D geoinformation by national mapping and cadastral agencies

Thomas Goldring¹, Jon Mills¹, David Holland², Norbert Haala³, Shidong Wang¹

¹Newcastle University, United Kingdom; ²Ordnance Survey, United Kingdom; ³University of Stuttgart, Germany

The rapid evolution of three-dimensional (3-D) geospatial science has redefined the standards of national mapping and cadastral agencies (NMCAs). Traditionally bodies of authoritative 2-D topographic products, these organisations now face the challenge of producing, maintaining, and disseminating national-scale 3-D geospatial datasets that support applications ranging from climate adaptation and urban planning to disaster response and digital twins. This paper presents a comparative study of five NMCAs, comprising IGN (France), BKG (Germany), Kadaster (The Netherlands), GSI (Japan) and USGS (United States of America). By examining agency structure, economic models, and 3-D data collection programmes, this paper identifies converging trends in AI integration, national surveys, along with divergences in funding and implementation. The analysis highlights insights and potential lessons for organisations at early stages of national 3-D dataset implementation.

Feasibility of Indoor Frame-Wise Lidar Semantic Segmentation via Distillation from Visual Foundation Model

Haiyang Wu, Juan J. Gonzales Torres, George Vosselman, Ville Lehtola

Department of Earth Observation Science, Faculty of Geo-Information Science and Earth Observation (ITC),University of Twente, Netherlands, The

Frame-wise semantic segmentation of indoor lidar scans is a fundamental step toward higher-level 3D scene understanding and mapping applications.

However, acquiring frame-wise ground truth for training deep learning models is costly and time-consuming. This challenge is largely addressed, for imagery, by Visual Foundation Models (VFMs) which segment image frames. The same VFMs may be used to train a lidar scan frame segmentation model via a 2D-to-3D distillation pipeline. The success of such distillation has been shown for autonomous driving scenes, but not yet for indoor scenes. Here, we study the feasibility of repeating this success for indoor scenes, in a frame-wise distillation manner by coupling each lidar scan with a VFM-processed camera image. The evaluation is done using indoor SLAM datasets, where pseudo-labels are used for downstream evaluation. Also, a small manually annotated lidar dataset is provided for validation, as there are no other lidar frame-wise indoor datasets with semantics. Results show that the distilled model

achieves up to 56% mIoU under pseudo-label evaluation and around 36% mIoU with real-label, demonstrating the feasibility of

cross-modal distillation for indoor lidar semantic segmentation without manual annotations.

Diachronic Stereo Matching for multi-date Satellite Imagery

Elías Masquil^1,2, Luca Savant Aira³, Roger Marí⁴, Thibaud Ehret⁵, Pablo Musé^1,6, Gabriele Facciolo⁶

¹IIE, Facultad de Ingeniería, Universidad de la República, Uruguay; ²Digital Sense, Uruguay; ³Politecnico di Torino, Corso Duca degli Abruzzi, 24, 10129 Torino TO, Italia; ⁴Eurecat, Centre Tecnològic de Catalunya, Multimedia Technologies, Barcelona, Spain; ⁵AMIAD, Pôle Recherche, France; ⁶Université Paris-Saclay, ENS Paris-Saclay, CNRS, Centre Borelli, 91190, Gif-sur-Yvette, France

Recent advances in image-based satellite 3D reconstruction have progressed along two complementary directions. On one hand, multi-date approaches using NeRF or Gaussian-splatting jointly model appearance and geometry across many acquisitions, achieving accurate reconstructions on opportunistic imagery with numerous observations. On the other hand, classical stereoscopic reconstruc- tion pipelines deliver robust and scalable results for simultaneous or quasi-simultaneous image pairs. However, when the two images are captured months apart, strong seasonal, illumination, and shadow changes violate standard stereoscopic assumptions, causing existing pipelines to fail. This work presents the first Diachronic Stereo Matching method for satellite imagery, enabling reliable 3D reconstruction from temporally distant pairs. Two advances make this possible: (1) fine-tuning a state-of-the-art deep stereo network that leverages monocular depth priors, and (2) exposing it to a dataset specifically curated to include a diverse set of diachronic image pairs. In particular, we start from a pretrained MonSter model, originally trained on a mix of synthetic and real datasets such as SceneFlow and KITTI, and fine-tune it on a set of stereo pairs derived from the DFC2019 remote sensing challenge. This dataset contains both synchronic and diachronic pairs under diverse seasonal and illumination conditions. Experiments on multi-date WorldView-3 imagery demonstrate that our approach consistently surpasses classical pipelines and unadapted deep stereo models on both synchronic and diachronic settings. Fine-tuning on temporally diverse images, together with monocular priors, proves essential for enabling 3D reconstruction from previously incompatible acquisition dates.

Refraction-Aware Gaussian Splatting for Shallow Water Bathymetry from UAV Imagery

Taiki Uno¹, Sohei Kobayashi²

¹Kyoto University, Graduate School of Engineering, Kyoto, Japan; ²Kyoto University, Disaster Prevention Research Institute, Uji, Japan

Unmanned Aerial Vehicles (UAV)-based photogrammetry provides an efficient solution for shallow water bathymetry, yet its accuracy is fundamentally constrained by light refraction at the air-water interface, which violates the central geometric assumptions of traditional photogrammetry.

Existing approaches, ranging from empirical corrections and iterative post-processing to black-box deep learning, often compromise geometric fidelity, physical interpretability, or generalization.

We address this challenge through Refraction-Aware Gaussian Splatting (RA-GS), which embeds a physically rigorous two-media refraction model directly into the Gaussian Splatting (GS) reconstruction pipeline.

Rather than relying on computationally expensive per-pixel ray tracing, we formulate an analytical parameter transformation that maps the true underwater position, scale, and opacity of each Gaussian to their apparent states observed through a planar refractive interface.

Through this fully differentiable transformation, true underwater 3D geometry and photorealistic appearance are jointly optimized by directly minimizing the photometric error within the standard GS framework.

This approach relies solely on RGB imagery, eliminating the need for external depth priors or deep learning networks.

Using a physically based, ray-traced synthetic riverbed dataset, we isolate and explicitly correct refractive distortions.

Our method achieves a geometric F1-score of 94\% (10 cm threshold at 10 m depth) and produces high-quality novel view synthesis with a PSNR of 25.9 dB and SSIM of 0.93.

Field experiments on real UAV data corroborate the practical utility for high-precision bathymetric mapping under calm-surface conditions.

By resolving the fundamental refractive difficulty, the proposed framework provides a physically grounded, computationally efficient, and practically useful solution for next-generation photogrammetric bathymetry.

10:00am - 10:30am

Morning Coffee Break
Location: Exhibition Hall "E"

10:00am - 1:30pm

Exhibition
Location: Exhibition Hall "F"

10:30am - 12:00pm

Plenary Session 4
Location: Exhibition Hall "G"

Keynote 1: Mr. Alex Miller
Awards Ceremony:

The Frederick J. Doyle Award
Wang Zhizhuo Award

Keynote 2: Professor Margurite Madden

12:00pm - 1:30pm

Lunch
Location: Exhibition Hall "E"

1:30pm - 3:00pm

WG II/2D: Point Cloud Generation and Processing
Location: 713A

1:30pm - 1:45pm

An Approach for deriving Branch Kinematics of Deciduous Trees from hyper-temporal terrestrial Laser Scanner Data

Anne Bienert, Katja Richter, Sophia Böhme, Hans-Gerd Maas

Dresden University of Technology, Institute of Photogrammetry and Remote Sensing, Germany

Understanding vegetation dynamics in three-dimensional, high-temporal resolution is essential for advancing ecological research and sustainable forest management. This study introduces a novel methodology for tracking branch kinematics in trees using hyper-temporal terrestrial laser scanning (TLS) data. Focusing on a solitary pedunculate oak (Quercus robur) over a one-year period, we employed a geometric feature detection algorithm combined with quantitative structure modeling (QSM) to identify and track distinctive point cloud sections on first- and second-order branches. By leveraging an iterative closest point (ICP) alignment process, branch kinematics were analyzed across multiple epochs, yielding detailed three-dimensional movement trajectories. The results demonstrate that branch movements exhibit screw-shaped patterns. Temporal resolution analysis revealed that a one-week recording interval is sufficient for our study subject to reliably capture kinematic dynamics, whereas longer intervals (e.g., three weeks) result in significant deviations from actual trajectories. The proposed method proved robust against partial occlusions from leaf growth but struggled under extensive occlusions. This research highlights the potential of hyper-temporal TLS for non-contact, high-resolution monitoring of tree canopy dynamics and provides a foundational approach for future studies aimed at modeling vegetation movement and structural changes over time.

1:45pm - 2:00pm

In-Field 3D Wheat Head Instance Segmentation From TLS Point Clouds Using Deep Learning Without Manual Labels

Tomislav Medic¹, Liangliang Nan²

¹ETH Zurich, Switzerland; ²TU Delft, Netherlands

3D instance segmentation for laser scanning (LiDAR) point clouds remains a challenge in many remote sensing-related domains. Successful solutions typically rely on supervised deep learning and manual annotations, and consequently focus on objects that can be well delineated through visual inspection and manual labeling of point clouds. However, for tasks with more complex and cluttered scenes, like in-field plant phenotyping in agriculture, such approaches are often infeasible. In this study, we tackle such a task - in-field wheat head instance segmentation using terrestrial laser scanning (TLS) point clouds. To address the problem and circumvent the need for manual annotations, we propose a novel two-stage pipeline. To obtain the initial 3D instance proposals, the first stage uses 3D-to-2D multi-view projections, the Grounded SAM pipeline for zero-shot 2D object-centric segmentation, and multi-view label fusion. The second stage uses these initial proposals as noisy pseudo-labels to train a supervised 3D panoptic-style segmentation neural network. Our results demonstrate the feasibility of the proposed approach and show significant performance improvements (up to +50\% in F1-score) relative to Wheat3DGS, a recent alternative solution for in-field wheat head instance segmentation without manual 3D annotations based on multi-view RGB images and Gaussian Splatting, showcasing TLS as a competitive sensing alternative. Moreover, the results show that both stages of the proposed pipeline can deliver usable 3D instance segmentation without manual annotations, indicating promising, low-effort transferability to other comparable TLS-based point cloud segmentation tasks.

2:00pm - 2:15pm

Optimal Path Planning for Kinematic Laser Scanning

Julius Knechtel¹, Mohammad Kordgholiabad², Jan-Henrik Haunert¹

¹University of Bonn, Germany; ²Politecnico di Milano, Italy

Prompted by the rapid advancements in software and hardware, 3D building data for numerous different applications is nowadays often captured via mobile or kinematic laser scanning. However, in contrast to other laser scanning methods, there exist only a few approaches tailored for the planning of a kinematic laser scan survey, and none of them provides an optimality guarantee. Therefore, we propose a novel approach based on Mixed Integer Linear Programming (MILP) to find the optimal trajectory for such a survey. To obtain a high-quality point cloud, we account for scanner-related constraints that influence the quality of the resulting point cloud. Moreover, we enable the introduction of tie points to mitigate the effects of uncertainties in the position estimation that are propagated in the acquired data. In our problem formulation, we aim to find the best tour in a properly weighted graph. For this, we propose two different weight settings to either enable a purely length-based optimization or to increase the redundancy in the measurements by incorporating a Visibility Ratio Factor (VRF) into the objective function.

To prove the applicability of our approach for offline panning, we apply our formulation to three different scenarios. In this context, the VRF-based weighting enables a significant speed-up of the solving process while resulting in only slightly prolonged routes. This approach paves the way for applying exact algorithms with an optimality guarantee in the planning process for efficient kinematic laser scanning surveys.

2:15pm - 2:30pm

Non-Contact Modal Analysis of Wind Turbine Blades using Terrestrial Laser Scanner

Martina Göring

Jade Hochschule, Germany

This contribution introduces a novel method for non-contact, marker-free modal analysis of wind turbine blades using terrestrial laser scanning (TLS). As part of a research initiative, TLS's potential for assessing modal properties like natural frequencies and mode shapes—key for extending blade service life—is explored. Traditionally, this analysis relies on numerous accelerometers, incurring high costs and effort. TLS is evaluated as a viable alternative.

In laboratory tests, TLS and photogrammetry were used on a 4-meter test object in vibration. Photogrammetric data, serving as a reference, used 3D coordinates from retroreflective markers for frequency analysis via Fast Fourier Transform (FFT). TLS data were similarly segmented, with frequencies derived using FFT, and both methods showed consistent results, validating TLS's feasibility.

Building on lab results, the method was applied to an 88-meter rotor blade in a field experiment. The laser scanner collected profile data along the blade's longitudinal axis, converted to the object coordinate system. By segmenting the blade, eigenfrequencies were determined. The calculation process was validated with simulations, achieving precise results even with manual blade excitation and amplitudes up to 20 cm.

TLS measurements reveal valuable insights into eigenfrequencies and modal shapes along the blade. This approach offers a cost-effective, efficient alternative to traditional sensor-based analysis, proving its practicality for the wind energy industry.

2:30pm - 2:45pm

Pixel-Accurate Registration of Photogrammetric Images and LiDAR in a Hybrid Airborne Oblique Imaging System

Deyan Deng¹, Rongjun Qin^1,2, Elisa Mariarosaria Farella³, Fabio Remondino³

¹Department of Civil, Environmental and Geodetic Engineering, The Ohio State University, Columbus, USA; ²Department of Electrical and Computer Engineering, The Ohio State University, Columbus, USA; ³3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK), Trento, Italy

Hybrid airborne imaging systems combining oblique cameras and LiDAR sensors offer significant advantages for applications requiring both geometric precision and rich texture information, including infrastructure monitoring, facility surveying, and detailed urban modeling. Despite capturing temporally consistent multi-modal data, achieving pixel-level registration between imagery and LiDAR remains fundamentally challenging due to insufficient calibration infrastructure and the technical complexity of deeply integrating heterogeneous sensors.

A critical bottleneck is that standard photogrammetric workflows exhibit non-linear cumulative drift, particularly across extended flight strips. This spatially varying deformation causes systematic misalignments when photogrammetric reconstructions are overlaid with LiDAR geometry. Conventional approaches applying global rigid transformations fail to address this issue because photogrammetric drift is inherently non-uniform—a single global registration cannot correct localized geometric deviations throughout the scene.

This work introduces a novel view-dependent registration framework that synergizes LiDAR's global geometric fidelity with photogrammetry's local density. Rather than attempting to warp entire models through global transformations, we decompose the registration problem by treating the geometry within each camera frustum as an independent rigid body. Building upon initial georeferencing, we perform fine-grained local SE(3) rigid registration to anchor each Multi-View Stereo (MVS) depth map directly to sparse LiDAR geometry within its corresponding viewing frustum.

This localized approach enables pixel-accurate alignment within individual frames while effectively compensating for accumulated photogrammetric drift and interpolation errors. By addressing registration at the frustum level rather than globally, our method achieves practical pixel-level fusion of hybrid airborne datasets, unlocking the full potential of integrated camera-LiDAR systems for high-precision geospatial applications.

2:45pm - 3:00pm

Integrating Airborne LiDAR and OpenStreetMap Features for Automated Hydrological Conditioning of Urban Digital Elevation Models

Tommaso Destefanis^1,2,3, Elena Durando³, Matilde Oliveti³, Emanuele Artù Cassin³, Martina Di Rita²

¹Sapienza Università di Roma, DICEA, Rome, Italy; ²Politecnico di Torino, SDG11Lab, Interuniversity Department of Regional and Urban Studies and Planning (DIST), Turin, Italy; ³Ithaca S.r.l., Turin, Italy

High-resolution Digital Elevation Models (DEMs) are essential for urban flood modelling, where small elevation differences govern surface drainage and inundation extent. DEMs frequently contain hydrological inconsistencies: elevated infrastructure such as bridges, tunnels and culverts may appear as artificial barriers disrupting flow continuity, while linear structures such as retaining walls may be underrepresented depending on spatial resolution or point density. These inconsistencies propagate errors through downstream hydraulic simulations.

This paper presents an automated, open-source Python pipeline for generating hydrologically conditioned DEMs by integrating classified airborne LiDAR data with OpenStreetMap (OSM) infrastructure features. The workflow is tested on a 16 km2 area of central Copenhagen using a 2023 national LiDAR acquisition at 13.5 pts/m2. A 0.5 m resolution DSM is generated from LiDAR ground and building classes via Inverse Distance Weighting interpolation, with Nearest Neighbour gap-filling for hydraulic model continuity. Hydrological conditioning is performed through four sequential operations: bridge burning, tunnel enforcing, culvert enforcing, and barrier rasterization. Barrier top-of-wall elevations are estimated directly from the LiDAR point cloud.

Vertical accuracy is assessed by pixel-wise comparison against the Danish national terrain model DHM/Terraen (NMAD = 0.066 m, LE90 = 0.265 m) and by independent checkpoint validation against the HojdefikspunktDanmark geodetic network. The inclusion of shallow tunnel underpasses proved a significant addition: tunnel features alone contributed approximately half of the total depression volume reduction. The conditioned DSM is designed as input for an urban flood simulation chain; full hydraulic validation will be performed by the Danish Meteorological Institute within the CLEAR-EO project.

1:30pm - 3:00pm

WG III/1K: Remote Sensing Data Processing and Understanding
Location: 713B

1:30pm - 1:45pm

Automated kelp mapping from Sentinel-2 satellite imagery

Mohsen Ghanbari¹, Neil Ernst², Taylor Denouden³, Luba Reshitnyk³, Piper Steffen¹, Alena Wachmann¹, Eduardo Loos⁴, Margot Hessing-Lewis³, Maycira Costa¹

¹Department of Geography, University of Victoria; ²Department of Computer Science, University of Victoria; ³Hakai Institute; ⁴Vertex Resource Group

Kelp forests are vital marine habitats with significant ecological, cultural, and economic importance. These ecosystems, found along coastlines, are susceptible to regional and global stressors (such as coastal development and climate change). This paper presents Satellite-based Kelp Mapping (SKeMa), a novel framework for automatically mapping canopy-forming kelp forests using Sentinel-2 satellite imagery along the British Columbia coast, specifically to support First Nations marine planning for these species. SKeMa employs a deep learning semantic segmentation model, offering an efficient alternative to traditional, labor-intensive, and time-consuming kelp mapping methods. A cross-validation study with independent test sets yields a mean Intersection over Union (IoU) of 0.5326, demonstrating the model’s capability to detect kelp canopies across diverse coastal regions, particularly for larger kelp beds.

1:45pm - 2:00pm

Addressing Spatial and Temporal Uncertainty in Predicting Sea Surface Temperature using Extended DualSeq a Novel Ensemble Method

Lalita Chaudhary

IILM University, India

The research extended DualSeq, an advanced

machine-learning model for predicting sea surface temperature

(SST), crucial for understanding oceanic ecosystems and climate

patterns. Traditional SST prediction methods typically employ

time-series regressions focusing on nonlinear temporal patterns,

but often overlook vital spatial correlations in SST dynamics,

limiting their accuracy. DualSeq addresses this by integrating

spatial and temporal uncertainty quantification, with a particular

focus on the Arabian Sea. It utilises LSTM and GRU networks to

effectively harness the SEVIRI-IO-SST dataset, which contains

five years of remote-sensing data. A distinctive aspect of DualSeq

is its incorporation of a weighted normalized linear equation,

which significantly improves the accuracy of SST predictions and

enhances the dependability of spatial and temporal uncertainty

assessments. The model stands out in its ability to forecast up to

one month in advance, significantly outperforming others. For 1-

month forecasts, DualSeq shows a remarkable R² value of 0.983,

surpassing the LSTM-attention model by 7.4% and reducing

RMSE and MAE by about 65.4% and 82.4%, respectively.

This performance illustrates DualSeq’s superior capability in

capturing both short-term and long-term uncertainties in SST

forecasting.

2:00pm - 2:15pm

From global to station-centric models: improved chlorophyll-a prediction in the Gulf of İzmir using Sentinel-2

Coskun Ozkan¹, Filiz Sunar², İbrahim Tan³

¹Erciyes University, Turkiye; ²İstanbul Technical University, Turkiye; ³TUBITAK MRC Marine and Coastal Research Group, Turkiye

This study presents a Station-Centric Geographically weighted Regression (SCGWR) framework for Chlorophyll-a prediction in the optically complex waters of the Gulf of İzmir using Sentinel-2 imagery. Unlike traditional global multiple regression model, the proposed approach calibrates an individual model for each sampling station while using 16 outer Moore-neighbor pixels (range 2) from surrounding stations as independent validation data in the model optimization, thereby preventing adjacency bias and information leakage in performance assessment. Compared to multiple linear regression (MLR) against 20 independent in-situ measurements, SCGWR method offers a robust, reproducible alternative for local-scale water-quality mapping in coastal environments where bio-optical variability is high.

2:15pm - 2:30pm

Evaluating the Impact of Super-Resolution for Coastal Boundary Segmentation Using Deep Learning for High-Resolution Imagery

Marc-André Blais^1,2, Moulay Akhloufi^1,2

¹Université de Moncton, Canada; ²Perception, Robotics and Intelligent Machines (PRIME)

Coastal areas play an important role economically, socially and environmentally due to their many functions.

However, these regions are at risk of erosion, which is further exacerbated by human-driven climate change.

Tracking and monitoring coastal boundaries enable efficient allocation of conservation and protection efforts.

Due to the vast size and complexity of coastal areas, on-site monitoring to track erosion is inefficient.

Artificial intelligence has shown impressive results in segmenting and extracting these boundaries from remote sensing imagery.

Historical remote sensing data make it possible to track long-term erosion but remain challenging due to the coarse resolution of older data.

Our work proposes studying the impact of super-resolution on coastal boundary segmentation using high-resolution imagery.

ESRGAN and SRCNN have proven highly beneficial in improving the quality of coarse-resolution samples, achieving superior performance compared to bicubic interpolation across scaling factors ranging from ×2 to ×12.

ESRGAN super-resolved samples achieved F1-scores ranging from 97.75% to 89.92% for scaling factors ×2 to ×12, while bicubic interpolation achieved between 97.34% and 65.27%.

These improvements demonstrate that SR enhances boundary delineation and robustness across scales.

Our work also explores the applicability of tracking erosion through historical data.

Results demonstrate a coastal boundary change of 0.23 m per year over seven years, which is on par with expected values.

2:30pm - 2:45pm

Region-aware full-waveform figure descriptor and convolutional vision transformer framework for underwater terrain classification

You-Chen Liu, Tzu-Yi Chuang

National Yang Ming Chiao Tung University, Taiwan

This study introduces a novel framework that integrates a region-aware Full-Waveform Figure Descriptor (FWFD) with a Convolutional Vision Transformer (CvT) for underwater terrain classification using bathymetric LiDAR data. The FWFD converts sequential waveform returns into a multi-directional image-like representation, enabling the preservation of spatial correlations among neighboring laser footprints. By combining convolutional token embedding and self-attention mechanisms, the CvT effectively learns both local and global waveform features. Experiments on a YellowScan full-waveform LiDAR dataset over coastal Australia demonstrate that the proposed FWFD-CvT model achieves 95.55 % overall accuracy under moderate waveform smoothing and exceeds 98 % accuracy for underwater objects. The framework shows robust performance across complex seafloor morphologies and maintains consistency in mixed land-water environments. This research contributes a transferable paradigm for region-aware waveform interpretation and establishes a foundation for extending full-waveform analysis to terrestrial, multispectral, and topographic LiDAR applications requiring fine-scale surface characterization.

2:45pm - 3:00pm

Integrated Geoinformatics for Reconstructing the Cultural Dynamics in Coastal and Shallow Submerged Sites

Nikos Papadopoulos, Nasos Argyriou

GeoSat ReSeArch Lab, Institute for Mediterranean Studies, Foundation for Research and Technology Hellas -, Greece

Shallow-water cultural heritage occupies a dynamic land-sea interface where coastal erosion, sediment transport, limited visibility and burial processes hinder conventional archaeological investigation. This paper presents an integrated geoinformatics framework for reconstructing the cultural dynamics of coastal and shallow submerged archaeological landscapes in southeastern Crete, Greece. The methodology combines multispectral remote sensing, satellite-derived and in situ bathymetry, UAV and shallow-water photogrammetry, marine geophysics, GIS-based coastal vulnerability, fuzzy logic multi-criteria risk assessment and digital dissemination through augmented reality. The workflow was applied at five representative case studies, including Stomio, Ierapetra harbour, Koufonisi, Chryse and associated coastal sectors. Optical data from Pleiades-1A, PlanetScope, and Sentinel-2A were used for shoreline mapping, feature enhancement, and satellite-derived bathymetry. Geophysical and bathymetric surveys covered more the 4.5 and 10 hectares respectively. UAV photogrammetry produced high resolution orthomosaics, while the proposed experimental Remote Control (RC) boat extends documentation potential to very shallow submerged environments. Integrated interpretation clarified palaeo-shorelines, submerged harbour structures, fish tanks, architectural continuities and archaeological risk hotspots. The results demonstrate a scalable and transferable framework for documenting, interpreting, monitoring, and communicating endangered shallow-water cultural landscapes.

1:30pm - 3:00pm

ICWG II/Ib: Digital Construction: Reality Capture, Automated Inspection, and Integration to BIM
Location: 714A

1:30pm - 1:45pm

Digital Twin Approach to Accessibility Assessment of Public Transport

Kourosh Khoshelham, Junjie Zhang, Marko Radanovic, Zizhao Li, Shuyu Zhu, Yuan Zhao, Martin Tomko

University of Melbourne, Australia

This paper presents an efficient approach to the accessibility assessment of tram transport based on a simulation within a digital twin environment. We propose a novel framework that integrates several advanced data acquisition and processing steps: mobile mapping of the tram routes, detection of rail tracks and tram stops, and the final assessment of tram accessibility by simulating the MAL deployment in the digital twin. Our experimental evaluation demonstrates that the digital twin provides a practical and reliable tool for assessing tram accessibility.

1:45pm - 2:00pm

Graph-based topology retrieval and constructive solid geometry for structural BIM refinement

Muataz Safaa Abed Albadri, Patricia González-Cabaleiro, Antonio Fernández, Lucía Díaz-Vilariño

CINTECX, Universidade de Vigo, GeoTECH group, 36310 Vigo, Spain

As-built Building Information Models (BIMs) are crucial for building digitalisation, structural analysis, and life cycle management. Despite recent advances, automated reconstruction of structural elements from point clouds remains a challenging task, particularly in ensuring geometric accuracy and topological consistency within a storey and across consecutive storeys. This paper proposes an automated method for refining topological inconsistency between columns, beams, and slabs, ensuring consistent as-built BIMs. The method places Constructive Solid Geometry (CSG) at the core of the refinement process, driven by fundamental structural principles. The method starts by creating solid rectangular prisms from labelled point clouds. Beams are then aligned both vertically and horizontally within each storey. Columns are vertically aligned across consecutive storeys. Topology relationships between the elements are retrieved and encoded in graphs. These graphs, together with a set of Boolean operations, are used to resolve gaps and trim overlaps between the connected elements. The refined elements are represented in accordance with the IFC standards. The proposed method was validated on two multi-storey case studies representing frame and flat-slab building structures. Both qualitative and quantitative evaluations confirmed the effectiveness of the approach, achieving significant geometric accuracy and topological consistency. In addition, the method exhibits efficient runtime performance, indicating its promise for scalable Scan-to-BIM automation.

2:00pm - 2:15pm

Integrating Photogrammetry and Topological Data Analysis within a Digital Twin Framework for Missing Bolt Detection in Bridges

Vahid Mousavi¹, Maria Rashidi^1,2

¹Centre for Infrastructure Engineering (CIE), Western Sydney University, Penrith, NSW 2751, Australia; ²Urban Transformations Research Centre (UTRC), Western Sydney University, Parramatta, NSW 2150, Australia

Bridge infrastructure plays a critical role in transportation networks, requiring reliable and efficient methods to detect missing bolts to ensure structural integrity and prevent failures. This study proposed a novel methodology integrating point cloud-based Digital Twins (DTs) with Topological Data Analysis (TDA), specifically using Persistent Homology (PH), for robust and accurate missing bolt detection. The framework combines 3D photogrammetric reconstruction to generate point cloud-based DTs, Convolutional Neural Networks (CNNs) for precise bolt localization, and PH to identify and quantify missing bolts. Through parameter evaluations and a real-world bridge case study, the proposed approach demonstrated high detection accuracy, effectively identifying missing bolts with a false positive rate below 10%. These findings confirm the reliability and effectiveness of integrating DTs with TDA as an advanced data-driven approach for automated structural inspection and bridge health monitoring.

2:15pm - 2:30pm

LGFormer: lightweight local-global transformer for indoor point cloud segmentation

Yuwei Zhang¹, Fashuai Li², Yiyi Liu¹, Ping Wang¹, Yuwei Chen³, Biao Xiong¹

¹Wuhan University of Technology; ²The Advanced Laser Technology Laboratory of Anhui Province; ³Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences

Semantic segmentation of indoor point clouds is a fundamental task in 3D scene understanding, supporting applications such as virtual reality, indoor navigation, and building management. Point-based transformer models achieve high accuracy but require substantial computational resources, while superpoint-based methods are more efficient yet often less precise. To address this trade-off, we propose LGFormer, a lightweight framework that integrates Graph Convolutional Networks (GCN) and transformers to jointly capture local and global contextual features. The method constructs a superpoint-based topology graph, where local features are extracted using GCN and global dependencies are modeled through transformer layers. Experiments on the S3DIS and ScanNet++ datasets demonstrate that LGFormer achieves 90.7% and 88.5% segmentation accuracy, respectively, while reducing inference time by more than 99% compared with point-based transformers. By effectively leveraging superpoints and local-global feature fusion, LGFormer dlivers competitive accuracy with significantly lower computational cost, making it suitable for large-scale indoor scene analysis.

2:30pm - 2:45pm

Dataset review of exposed reinforcement in concrete bridges and challenges for automated damage detection in UAS-assisted bridge inspections

Erkki Tobias Bartczak, Maarten Bassier, Maarten Vergauwen

Department of Civil Engineering, Faculty of Engineering Technology, Geomatics Research Group, KU Leuven,Gent, Belgium

Corroding reinforcement leads to cross section loss and reduced structural capacity of concrete bridges. Detecting exposed rebars

(ER) is crucial during bridge inspection to plan countermeasures early and prevent further corrosion. With advancements in deep

learning, several public datasets derived from inspection imagery have been released to identify ER and other concrete damage

automatically. At the same time, Uncrewed Aerial Systems (UAS) have become more capable of navigating even underneath the

bridge deck. This combination holds promise to improve efficiency of bridge inspection methods, but obtained imagery differs

from available datasets, featuring very small damages and complex backgrounds. To address this mismatch, this work reviews

publicly available ER datasets, presents a UAS-based bridge inspection dataset for evaluating ER damage (UBID-ER-val), and

quantifies similarities and differences between them. We train several YOLOv8 models on conventional inspection documentation

images and benchmark the reviewed datasets, scoring F2 = 0.229 at S2DS, F2 = 0.430 at CODEBRIM, F2 = 0.584 at Dacl10k,

compared to F2 = 0.505 at UBID-ER-val. We analyse factors influencing performance and find that tiled inference raises Recall

(+0.166) but drastically reduces Precision (−0.309), while matching training and validation image resolution underperforms across

all datasets (−0.061 to −0.129). The differences in best-performing combinations underscore the underlying domain shift that

complicates practical deployment. As a practical outcome of this work, UBID-ER-val is made publicly available to enable objective

benchmarking of ER detection models and to assess their reliability under field conditions.

2:45pm - 3:00pm

Domain-Adaptive Object Detection of Electrical Facilities for Enhanced Semantic Indoor Models

Lukas Arzoumanidis¹, Weilian Li^1,2, Youness Dehbi¹

¹HafenCity University Hamburg, Computational Methods Lab, Germany; ²Southwest Jiaotong University, Faculty of Geosciences and Engineering, China

Detecting visible electrical utilities is a prerequisite for developing advanced reasoning strategies to reconstruct hidden in-wall networks. This paper investigates the detection of visible power-related utilities using a domain-adaptive deep learning-based vision pipeline based on the YOLOv11-L, object detection model. Four publicly available datasets containing power sockets, power strips, and light switches were curated, relabeled, and merged into a unified training dataset of 3,459 images. The resulting model achieved a mean average precision (mAP) of 0.74 for power sockets and strips and 0.98 for light switches, demonstrating strong detection performance. Real-time evaluation on a low-cost smartphone via the Ultralytics HUB App indicates reliable detection in small-scale real-world environments and detected utilities could be integrated automatically into semantic indoor models using a marker-less referencing approach. The work further highlights broader applications, including Augmented Reality-based visualization to reduce cognitive load for project managers and inspectors or construction workers and electricians, and its potential use as input for existing and future reasoning methods for hidden-utility reconstruction. The prepared dataset, trained model and source code is available at: https://github.com/hcu-cml/indoor-electrical-facility-detection

1:30pm - 3:00pm

WG III/8D: Remote Sensing for Agricultural and Natural Ecosystems
Location: 714B

1:30pm - 1:45pm

Spatial Aerodynamic Roughness of Forested Landscapes from Airborne LiDAR

Mahmoud Ahmed¹, Joris Timmermans¹, Roderik Lindenbergh¹, Massimo Menenti^1,2

¹Department of Geoscience and Remote Sensing, Delft University of Technology, The Netherlands; ²National Key Laboratory of Remote Sensing and Digital Earth, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China

Accurately representing forest canopies in atmospheric models remains a major challenge due to the complex ways in which trees interact with airflow and modulate surface--atmosphere exchanges. Aerodynamic roughness is a key control variable in modelling frameworks related to air quality, meteorology, and atmospheric transport processes. In this study, we develop a physically based and spatially resolved framework to estimate aerodynamic roughness length from remote sensing observations. Specifically, using AHN (Actueel Hoogtebestand Nederland) airborne laser scanning data over a coniferous forest in Loobos, located within the Veluwe Natura 2000 region in the central Netherlands, we derive geometric roughness parameters and compare them qualitatively against eddy-covariance (EC) tower measurements at the site.

Results show that LiDAR-based roughness captures strong directional and structural variability driven by forest stand height and canopy heterogeneity, patterns that closely align with the anisotropy observed in the EC-derived displacement height and roughness length. Seasonal differences between leaf-on and leaf-off conditions further demonstrate the importance of canopy phenology in shaping aerodynamic behaviour. The spatial patterns resolved by the AHN data underscore the capacity of high-resolution laser scanning to reveal fine-scale canopy--atmosphere interactions that are entirely missed by traditional land-use--based roughness representations.

Additional opportunities remain for integrating complementary remote sensing observations (e.g., multispectral vegetation properties) to enhance the dynamical fidelity of the roughness estimates. The proposed framework provides an observation-driven pathway for parameterizing surface roughness, offering substantial potential for improving land-use representations in wind-flow and chemical transport models such as LOTOS--EUROS.

1:45pm - 2:00pm

Forest Canopy Height Mapping in Tanzanian Tropical Rainforests Using Multimodal Remote Sensing Data and Machine Learning

Soheil Zaghain^1,2, Seyed Ehsan Khankeshizadeh^1,2, Sadegh Jamali¹, Torbern Tagesson³, Ernest William Mauya⁴, Ali Mohammadzadeh², Filbert Francis¹

¹Department of Technology and Society, Faculty of Engineering, Lund University, P.O. Box 118, 221 00 Lund, Sweden.; ²Department of Photogrammetry and Remote Sensing, Faculty of Geodesy and Geomatics Engineering, K. N. Toosi University of Technology, Tehran 19967-15443, Iran.; ³Department of of Earth and Environmental Sciences, Lund University, Sölvegatan 12, SE-223 62 Lund, Sweden.; ⁴Department of Forest Engineering and Wood Sciences, College of Forestry, Wildlife and Tourism, Sokoine University of Agriculture, Morogoro, Tanzania.

Forest canopy height (FCH) is a critical biophysical parameter that characterizes forest structure and provides fundamental information for estimating above-ground biomass and carbon stocks. The Global Ecosystem Dynamics Investigation (GEDI) Level 2A (L2A) product offers accurate canopy height observations; however, its point-based nature constrains spatial continuity in FCH mapping. This study integrates the multimodal remote sensing datasets for continuous FCH mapping in Tanzania’s West Usambara (WUSA) forest, recognized globally for its rich biodiversity and ecological significance. Hence, remote sensing data, including Sentinel-1 polarizations (VV and VH), Sentinel-2 spectral bands and vegetation indices, and the SRTM digital elevation model (DEM), were integrated and matched with GEDI canopy height data used as reference for FCH modelling. The optimal feature set was derived by evaluating the performance of several feature selection and extraction methods, including Principal Component Analysis (PCA), Minimum Noise Fraction (MNF), Recursive Feature Elimination (RFE), Sequential Feature Selection (SFS), and the Selected K-Best approach using F-value and mutual information scoring functions. The feature set derived from RFE, comprising ten features from all data sources, demonstrated the highest accuracy and reliability in FCH modelling. Subsequently, four machine learning algorithms, including Random Forest (RF), Gradient Boosting Regressor (GBR), Support Vector Regressor (SVR), and Ordinary Least Squares (OLS), were evaluated for FCH modelling. Accordingly, RF achieved higher R² than GBR, SVR, and OLS, with differences of 0.9%, 8.7%, and 16.4%, respectively. Therefore, the RF model, as the most reliable model, was employed for FCH mapping across the WUSA forest.

2:00pm - 2:15pm

Comparing DeepLabv3+ and Depth Anything V2 on Canopy Height Model Prediction on a Continental Scale Dataset of Australia

Kevin Yuelin Qiu¹, Rewanth Ravindran², Nicolas Pucino^3,4,5, Dimitri Bulatov¹, Shaun R. Levick⁶, Martin Brandt⁷, Dorota Iwaszczuk², Tim McVicar³

¹Scene Analysis Department, Fraunhofer IOSB Ettlingen, Germany; ²Remote Sensing and Image Analysis, Technical University of Darmstadt, Germany; ³CSIRO Environment, Canberra, ACT, Australia; ⁴Fenner School of Environment and Society, Australian National University, Canberra, ACT, Australia; ⁵Climate Friendly Pty Ltd, Sydney, NSW, Australia; ⁶CSIRO Environment, Urrbrae, SA, Australia; ⁷Department of Geosciences and Natural Resource Management, University of Copenhagen, Copenhagen, Denmark

Canopy height models (CHMs) are raster maps representing normalized tree canopy height above ground and are often used as co-products for estimating carbon storage, forest degradation, and biodiversity at regional to global scales. While airborne LiDAR delivers the most accurate canopy height (CH) measurements, its high cost and limited temporal coverage motivate the use of spaceborne (multispectral) imagery combined with machine learning.

In this study, we compare two distinct deep-learning approaches for continental-scale CHM estimation from 3 m PlanetScope imagery: (1) a CNN-based regression model (DeepLabv3+), and (2) a monocular depth-estimation model (Depth Anything V2) based on a foundation model. We train/fine-tune both models on a curated dataset of 16,973 pairs of airborne point cloud-derived CHMs and PlanetScope imagery of Australia using a stratified sampling scheme to ensure balanced representation of vegetation structural classes. We then evaluate their generalizability on independent validation sets across Australia, across different heights, and under limited-data scenarios.

Through extensive quantitative and qualitative analysis, we show that the DeepLab-based regression model outperforms Depth Anything across all evaluation metrics, partly because it can incorporate additional spectral channels. DeepLab also learns more effectively from less data. On our dataset, the conventional CNN-based regression model performs better than the fine-tuned foundation model.

2:15pm - 2:30pm

Data-Driven vs Functional Approaches for Regionally Transferable Biomass Modeling Using Airborne LiDAR

Maxim Okhrimenko^1,2, Guillermo Castilla², Craig Coburn¹, Chris Hopkinson¹

¹University of Lethbridge, Canada; ²Canadian Forest Service, Canada

To address the critical challenge of regional transferability for ALS-based above-ground biomass (AGB) models, we developed and applied a rigorous leave-one-region-out cross-validation (LORO-CV) framework. This protocol integrates a <1 SE “near-zero” bias filter to ensure models are not just accurate, but statistically free of regional bias.

With this framework, we compared two distinct modeling methods: a data-driven Best-Subset Selection (BSS) method and a Functional Regression (FR) method. The analysis was based on 163 field plots and co-located multispectral Titan ALS data from four regions in the Taiga Plains ecozone, Canada.

The BSS method identified a transferable linear model using height skewness, p95, and an intensity-weighted metric, which achieved 19.3% LORO-CV %RMSE and 2.0% mean absolute bias. Crucially, it passed our <1 SE bias screen in all regions. The FR model, relying only on height, achieved 22.4% LORO-CV %RMSE (4.1% bias) but failed the bias screen in two regions.

Our findings demonstrate that a systematic, bias-controlled data-driven method is effective for producing regionally transferable models. The results highlight the critical importance of ALS intensity metrics for this success, while also showing that the data-driven method currently surpasses the functional approach.

2:30pm - 2:45pm

Optimization of the National Biomass Allometric Equation Using Remote Sensing Data

Shweta Parajuli¹, Tramo K. Remmel², Richard L. Bello³

¹York University, Canada; ²York University, Canada; ³York University, Canada

The role of forests in carbon sequestration and regulation is important to understand, given the alarming rate of global warming caused by greenhouse gases. Understanding the structural characteristics of trees can help assess the potential of forests for carbon storage. Light Detection and Ranging (LiDAR) has emerged as a powerful remote sensing tool that is capable of providing detailed three-dimensional information of the forest. The increasing availability of aerial LiDAR data has provided opportunities to estimate the forest biomass over a larger extent. This study utilizes the available LiDAR data from the provincial repository of geospatial data to estimate the diameter at breast height (DBH), which is a key parameter in existing biomass allometric models. LiDAR-derived tree metrics were integrated with the optical images to further differentiate the forest type to assess how it influences the aboveground biomass estimates in a heterogeneous mixed-wood forest. This research contributes to improving our understanding of LiDAR's potential for estimating DBH, an area that has not been explored much. It also demonstrates how existing global biomass allometric equations can be utilized in combination with remote sensing technology to provide a pathway to a larger extent and an efficient method of biomass estimation across diverse ecosystems.

2:45pm - 3:00pm

Turning rural infrastructure into smart sensors: high‑frequency agricultural monitoring for next‑generation precision farming

Xingli Qin¹, Bingfang Wu^1,2, Miao Zhang^1,2, Fangming Wu¹, Mengxiao Li^1,2, Hongwei Zeng^1,2, Fuyou Tian¹, Kaimin Sun³

¹State Key Laboratory of Remote Sensing and Digital Earth, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100101, China; ²College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China; ³State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing, Wuhan University, Wuhan, 430079, China

Communication towers equipped with cameras are widely distributed across rural landscapes but remain largely unused for scientific observation. This presentation introduces an AI-driven framework that transforms such existing infrastructure into a high-frequency, real-time agricultural monitoring system, complementing traditional satellite and UAV remote sensing.

The proposed system resolves three fundamental challenges that hinder tower-based sensing: (1) precise georeferencing of highly oblique imagery through a quaternion-based spatial transformation; (2) automated delineation of cultivated parcels via a GIS-guided, iterative segmentation process integrating the Segment Anything Model (SAM); and (3) intelligent recognition of crop types, growth stages, and farming activities using a multimodal large language model that fuses time-series imagery with contextual field data.

Validated through deployments in varied agricultural regions of China, the framework demonstrates stable operation and parcel-level accuracy for continuous monitoring within 1–2 km of each tower. The results indicate a practical pathway toward scalable, cost‑efficient, and autonomous agricultural information acquisition at high spatio‑temporal resolution.

1:30pm - 3:00pm

ICWG III/IVa-D: Disaster Management
Location: 715A

1:30pm - 1:45pm

A Deep Learning Framework for Rapid Building Damage Detection through Multimodal Data Fusion: Application to the 2025 Myanmar Earthquake

Luigi Russo¹, Deodato Tapete², Silvia Liberata Ullo³, Paolo Gamba¹

¹University of Pavia, Italy; ²Italian Space Agency (ASI), Italy; ³University of Sannio, Italy

Rapid and reliable assessment of building damage after major earthquakes is essential for effective emergency response and recovery planning. This study formulates post-disaster building damage detection (BDD) as a binary image classification task (damaged vs. undamaged buildings) using multimodal satellite data and a unified ResNet-18 backbone to enable a controlled comparison of fusion strategies. The analysis focuses on the Mw 7.7 Myanmar earthquake of 28 March 2025 and integrates post-event COSMO-SkyMed Second Generation (CSG) dual-polarization (HH, HV) SAR imagery, Maxar optical data, OpenStreetMap (OSM) building footprints, and UNOSAT damage annotations. Three fusion paradigms are evaluated: Early Fusion (EF), Late Fusion (LF), and a novel Middle Fusion (MF) approach. The proposed MF framework introduces a Footprint-Guided Cross-Attention (FGCA) mechanism that uses building geometry as a spatial prior to guide feature-level interaction between SAR and optical representations. Five-fold cross-validation results show that MF consistently outperforms EF and LF, achieving higher precision, F1-score, and robustness across modality configurations. By jointly exploiting SAR structural sensitivity, optical detail, and footprint-based spatial context, the proposed Footprint-Guided Middle Fusion (FGMF) framework enables accurate and scalable building damage mapping from heterogeneous Earth Observation (EO) data.

1:45pm - 2:00pm

Rapid Building Damage Detection from Remote Sensing Images : a Novel Lightweight Network with Contrastive Learning

Wei Li, Guorui Ma, Lunjun Fan, Haiming Zhang

State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University

Accurate and timely building damage detection (BDD) is crucial for disaster emergency response. Although deep learning-based change detection methods have made significant progress in remote sensing, their practical application in disasters still faces two major challenges: (1) Existing high‑accuracy models are typically computationally complex and difficult to deploy for real‑time inference on edge devices.. (2) Model performance heavily relies on large amounts of annotated data, but disaster data are extremely scarce. To address these challenges, this paper proposes a novel lightweight Local‑Global Interaction Network (LGINet) for efficient BDD. The core of LGINet is the proposed Local‑Global Interaction Unit (LGIU), which achieves efficient fusion of detailed and contextual features through a dual‑path architecture and channel‑wise cross‑attention mechanism. Furthermore, a Frequency Difference Enhancement Unit (FDEU) is proposed to generate more accurate damage features, and contrastive learning is employed to reduce the model’s sensitivity to weather conditions and its reliance on annotated data. Experimental results on the xBD and WBD datasets show that LGINet achieves F1-scores of 81.76% and 80.91%, respectively, with an inference speed of 47.83 FPS. It achieves the best balance between accuracy and efficiency, outperforming existing methods.

2:00pm - 2:15pm

Fusion of AlphaEarth embeddings and Sentinel-1 time-series for conflict-related urban damage mapping

Jakub Ślesiński, Kinga Karwowska, Magdalena Lewińska

Military University of Technology, Poland

Recent armed conflicts have increased the need for reliable, spatially explicit damage mapping to support situational awareness, humanitarian assessment, and reconstruction planning. This contribution presents a hybrid change-detection framework for conflict-related urban damage mapping that combines AlphaEarth Foundations embedding change with Sentinel-1 SAR change indices. AlphaEarth provides semantically informed annual embeddings, while Sentinel-1 time series contribute all-weather sensitivity to structural change.

The study compares several embedding-based change metrics and combines the selected AlphaEarth indicator with SAR-derived change measures through simple scalar fusion rules. The proposed framework is designed to preserve the sharp sensitivity of SAR to abrupt structural changes while reducing part of the diffuse background response that often complicates single-source interpretation.

Experiments are conducted over war-affected urban areas in Ukraine, with illustrative examples from Bakhmut and Avdiivka. The results show that AlphaEarth and Sentinel-1 provide complementary information and that their fusion improves the spatial specificity of detected damage patterns. The contribution highlights the potential of combining foundation-model representations with radar time series for operational damage mapping in conflict settings.

2:15pm - 2:30pm

Street-Level Disaster Location Detection Using Image Matching of Social Media Images

Pai-Hui Hsu¹, Manikandan Sathianarayanan^1,2

¹National Taiwan University, Taiwan; ²Research Centre for Humanities and Social Sciences (RCHSS), Academia Sinica, Taiwan

Rapid and precise identification of disaster locations is essential for efficient emergency response and management. However, during the immediate post-disaster phase, the lack of timely and reliable information often impedes relief operations. Although satellite imagery and ground-based sensing systems provide valuable data, their effectiveness is constrained by factors such as time delays, high costs, and limited spatial resolution. At the same time, social media platforms such as X (formerly Twitter), Instagram, and Facebook have become valuable channels for real-time, crowd-sourced information. Users function as "human sensors," contributing extensive on-the-ground insights. Much of this content is visual—images that capture the effects of disasters with finer street-level detail and immediacy than textual posts. In this study, we propose a novel, deep learning-based image-matching framework designed to pinpoint the geographic coordinates of disaster events from social media images with street-level accuracy. The core of our approach is to match a query disaster image against a database of georeferenced Google Street View (GSV) imagery. The methodology consists of image pre-processing and feature enhancement; deep feature extraction and matching, and location inference and verification. The preliminary results on an external validation dataset are highly promising, demonstrating a high detection rate of ~90% with confidence scores above 0.9. The model proves resilient to key challenges such as partial occlusion and varied lighting, accurately segmenting multiple objects against complex backgrounds of damaged structures and flooded areas.

2:30pm - 2:45pm

Multi-Modal Attention for Automated Disaster Damage Assessment Using Remote Sensing Imagery and Deep Learning

Tewodros Gebre¹, Jagrati Talreja¹, Leila Hashemi-Beni^1,2

¹North Carolina A&T State University, Greensboro, NC, USA; ²United Nations University Institute for Water, Environment and Health, Richmond Hill, ON, Canada

The paper presents a novel deep learning framework for automated disaster damage assessment using remote sensing imagery. It addresses the challenge of timely and accurate damage classification in the aftermath of disasters, aiming to improve emergency response and resource allocation. The proposed system leverages both pre- and post-disaster satellite images to assess building damage across four categories: no damage, minor damage, major damage, and destroyed. The central innovation lies in the development of a multi-modal attention mechanism, which integrates features from both pre- and post-event images to enhance damage detection. A lightweight ConvNeXT-Tiny architecture serves as the backbone, ensuring efficient processing while maintaining high performance.

Key contributions of this work include: (1) a cross-attention module that fuses multi-modal data, (2) an optimized preprocessing pipeline designed for large-scale datasets, and (3) novel data augmentation techniques that improve the model’s robustness. Experiments on a large-scale disaster damage dataset show the model achieves an impressive 94.90% classification accuracy, with strong performance in discriminating damage levels and resilience to incomplete or corrupted data. This framework represents a significant step forward in disaster response, offering a scalable solution for real-time damage detection. The research demonstrates the potential of combining remote sensing, multi-temporal imagery, and deep learning to expedite and improve disaster damage assessment, ultimately supporting more efficient emergency management.

2:45pm - 3:00pm

AI-based multi-temporal analysis of urban dynamics using Sentinel-2 data. A case study over Osmaniye, Turkey

Antonino Pigna^1,2, Pietro Di Stasio^1,2, Deodato Tapete², Paolo Gamba³, Silvia Liberata Ullo¹

¹University of Sannio, Italy; ²Italian Space Agency, Italy; ³University of Pavia, Italy

Urban areas evolve rapidly, often increasing exposure to natural hazards, especially in seismically active regions such as southern Turkey. This contribution presents an AI-based workflow for multi-temporal analysis of urban expansion in the city of Osmaniye between 2015 and 2025. The methodology integrates Sentinel-2 multispectral imagery with a U-Net convolutional neural network trained on World Settlement Footprint (WSF) masks for binary segmentation of built-up versus non-built-up areas. After training on 2015 and 2019 data, the model was applied to the full temporal series to assess its generalisation capability and to quantify long-term urban growth.

Results show a substantial increase in built-up surfaces over the decade, with a temporary decline linked to the 2023 earthquake and a marked acceleration during the reconstruction phase. Beyond the quantitative trends, the spatial patterns identified by the model highlight how urban expansion has progressively shifted from the central districts toward peripheral zones, revealing both densification processes and outward sprawl. These observations provide valuable indications on how development pressures interact with seismic vulnerability.

The approach demonstrates the potential of AI and open satellite data for large-scale, reproducible monitoring of urban dynamics and for supporting risk-informed urban planning. Because it relies entirely on open-source datasets and tools, the workflow can be easily transferred to other hazard-prone regions, offering a scalable and transparent framework for assessing urban change, post-disaster reconstruction, and long-term exposure.

1:30pm - 3:00pm

WG IV/10: Applied Spatial Science for Public Health
Location: 715B

1:30pm - 1:45pm

Benchmarking and assessment of image-based methods for particulate matter estimation: The AQpictures project

Afshin Moazzam¹, Daniele Oxoli¹, Maria Antonia Brovelli¹, Songnian Li², Francesco Pirotti³, Shishuo Xu⁴

¹Politecnico di Milano, Italy; ²Toronto Metropolitan University; ³University of Padova; ⁴Beijing University of Civil Engineering and Architecture

The AQpictures project, conducted under the ISPRS Scientific Initiatives 2025, addresses the emerging field of image-based estimation of fine particulate matter (PM2.5) concentrations in urban areas. PM2.5 represents a major public health concern, yet existing ground-based monitoring networks offer limited spatial coverage and satellite-derived products struggle to capture surface-level variability. Recent studies have demonstrated that visual attributes in outdoor images, such as sky colour, haze, and visibility, can provide useful indicators of PM2.5 concentrations. Building upon this premise, AQpictures aims to develop an open, reproducible framework for benchmarking and validating image-based air quality estimation methods.

The project first conducts a comprehensive literature review to classify existing approaches into four methodological categories: physics-based, machine learning, deep learning, and hybrid models. Based on this synthesis, a benchmark experiment is implemented for the city of Milan, combining a ten-month dataset of webcam images with co-located PM2.5 ground measurements. The workflow involves image preprocessing, feature extraction, and model evaluation using standard statistical indicators (R², RMSE, MAE). Preliminary tests include physics-based visibility models, feature-based regressors, and convolutional deep learning architectures.

All codes, datasets, and documentation are consolidated in an open-access GitHub repository to ensure transparency, reproducibility, and adaptability of methods across different environmental contexts. Early results confirm the feasibility of PM2.5 estimation from RGB imagery, though further investigations on multi-city datasets are planned to evaluate model transferability and robustness under varying urban and climatic conditions.

1:45pm - 2:00pm

Interoperable Federated Access to Multi-Vendor Wearables for Postpartum Wellbeing Support: A Standards-Based Architecture for MAMAI

Faraneh Falah, Sepehr Honarparvar, Mahnoush Mohammadi Jahromi, Steve Liang, Sara Saeedi

University of Calgary

This paper presents MAMAI (Maternal Assistance and Monitoring through Artificial Intelligence), a standards-based framework designed to enable interoperable postpartum well-being monitoring using multi-vendor wearable devices. The proposed system addresses a key limitation in digital maternal health: the fragmentation of wearable ecosystems and the lack of integration with clinical infrastructures. MAMAI introduces a federated, edge–cloud architecture that allows wearable data to be processed locally while transmitting only summarized to the cloud.

A core contribution of this work is the integration of two complementary interoperability standards: the OGC SensorThings API for structuring IoT-based sensor observations, and HL7 FHIR for representing well-being indicators in clinically compatible formats. Through this dual-standard approach, heterogeneous wearable data—such as sleep patterns, physical activity, and heart-rate variability—are harmonized into standardized, platform-independent representations.

The framework further introduces a composite well-being score derived from normalized physiological indicators, enabling continuous and interpretable assessment of maternal health. A prototype implementation demonstrates the feasibility of the architecture, supporting end-to-end data ingestion, transformation, interoperability mapping, and visualization. Experimental results show efficient system performance with low end-to-end latency.

Overall, MAMAI provides a scalable and interoperable solution for integrating consumer wearable data into healthcare ecosystems, offering a foundation for next-generation maternal digital health systems and continuous postpartum monitoring.

2:00pm - 2:15pm

Seeing vertical greenery: Global differences in residents’ green exposure and inequality

Xiaozhen Ren¹, Xuefeng Guan¹, Liqun Sun², Yifan Teng¹, Qingyang Xu¹, Chang Liu¹, Zhangyan Xu¹, Xu Li¹

¹State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China; ²Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China

Achieving the United Nations Sustainable Development Goal (SDG) 11.7.1—“providing universal access to safe, inclusive, accessible, and green public spaces by 2030”—underscores the critical role of urban green space in advancing global sustainability.Although extensive research has examined urban greenery from a traditional planar perspective, green spaces inherently possess vertical structure. Currently, systematic quantitative assessments of urban vertical greenery, residents’ actual exposure to vertical green space, and the associated inequalities remain limited. To address these gaps, this study integrates global population data with vegetation height information to construct an exposure-based analytical framework.We quantify spatial patterns of vertical greenery, residents’ green exposure, and exposure inequality across global urban areas, and further examine the drivers of inequality. Our findings reveal pronounced spatial disparities in urban greenery worldwide. On average, cities in the Global North exhibit approximately three times greater vertical greenery and nearly four times higher green exposure than cities in the Global South. African urban areas possess only one-sixth of the average vertical greenery and one-seventh of the exposure level observed in North America, while displaying roughly twice the inequality in green exposure, indicating much more uneven access to green resources. We also find that cities with higher average vertical greenery tend to experience lower exposure inequality, suggesting that increasing overall greenery can help promote more equitable access. These results provide new theoretical insights and policy-relevant evidence for advancing sustainable and equitable urban green development, supporting global progress toward sustainable development goals.

2:15pm - 2:30pm

Modeling Dynamic Walkability to Support Time-Based Route Planning for Older Adults

Febrian Fitryanik Susanta^1,2, Pei Fen Kuo¹, I Gede Brawiswa Putra¹

¹Department of Geomatics, National Cheng Kung University, No. 1 Dasyue Road, East District, Tainan City 701, Taiwan; ²Department of Geodetic Engineering, Universitas Gadjah Mada, Jl. Grafika No. 2, Yogyakarta 55281, Indonesia

Walkability assessments for elderly pedestrians are often based on static representations of the built environment, overlooking temporal variations that influence walking conditions throughout the day. This study develops a network-based dynamic walkability framework that integrates static infrastructural characteristics with time-dependent environmental factors to capture spatiotemporal variability in pedestrian suitability. The approach combines sidewalk and arcade-based pedestrian networks with dynamic variables, including traffic, air quality index (AQI), temperature, humidity, shade, and lighting, evaluated at two time periods (12:00 and 17:00) across weekdays and weekends in three urban contexts in Tainan, Taiwan: a hospital area, a university campus, and a residential neighborhood. Results indicate clear spatial differences, with hospital and campus areas showing higher baseline walkability than residential areas. Dynamic analysis reveals temporal variation, with improvements ranging from approximately 3–8% in institutional environments to over 10% in residential areas. Segment-level results further show that temporal factors can alter pedestrian suitability, particularly in areas with limited infrastructure. Route-based validation demonstrates that the model generates alternative paths that prioritize safety and environmental comfort over the shortest distance. Compared to Google Maps routes, the proposed approach achieves higher average walkability, with improvements ranging from approximately 5% to over 15%, particularly in residential areas. These findings highlight the limitations of static and shortest-path approaches and emphasize the importance of incorporating temporal dynamics. The proposed framework supports time-sensitive routing and age-friendly urban planning strategies.

2:30pm - 2:45pm

An Environment-Aware Indoor-Outdoor Integrated Digital Twin for Healthy Mobility

Yan Li, Chenming Ye, Wenxuan Shi, Wenqing Zhang, Yuyang Zhang, Teng Hu, Zhizhong Kang

China University of Geosciences (Beijing), China, People's Republic of

Existing building digital twins treat indoor environments as static geometric containers, ignoring the dynamic coupling between ventilation structure states and indoor environmental quality. Furthermore, managing indoor and outdoor spaces as separate data silos prevents the continuous assessment of occupant exposure across building boundaries. This paper proposes an environment-aware, indoor-outdoor integrated digital twin framework coupling geometric entity states with physical environmental fields for healthy mobility assessment. The framework utilizes a three-layer architecture. First, the Geometric-Semantic Layer provides a seamless LOD4 model with topologically stitched spaces, modeling ventilation facilities as first-class entities with mutable state attributes (Full Closed, Half Open, Full Open). Second, the Physical Field Layer maps mobile sensing data (PM2.5, CO2) onto semantic entities using a semantic-constrained method, treating walls and closed windows as aggregation barriers. Finally, the Behavioral Response Layer combines entity-level pollution values with pedestrian counts to compute a cumulative Crowd Exposure Index (CEI). Implemented on a Cesium platform, the framework was validated through a week-long university building experiment. Results show indoor PM2.5 in a fully enclosed study room averaged 61.2 μg/m³—1.6 times the outdoor level and 4.1 times the WHO guideline. This resulted in a CEI 12 times higher than in outdoor transit areas. Semantic correlation confirms the "Full Closed" window state primarily drives pollutant accumulation. This validates the framework's core geometry-physics coupling, demonstrating its potential to guide intelligent ventilation interventions and healthy building management.

2:45pm - 3:00pm

Integrating ulti-Source Remote Sensing and GIS for Urban Air Quality Mapping in Emerging City: Insights from Nashik City, India

Chetankumar Patel, Shishir Dadhich

SVNIT,SURAT

Rapid industrialization and unplanned urbanization have increased air pollution levels across Indian cities, posing serious environmental and health challenges. This research presents a geospatial assessment of air pollutant behaviour across Nashik city by integrating multi-source remote sensing datasets and real observation datasets from Sentinel-5P, NASA POWER, and CPCB ground observations within a GIS-based analytical framework. Using ward-level mapping and spatial overlays, the study examines the distribution of key pollutants—PM2.5, PM10, NO2, SO2, and CO—and their relationship with environmental and anthropogenic parameters, including land use, road networks, wind direction, temperature, and vegetation density. The results consistently reveal high concentrations of PM2.5, ranging from a minimum of 52.4 µg/m³ to a maximum of 73 µg/m³, and PM10, a minimum of 87.3 µg/m³ and a maximum of 121.5 µg/m³, particularly along high-traffic corridors and industrial zones, which exceed the WHO standards. Correlations with meteorological and vegetative factors further highlight the influence of urban form and climatic conditions on pollutant dispersion. This integrated approach demonstrates how multi-source remote sensing and GIS tools can be effectively employed to identify emission hotspots, support evidence-based policy formulation, and strengthen urban environmental management strategies for sustainable development.

3:00pm - 3:15pm

Long-Term Monitoring of NO₂ Pollution in the Mining and Industrial Region of Korba in Chhattisgarh Using Sentinel-5P and NDPI

Aman Srivastava, Aditya Kumar Thakur, Rahul Dev Garg, Pradeep Kumar Garg

Indian Institute of Technology Roorkee, India

Air pollution is a critical environmental challenge, with nitrogen dioxide (NO₂) from vehicles and industries posing serious health and atmospheric risks. Traditional monitoring is limited, making satellite-based methods essential for large-scale assessment. Korba, Chhattisgarh is an industrial hub of coal mining and thermal power plants is a major pollution contributor. This study investigates the spatiotemporal dynamics, statistical behavior, and long-term trends of NO₂ concentrations over the Korba region from 2019 to 2024, utilizing Sentinel-5P TROPOMI-derived NO₂ column density and the Normalized Difference Pollution Index (NDPI). Year-wise NDPI patterns revealed a consistent pollution hotspot in the central-southern region, with the annual mean NDPI gradually increasing from 0.175 in 2019 to 0.191 in 2023. The monthly NDPI peaked in December-2024 at 0.525, indicating severe winter pollution. Statistical analysis showed moderate variability and a near-symmetric NDPI distribution with occasional spikes near industrial zones. Trend analysis identified a marginal but steady increase in pollution. Autocorrelation analysis revealed strong short-term persistence (lag-1 = 0.594), while spectral analysis identified a dominant annual frequency (0.083 cycles/month) with a peak power of 0.107, confirming the presence of strong seasonal variation and short-term persistence in NO₂ concentration. These results underscore the cyclic yet escalating nature of NO₂ pollution, with notable winter intensification. The findings emphasize the need for targeted emission control strategies and policy-level interventions to manage regional air quality. Future work should integrate ground-based validation and explore meteorological influences to improve predictive accuracy and guide sustainable environmental management.

1:30pm - 3:00pm

IvS7A: Innovative Remote Sensing of Wetlands in Canada and Beyond
Location: 716A

1:30pm - 1:45pm

Retrieving Peatland Soil Moisture from Polarimetric L- and C-band SAR to Support Carbon and Wildfire Assessments in Boreal Ecosystems

Laura Bourgeau-Chavez¹, Daniel Leyva², Rabecca Bakker¹, Anthony Chavez¹, Michael Battaglia¹, Jeremy Graham¹, Rachel Gumpper¹

¹Michigan Technological University, United States of America; ²Purdue University, United States of America

The accumulation of C in peatlands generally depends on hydrologic conditions that maintain saturated soils and impede rates of decomposition. Boreal Peatlands have provided rich reservoirs of stored C for millennia. However, with climate change, warming and drying patterns across the boreal and arctic are resulting in dramatic changes in ecosystems and putting these systems at risk. As long as peatlands are functioning hydrologically, they will continue to sequester and store carbon. The ability to retrieve and monitor soil moisture from peatlands is of interest for a wide range of applications from hydrological modeling to understanding ecosystem vulnerabilities to increased drought, decomposition and wildfire to monitoring methane flux and peatland restoration. To develop soil moisture retrieval algorithms, we studied a range of boreal peatland sites (bogs and fens) stratified across geographic regions of North America from 2010 to 2024. We developed soil moisture retrieval algorithms from polarimetric C-band (5.7 cm wavelength) and L-band (24 cm wavelength) synthetic aperture radar (SAR) data. Both multi-linear regressions and gradient boosters (XGBoost, CatBoost and Explainable Boosting Machines) were developed. We found that integrating polarimetric SAR parameters that are sensitive to vegetation structure and parameters most sensitive to surface soil moisture in the models provided the best results. Data were withheld for model testing and coefficient of determination, RMSE, unbiased RMSE are reported.

1:45pm - 2:00pm

Using a Landsat multi-index and thermal image composite time series framework to evaluate hydroclimatic forcing and vegetation trajectories in the Peace-Athabasca Delta

Farnoosh Aslami¹, Chris Hopkinson^1,2, Laura Chasmer^1,2, Daniel L Peters³, Craig Coburn¹, Mina Nasr⁴

¹Department of Geography and Environment, University of Lethbridge, Lethbridge, AB, Canada; ²Department of Geography and Environment, Western University, London, ON, Canada; ³Environment and Climate Change Canada, University of Victoria Queenswood Campus, Victoria, BC, Canada; ⁴Government of Alberta, Ministry of Environment and Protected Areas, Edmonton, AB, Canada

The Peace–Athabasca Delta (PAD) is undergoing long-term ecological change driven by climate warming, hydro-regulation, and fluctuating flood–dry cycles. This study uses a harmonised 40-year Landsat composite time series (1984–2024) to assess vegetation, surface-water extent, and thermal conditions across the delta. An 11-year moving-window Mann–Kendall trend analysis was applied to NDVI, EVI, MNDWI, and LST, retaining only significant Theil–Sen slopes. Significant vegetation–water trends were combined into a 10-class framework that maps greening, browning, wetting, and drying across all landscape types, including ecotones. Parallel LST trends reveal reinforcing or contrasting thermal feedbacks. It provides a coherent basis for interpreting whether vegetation and hydrologic changes reflect ecotone expansion or contraction under thermal variability.

2:00pm - 2:15pm

Aquatic and Riparian Land Cover Trends across Mountainous Headwater Basins in Alberta, Canada

Italo Rodrigues¹, Christopher Hopkinson¹, Laura Chasmer¹, Ryan MacDonald¹, Suzanne Bayley²

¹University of Lethbridge, Canada; ²University of Alberta

Mountain headwaters of the Eastern Slopes of Alberta (ES) are the primary source of freshwater of major easterly flowing basins in western Canada, supplying a significant volume of water to about four million people. However, increasing temperatures is altering mountain aquatic (open water areas, lakes, reservoirs, rivers, ponds, wetlands) and riparian vegetation (herbaceous and woody/shrub) ecosystems. The ES, Canada, has demonstrated landcover and process changes associated with climate warming, e.g., increases in the air temperature [1] have led to earlier snowmelt, and increased glacier wastage, resulting in higher river flows over a shorter period, which can result in expansion of open water areas during and following peak flow periods [2]. The impacts on wetlands are less visible or well known, and there is a need to evaluate spatial and temporal changes and trends in wetland loss, growth, or genesis across this mountainous ecosystem. Here, we provide a framework for quantifying and assessing multi-decadal wetland extents over the large spatial scale of the ES from 1984 to 2023. We used the historical Landsat archive to produce a remote sensing-based time series landcover classification over the last 40 years in the ES.

2:15pm - 2:30pm

Transfer Learning using Functional Data Analysis of Seasonal SAR Time Series

Amir Behnamian¹, Kenneth Chu², Jason Duffe¹, Jon Pasher¹, Mina Nasr³

¹Environment and Climate Change Canada; ²Statistics Canada; ³Alberta Government

Functional Data Analysis (FDA) provides a powerful framework for representing temporal dynamics in remote-sensing data. Building on this concept, this study develops a transfer learning framework using a minimally trained Functional Principal Component Analysis (FPCA)-based feature extraction engine (“FPC engine”) to map dynamic wetlands at large scale. A small set of training locations from Ontario was used to train the FPC engine, which captures dominant seasonal backscatter patterns of open water, shallow water, and marsh-like vegetation. The trained engine was then transferred to the Prairie Pothole Region (PPR) to delineate dynamic wetland classes without extensive local calibration. This label-efficient design—supervised in selecting training locations but unsupervised in feature extraction—reduces field data needs while maintaining strong generalization. Validated results show that the transferred FPC engine effectively separates dynamic wetland classes across contrasting climatic and geomorphic conditions, supporting scalable and cost-efficient monitoring with Sentinel-1 SAR data.

2:30pm - 2:45pm

Multi-scale DSM and Multi-temporal Sentinel-2 Derivatives for Wetland Mapping: A Boreal Case Study

Amir Behnamian¹, Courtney Irvine², Anne Blondin², Morgan Hrynyk¹, Jason Duffe¹, Jon Pasher¹

¹Environment and Climate Change Canada, Canada; ²Parks Canada

Wetland mapping in boreal environments remains challenging due to complex vegetation structure, subtle and variable terrain gradients, diverse wetland types, and the proportion of treed wetlands. This study develops and evaluates a framework to remotely identify wetland types in Pukaskwa National Park (Ontario, Canada) by integrating multi-scale terrain metrics with multi-temporal Sentinel-2 spectral derivatives. Five years (2017–2021) of Sentinel-2 data were used to derive harmonic NDVI metrics, including linear trend, amplitude, and phase of the first Fourier component, capturing seasonal vegetation and hydrologic dynamics. These spectral predictors effectively delineated open water and non-treed peatlands but struggled in densely forested wetlands where canopy obscures surface moisture signals. To address this limitation, Gaussian scale-space analysis was applied to the Copernicus GLO-30 DSM, informed by FFT-based evaluation of terrain wavelengths (100 m–10 km), to generate multi-scale Local Relief Models and curvature metrics representing depressional and convex landforms. A hierarchical workflow masked open water using Sentinel-1, removed upland convex terrain using LRM-curvature rules, then applied Random Forest classification using field training data and combined spectral-terrain predictors. Accuracy assessment stratified by terrain context showed strong performance in low-lying depressional areas and suppression of false wetland detections in high terrain with local depressions. Reduced accuracy in relatively flat areas was attributed to DSM vertical uncertainty limiting detection of shallow depressions beneath dense canopy, resulting in reliance on optical separability that weakens under closed canopy but improves where tree cover is sparse. Overall, results demonstrate the value of combining Fourier-based temporal descriptors with multi-scale terrain analysis for boreal wetland mapping.

1:30pm - 3:00pm

Forum4B: Hybrid Intelligent Geospatial Computing
Location: 716B

1:30pm - 3:00pm

Forum9A: Exploring the Role of DGGS and AI in Addressing Challenges of National Mapping & Remote Sensing Agencies
Location: 717A

3:00pm - 3:30pm

Afternoon Coffee Break
Location: Exhibition Hall "E"

3:30pm - 5:15pm

SpS4A: Remote Sensing of Atmospheric Components for Climate Change and Air Quality: Bridging ISPRS and AERSS
Location: 713A

3:30pm - 3:45pm

Satellite Remote Sensing and Numerical Simulation of the Impact of Biomass Burning on Black Carbon in East Asia

Huimin Chen¹, Jane Liu^2,3, Bingliang Zhuang⁴, Tijian Wang⁴

¹Suzhou Meteorological Bureau, China; ²Fujian Normal University, China; ³University of Toronto, Canada; ⁴Nanjing University, China

As an essential component in the atmosphere, black carbon (BC) can affect regional and global climate, air quality, and human health. Biomass burning is an important source of BC aerosols, and biomass burning in East Asia is rather active. In this study, we analyze the biomass burning over East Asia in 2010 using MODIS satellite fire data. A global chemical transport model, GEOS-Chem, is used to simulate temporal and spatial variations of BC aerosols and impact of biomass burning on these variations through two numerical experiments: one with all BC emissions while the other without the biomass burning emissions.

The results show that the 2010 biomass burning over East Asia frequently occurred in northeast China, north China, northern India and indo-China Peninsula. In China, biomass burning mostly happened in summer and fall, while in Southeast Asia, biomass burning happened in spring and winter. GEOS-Chem can reasonably reproduce the temporal and spatial variations of BC. The surface concentrations of BC in China are high in the North China and Southwest basins. Such a spatial pattern is similar in four seasons, with seasonality that BC concentrations are the highest in winter, followed by autumn, spring and summer. Sensitivity analysis shows that the biomass burning in East Asia contributed 8.6% BC concentrations in East Asia. Based on the EOF decomposition and correlation analysis, the BC concentrations due to biomass burning in some parts of East Asia was significantly increased through transport of BC in the first mode at 850 hPa in spring and winter.

3:45pm - 4:00pm

Validation of global land-ocean aerosol products retrieved from the DPC-2/GF-5(02) on-orbit measurements

Zihan Zhang¹, Zhengqiang Li^1,2,3

¹Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100101, China; ²State Key Laboratory of Spatial Datum, College of Remote Sensing and Geoinformatics Engineering, Faculty of Geographical Science and Engineering, Henan University, Zhengzhou 450046, China; ³University of Chinese Academy of Sciences, Beijing 100049, China

The Chinese second-generation Directional Polarization Camera (DPC-2) onboard the GF-5(02) satellite provides global multi-angle, multispectral polarization observations, effectively bridging the gap between POLDER/PARASOL and SPEXone/PACE. Using one year of DPC-2/GF-5(02) measurements, land-ocean aerosol products are generated by fully exploiting polarization and angular information to enhance sensitivity to aerosol properties. Ground-based observations from the AErosol RObotic NETwork (AERONET) are used to evaluate the retrieval accuracy of Aerosol Optical Depth at 550 nm (AOD550), Ångström Exponent between 440 nm and 670 nm (AE440-670), and Single Scattering Albedo at 440 nm (SSA440), demonstrating the stability and reliability of the retrievals. For AOD550, the Root Mean Square Error (RMSE) and bias are 0.109 and -0.006 over land, and 0.071 and -0.001 over ocean. For AE440-670, the RMSE and bias are 0.488 and -0.151 over land, and 0.275 and -0.047 over ocean. For SSA440, the RMSE and bias are 0.044 and 0.003 over land, and 0.039 and 0.002 over ocean. Comparisons with mainstream satellite aerosol products indicate comparable and consistent accuracy. Overall, these results provide a coherent global characterization of aerosol distribution and properties, highlighting the strong potential of DPC-2/GF-5(02) for long-term aerosol monitoring and climate research.

4:00pm - 4:15pm

Intra-urban aerosol heterogeneity in Hong Kong based on Lidar observations

Xinyu YU¹, Man Sing WONG^1,2,3,4, Jiaqi Jin¹

¹Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ²State Key Laboratory of Climate Resilience for Coastal Cities, The Hong Kong Polytechnic University, Hong Kong, China; ³Research Institute for Sustainable Urban Development, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ⁴Research Institute of Land and Space, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China

This study involves remote sensing and Lidar-based data analysis to quantify the aerosol extinction profile under different urban patterns and seasons.

4:15pm - 4:30pm

Contrasting Meteorological Impacts of Dust Storms from the Gobi Desert versus the Taklimakan Desert over China

Jianhong Li, Nana Luo

Beijing University of Civil Engineering and Architecture, China, People's Republic of

Direct and indirect climate forcing from Asian dust storms has been well documented, such as lifted dust aerosols becoming cloud-forming particles and changing radiation flux from surface to the top of atmosphere. However, whether such forcing becomes distinguished as related to dust origins remains unclear. Here we present a comparative analysis of historical dust storms in China originating in Mongolia and Xinjiang from 2016-2023, and determine their respective dominators by involving their individual and combined influence on dust storms. Most dust storms originated in Mongolia, with observed long-range transport and global scale development, in comparison to those originating in Xinjiang. During dust storms, cloud properties such as cloud droplet radius and cloud retrieval fraction liquid had nonlinear response, and a dominant role in 60.2% of the study area. Climate conditions such as surface thermal radiation and dewpoint temperature became dominated in periphery of dust storms. Xinjiang-originated dust storms, in contrast, were dominated by local aridity (65.2%). As the aridity decreased, dust storms were dominated by total precipitation, with increase from 0.5 up to 3.6, and the influence of surface heat flux decreased. Heat-flux-dominated regions encountered increased aridity, and the dominance of total precipitation was neutralized. These findings have important implication for global management and mitigation of Asian dust emissions.

4:30pm - 4:45pm

The Arctic Observing Mission (AOM): A high priority candidate mission for the Government of Canada

Ray Nassar¹, Joseph Mendonca¹, Chris Sioris¹, Alec Casey², Matt Arkett², Josep Aparicio³, Kyle Ziolkowski⁴, Dan-Tam Nguyen⁵, Shen-En Qian⁵, Mary-Anne Fobert⁶, Alexander Trishchenko⁶

¹Environment and Climate Change Canada, Science and Technology Branch, Toronto, Canada; ²Environment and Climate Change Canada, Meteorological Service of Canada, Gatineau, Canada; ³Environment and Climate Change Canada, Science and Technology Branch, Dorval, Canada; ⁴Environment and Climate Change Canada, Science and Technology Branch, Winnipeg, Canada; ⁵Canadian Space Agency, St.-Hubert, Canada; ⁶Natural Resources Canada, Ottawa, Canada

The Arctic Observing Mission (AOM) is a satellite mission concept under study by the Canadian Space Agency (CSA), in partnership with Environment and Climate Change Canada (ECCC) and Natural Resources Canada (NRCan). AOM would use two satellites in a highly elliptical orbit (HEO) to enable frequent observations of meteorological variables, greenhouse gases (GHGs), space weather and air quality (AQ) over northern regions, reaching beyond the usable viewing range of geostationary satellites. These observations are important for operational activities, environmental monitoring and scientific research aligned with the Government of Canada priority of enhancing Arctic and northern situational awareness and security.

4:45pm - 5:00pm

Global Point Source CO2 Emissions Monitoring Based on Hyperspectral Remote Sensing Imagery

Huayi Wang^1,2, Ge Han^1,2, Zhipeng Pei^1,2

¹Hubei Key Laboratory of Quantitative Remote Sensing of Land and Atmosphere, School of Remote Sensing and Information Engineering, Wuhan University; ²Perception and Effectiveness Assessment for Carbon-neutrality Efforts, Engineering Research Center of Ministry of Education, Institute for Carbon Neutrality, Wuhan University

This study presents a hyperspectral remote sensing approach for monitoring global CO₂ point source emissions using China’s GF5 and ZY1 satellites. By applying the matched filter method in the 1.6 μm and 2.0 μm absorption band and the Integrated Mass Enhancement (IME) technique, this study successfully detects and quantifies emissions from multiple facilities within a single scene—demonstrated in a high-density industrial cluster in Xinjiang. Results show current systems can detect power plants with annual emissions above 2.90 MtCO₂, covering 6.74 GtCO₂/year globally across eight sectors. While power and chemical sectors are well captured, cement and petrochemical emissions remain poorly detected, highlighting the need for improved sensitivity to low-intensity sources.

5:00pm - 5:15pm

Remote Sensing of CO, ozone and Their Correlation in Tropical Fire Regions

Jane Liu¹, Jufen Lai², Joey Zhou Zang¹

¹University of Toronto, Canada; ²Jiangsu Ocean University

Biomass burning releases a large amount of pollutants including carbon monoxide (CO), and generates secondary pollutants, e.g., ozone (O3). Both CO and O3 are major pollutants and can also significantly affect tropospheric chemistry. Understanding O3-CO relationship is important for evaluating transport and evolution of the pollutants in fire plumes. Here, we analyse the satellite remote sensing of fire count data from MODIS, satellite remote sensing of CO and O3 from AIRS, and the simulation of the global atmospheric chemistry model GEOS-Chem in the middle and lower troposphere during June and August of 2010. AIRS can capture fire-induced CO and O3 enhancements (ΔCO and ΔO3) well in fire-affected and fire-plume outflow regions. Two areas with high ΔCO and ΔO3 include central Africa and northwestern South America in the tropics, where the numbers of hotspots are the large in the MODIS fire data. AIRS CO and O3 in fire plumes are highly correlated in 850 hPa and 500 hPa. The GEOS-Chem simulation show CO and O3 enhancement in northwestern South America, but with lower ΔO3/ΔCO values. These findings highlight the importance of integrating satellite observations with atmospheric chemistry modelling on refining fire-affected air quality and tropospheric chemistry assessments.

3:30pm - 5:15pm

SpS3: Cooperation on Ground Motion Monitoring for Disaster Risk Reduction and Resilience
Location: 713B

3:30pm - 3:45pm

From InSAR Norway to a Global Ground Motion Service: Operational Monitoring for Disaster Risk Reduction

John Dehls¹, Yngvar Larsen², Marie Bredal¹, Gökhan Aslan¹, Dag Anders Moldestad³

¹Geological Survey of Norway, Norway; ²NORCE Research; ³Norwegian Space Agency

InSAR Norway (InSAR.no) is one of the world’s first fully operational, open-access national ground-motion services. Jointly operated by NGU, NVE and the Norwegian Space Agency, with processing by NORCE on NGU’s high-performance computing cluster, it provides nationwide deformation time-series from Copernicus Sentinel-1 data. The service delivers more than five billion measurement points annually through a public web portal and is widely used for landslide mapping, infrastructure monitoring and climate-related research. It has transformed how Norway identifies and manages unstable slopes, supports early warning and infrastructure safety, and integrates satellite data with in-situ monitoring through networks of snow-protected corner reflectors.

Experience from InSAR Norway directly informed the European Ground Motion Service (EGMS) under the Copernicus Land Monitoring Service, which scales the same operational principles to continental level. EGMS demonstrates that harmonized, validated and open InSAR products can be maintained across national borders.

Building on these achievements, this paper outlines the concept of a Global Ground Motion Service (GGMS)—a federated system providing standardized, GNSS-anchored ground-motion data worldwide. Such a service would combine open satellite data, interoperable processing frameworks and regional capacity-building to support disaster-risk reduction and resilience globally. As the global community invests in disaster-risk reduction, an open GGMS could become one of the most tangible and enduring legacies of the Copernicus era.

3:45pm - 4:00pm

Seismic Hazard for the Alpine Himalayan Belt from Trans-Continental Sentinel-1 InSAR & GNSS

Andrew Hooper¹, John Elliott¹, Jin Fang¹, Milan Lazecky¹, Yasser Maghsoudi², Qi Ou³, Jessica Payne¹, Chris Rollins⁴, Dehua Wang¹, Timothy Wright¹

¹COMET, School of Earth, Environment and Sustainability, University of Leeds, United Kingdom; ²Centre for Environmental Mathematics, University of Exeter, Penryn Campus,TR10 9FE, United Kingdom; ³School of GeoSciences, University of Edinburgh, Edinburgh, EH8 9XP; ⁴Earthquake Physics and Statistics, Earth Sciences New Zealand, 1 Fairway Drive, Avalon, 5011, Lower Hutt, New Zealand

Satellite geodesy has become a cornerstone for mapping tectonic deformation, fault activity, and seismic hazard through measurements of surface velocities and strain rates. Yet, in vast regions of diffuse continental deformation, such as the Alpine–Himalayan Belt, observational coverage remains limited. Historically, large-scale studies have relied on sparse GNSS networks, which cannot resolve shorter-wavelength deformation features in many areas. To address this gap, we processed Sentinel-1 radar acquisitions from 2016 to 2024 to generate transnational surface velocity fields and time series at 1 km resolution, spanning more than 11,000 km from southern Europe to eastern China and covering over 20 million km². Our solution integrates more than 220,000 Sentinel 1 SAR images with a newly compiled GNSS dataset, all referenced consistently to the Eurasian frame. From these velocities, we compute horizontal strain rates by taking spatial gradients, providing near-continuous deformation maps across the planet’s largest actively deforming zone. Horizontal motions and strain patterns are primarily tectonic, exhibiting a dual character: strongly localised along major faults yet broadly distributed elsewhere. In contrast, short-wavelength vertical signals largely reflect non-tectonic processes, especially widespread groundwater depletion. These new velocity and strain-rate products constitute foundational datasets, offering a detailed view of continental deformation at a transcontinental scale that feed into the Disaster Risk Management cycle.

4:00pm - 4:15pm

Volcano Risk Reduction in Canada – The Government of Canada’s Dedicated Volcano Monitoring System Using InSAR Technology

Drew Rotheram-Clarke¹, Melanie Kelman¹, Mandip Sond², Timothy Beattie², Chloe Lam¹

¹Geological Survey of Canada, Pacific Division, Vancouver, British Columbia, Canada; ²Canadian Hazards Information Service, Ottawa, Ontario, Canada

The west coast of Canada occupies an active subduction zone and is the host of an often underestimated threat of volcanic eruption. This tectonically active region is the home of 348 known volcanic vents that have been active since the Pleistocene, 54 of which are Holocene in age or younger. The annual probability of any eruption has been estimated at 1/200, while the annual probability of a major explosive eruption has been estimated at 1/3333.

In 2021 the Geological Survey of Canada published a volcanic threat ranking study) which used a threat score assignment methodology developed by the United States Geological Survey. In this study, we describe how the results of this threat ranking guide the acquisition strategy of routine RCM SAR data over the highest threat volcanoes in and around Canada.

We describe the architecture of the fully automated, cloud-based processing system that routinely searches for fresh RCM SAR data, ingests and processes the raw data and displays processed InSAR data on a purpose-built interface for scientific analysis.

With the proliferation of the heavily automated InSAR measurements, human analysis of vast volumes of data becomes challenging. In this research, we also describe the application and performance of an open weight deep learning model trained specifically for the purpose of detecting magmatic unrest in InSAR data. We demonstrate a deformation detection threshold of 6.2 cm and a true positive rate of 0.98 using observations from a real magmatic unrest event in Reykjanes, Iceland through 2023-2024.

4:15pm - 4:30pm

Updates on the NASA-ISRO NISAR Mission and the OPERA North America Surface Displacement Product

Cathleen Jones, Alexander Handwerger, Steven Chan

Jet Propulsion Laboratory, United States of America

We provide updates on the NASA-ISRO NISAR synthetic aperture radar mission and the NASA OPERA project. NISAR launched in June 2025 and began science operations in November 2025. The mission status will be presented and products for different science applications shown. The OPERA project produces four different product streams to support agency information needs, with the Dynamic Surface Water Extent (DSWx), Surface Disturbance (DIST), and Surface Displacement (DISP) products already available, and algorithm development underway for a future Vertical Land Motion product. These are generated from a variety of sensor data, including harmonized Landsat/Sentinel-2, Sentinel-1, NISAR, and SWOT. Examples shown will focus on the DISP products, currently generated from Sentinel-1 data and with a new product line using NISAR data to roll out in early 2027.

4:30pm - 4:45pm

Prediction of line-of-sight surface displacement using PSInSAR, and environmental factors powered by XGBoost

Masoud Mohsenifard, Ramata Magagi, Kalifa Goïta

Universite de Sherbrooke, Canada

Monitoring precursory ground deformation is essential for assessing landslide hazard in regions where hydrological conditions strongly influence surface stability. In Québec’s Saguenay–Lac-Saint-Jean (SLSJ) region, numerous surface failures have occurred in highly sensitive postglacial marine clays, where rainfall, snowmelt, and groundwater fluctuations act as dominant triggers. Although Persistent Scatterer InSAR (PSInSAR) enables regional monitoring of slow ground deformation, its utility for short-term prediction remains limited by the temporal gap between Sentinel-1 acquisitions. This study investigates whether hydrological time-series, when integrated with PSInSAR displacement trends, can be used to forecast the line-of-sight (LOS) displacement observed at the satellite acquisition immediately preceding documented failure events.

A dataset of 102 historical failures (2018–2024) was assembled and paired with 168 Sentinel-1 ascending scenes processed through the StaMPS PSInSAR workflow. Daily precipitation, air temperature, groundwater level, and terrain slope were compiled and temporally synchronized with LOS displacement time series.

An XGBoost regression model was trained to predict the LOS displacement at the subsequent Sentinel-1 acquisition, using an 80/20 train–test split and five-fold cross-validation. Model performance was evaluated using Pearson’s r, MAE, and RMSE. Results show strong predictive skill, with r = 0.82, MAE = 4.36 mm, and RMSE = 6.26 mm. Feature importance analysis highlights the dominant role of recent PSInSAR displacement and groundwater variability. These findings demonstrate the feasibility of integrating hydrological and InSAR time-series to forecast pre-failure surface displacement, supporting the development of satellite-based early warning strategies for hydrologically sensitive terrain.

4:45pm - 5:00pm

Validating social media Geospatial Tags Using Sentinel-1A InSAR on Google Earth Engine: A Hurricane Harvey Case Study

Michael Paul¹, Dodji Kuwonu¹, Eugene Levin¹, Todd Gary¹, Long Nguen², Hieu Minh Duong²

¹Meharry Medical College, United States of America; ²University of Louisville

This research validates social media geospatial tags using Sentinel-1A Interferometric Synthetic Aperture Radar (InSAR) data processed on Google Earth Engine, focusing on Hurricane Harvey as a case study. The study addresses critical uncertainties regarding the spatial reliability of crowdsourced disaster information, which has limited integration of social media data into operational disaster management frameworks.

Methodology: The methodology integrated 144,546 geotagged posts from Twitter, Facebook, and Instagram collected during Hurricane Harvey (August 25 - September 3, 2017) with Sentinel-1A SAR imagery processed on the Google Earth Engine cloud platform. InSAR analysis identified 1,247 square kilometers of flooded areas in the Houston metropolitan region. Spatial validation employed buffer zone analysis at 500m, 1km, and 2km distances, with temporal alignment matching social media timestamps to SAR acquisition dates.

Results: Results demonstrate that 68.3% of flood-related social media tags fell within actively flooded areas using 1km buffers, with accuracy increasing to 82.1% within 500m buffers, compared to only 12.7% random expectation. Temporal analysis revealed social media activity peaked 6-18 hours before peak SAR-detected flooding, suggesting potential early warning capabilities. The cloud computing paradigm reduced processing time from weeks to 4-6 hours, enabling near-real-time validation.

Conclusion: This study establishes that validated social media geospatial information can effectively augment satellite-based disaster monitoring systems, particularly during initial response phases when temporal resolution is critical. The integration framework demonstrates operational feasibility for multi-source geospatial data fusion in disaster risk reduction applications.

5:00pm - 5:15pm

European Ground Motion Service: public and open source InSAR in support of Risk Management

Lorenzo Solari¹, Joanna Balasis-Levinsen¹, John Dehls²

¹European Environment Agency, Copernicus Land Monitoring Service; ²Geological Survey of Norway

The paper presents an overview of the European Ground Motion Service (EGMS), a CLMS product that delivers continent-wide, high-resolution measurements of ground motion to users based on Sentinel-1 data. It explains the EGMS architecture, which integrates Persistent and Distributed Scatterer techniques to generate standardised products—Basic, Calibrated, and Ortho—allowing millimetric monitoring of land motion across Europe. The paper emphasises how EGMS fills a critical gap between localised ground measurements and global geodetic frameworks, offering harmonised datasets for hazard assessment, infrastructure management, and policy-making. Applications discussed include subsidence and uplift detection, landslide mapping, and analysis of critical infrastructure.

Looking forward, the paper outlines a potential evolution towards an expansion of the EGMS concept beyond Europe. This would enable standardised, freely accessible deformation data to support global hazard mitigation and climate adaptation. The paper concludes that while technically feasible, a global implementation will require strategic GNSS densification and international cooperation to ensure reliability and equitable access.

3:30pm - 5:15pm

WG III/4C: Landuse and Landcover Change Detection
Location: 714A

3:30pm - 3:45pm

Canopy Height Estimation Through the GEDI Era Using Multiple Sensors Combination and Machine Learning

Ghaith Kouki, Bahram Salehi

SUNY ESF, USA

Accurate large-scale forest canopy height mapping is critical for biomass estimation and carbon monitoring, yet remains constrained by the limitations of individual remote sensing systems. This study presents a multisensor machine learning framework that integrates GEDI LiDAR with Sentinel-2, Sentinel-1, ALOS-2 PALSAR-2, and 3DEP terrain data to generate a 25 m resolution canopy height model (CHM) for the Northeastern United States in 2022. A key contribution is an adaptive GEDI relative height (RHad) strategy that selects optimal RH metrics based on canopy density, improving generalization across heterogeneous forest conditions compared to any single fixed RH metric. Independent validation against airborne LiDAR and USDA FIA plot data confirms that RHad achieves the highest accuracy and lowest bias of all configurations tested. The resulting regional canopy height map provides a reliable baseline for large-scale forest monitoring and future multitemporal analyses.

3:45pm - 4:00pm

Near Real-Time Forest Loss Detection in the Brazilian Amazon Using Bayesian Fusion of Sentinel-1 SAR and Sentinel-2 Multispectral Time Series

Marta Bottani¹, Laurent Ferro-Famil^1,2

¹CESBIO, Toulouse, France; ²ISAE Supaero, Toulouse, France

Timely and accurate detection of deforestation is essential for managing tropical forests, yet individual Earth observation sensors have inherent limitations. Multispectral imagery offers detailed spectral information on vegetation properties but is frequently hindered by cloud cover, while Synthetic Aperture Radar (SAR) imagery provides insights on vegetation structure independent of weather conditions but is sensitive to moisture variability and residual vegetation post-clearing. The complementary nature of these data has motivated multi-source fusion approaches, though most existing methods rely on offline processing or decision-level integration, limiting their real-time applicability.

This study generalizes a Bayesian Online Changepoint Detection (BOCD) framework based on the recursive estimation of the number of acquisitions since the last change to asynchronous, irregularly sampled Sentinel-1 SAR and Sentinel-2 multispectral time series. A dynamically weighted fusion mechanism is implemented, in which each sensor’s relevance reduces with increasing time since its last observation, according to a physical decay model. The resulting method, named ms-BOCD, enables interpretable, and Near Real-Time (NRT) detection of forest loss.

The ms-BOCD method is validated using MapBiomas Alerta reference data spanning deforestation polygons ranging from 0.1 to 50 hectares in the Brazilian Amazon.

Compared to $VH$-BOCD (BOCD using Sentinel-1 cross-polarization only) and the operational RADD and TropiSCO systems, ms-BOCD achieves a 25% improvement in detection performance and maintains 13% fewer false alarms than Global Forest Watch (GFW), a platform that aggregates multiple independent deforestation alert products. Overall, these results demonstrate the strong potential of multi-source Bayesian fusion for operational tropical forest monitoring.

4:00pm - 4:15pm

Community Managed vs. Protected Forests: A Remote Sensing Workflow for Assessing Forest Conservation in Liberia (2002–2024)

Amanda Aragon, Marguerite Madden, Matthew Auer

University of Georgia, United States of America

This study assesses long-term forest change in Liberia’s Community Forest Management Areas for Conservation (CFMACs) and Protected Areas (PAs) from 2002 to 2024 using an integrated Landsat–Google Earth Engine (GEE) and an ArcGIS Pro workflow. Annual dry-season composites for three time periods were classified using a Random Forest model with 81.7% accuracy (Kappa = 0.781). Results show contrasting governance outcomes: CFMACs experienced modest forest gains from 2002–2014 and localized losses thereafter, while PAs exhibited larger overall gains but also greater cumulative forest loss, particularly along concession boundaries. Stability analysis revealed that PAs retained a higher proportion of Mature Forest over the 20-year period, whereas CFMACs showed more dynamic turnover and localized regrowth. The combined GEE/ArcGIS approach provides a scalable, transparent monitoring framework and demonstrates how governance type influences forest persistence, degradation, and recovery across Liberia’s tropical landscapes.

4:15pm - 4:30pm

A benchmark dataset for canopy cover change evaluation in North America

Camile Sothe, Maxwell Joseph, Christopher B. Anderson

Planet Labs PBC, San Francisco, CA, USA

Accurate assessment of tree cover change is essential for monitoring deforestation, carbon emissions, and restoration progress. However, validation of global forest change products remains limited by the scarcity of consistent reference data. We present a benchmark dataset for tree canopy cover change evaluation across North America, derived from multitemporal airborne LiDAR data from the National Ecological Observatory Network (NEON). Using canopy cover maps from 2016–2022, we identified tree cover loss as a decrease of at least 20% in canopy cover persisting across multiple time steps. Thirty NEON sites spanning diverse biomes were included, forming a spatially and temporally robust reference for change detection.

We demonstrate the benchmark applicability by evaluating two global products: Forest Carbon Diligence (FCD) from Planet Labs, and the Global Forest Change (GFC) from University of Maryland. Across all sites, both products showed strong agreement with the LiDAR benchmark (r = 0.90 for FCD; r = 0.88 for GFC), though both underestimated change extent. Categorical metrics revealed higher precision than recall, indicating conservative detection thresholds relative to the benchmark.

This study establishes the NEON LiDAR-based benchmark as a valuable open resource for assessing and improving large-scale canopy cover change datasets. The approach highlights the importance of high-resolution, temporally consistent reference data for evaluating the accuracy of global monitoring products and guiding improvements in forest carbon accounting and conservation applications.

4:30pm - 4:45pm

Spatiotemporal Vegetation Degradation Simulation and Inversion in Inner Mongolia Autonomous Region

He Yang, Yongjun Zhang, Yongxiang Yao, Duanning Chen, Yi Wan, Bin Zhang

School of Remote Sensing and Information Engineering, 430079, Wuhan, Hubei, China

Under climate and human pressures, vegetation in Inner Mongolia exhibits complex fragmentation and degradation. Scientifically inverting its spatiotemporal dynamics is crucial for regional ecological restoration. To address the challenges faced by traditional cellular automata (CA) models in large-scale complex ecological transition zones—such as computing power bottlenecks and subjective transition rules—this study proposes a cloud-based vegetation degradation simulation and inversion framework (CA-VDS) via Google Earth Engine. By coupling Random Forest (RF) and an Improved Genetic Algorithm (IGA) with CA, the framework extracts nonlinear driving potentials and automates the optimization of bidirectional transition thresholds. Validation against the 2020 baseline shows CA-VDS effectively resolves manual parameter tuning limitations. Furthermore, it smooths the spectral fluctuations caused by short-term sporadic disturbances through the underlying spatial neighborhood mechanism, demonstrating its value in simulating potential ecological degradation risks and developmental trajectories. This work not only verifies the reliability of CA-VDS in analyzing complex nonlinear ecological processes, but also establishes a reliable parameter baseline and model paradigm for subsequent integration with CMIP6 and other multi-scenario data to conduct long-term future ecological predictions.

4:45pm - 5:00pm

Particle Swarm Optimization for Woody Vegetation Assessment in a Semi-Arid Savannah Ecosystem

Nesisa Analisa Nyathi, Juliane Krenz, Andrea Meier, Brigitte Kuhn, Nikolaus J. Kuhn

¹Physical Geography and Environmental Change Research Group, Department of Geography and Physical Sciences, Faculty of Philosophy and Natural Sciences, University of Basel, Basel, 4056

This study explores the application of Particle Swarm Optimization (PSO) to enhance vegetation indices (VIs) for the assessment of woody vegetation in a semi-arid savannah ecosystem. By optimizing VIs, the research aims to improve the discrimination between vegetated and non-vegetated areas, facilitating a more accurate random forest classification for habitat quality assessment. The optimization process preserves minimum VI values across different sensors to maintain lower bounds of reflectance, ensuring ecologically valid signals are represented, particularly in low-vegetated areas. Results indicate that maximum VI values increase post-optimization, enhancing sensitivity to canopy vigor, stress, health, and presence. The study highlights the effectiveness of UAV-derived indices, such as NDVI, NDRE, and SAVI, in capturing the dynamics of vegetation health and dryness, thereby contributing valuable insights into remote sensing methodologies for ecological monitoring.

5:00pm - 5:15pm

Research on a Method for Identifying Potential Cropland Abandonment Areas Using Bitemporal Remote Sensing Images

Shuangxi Miao¹, Pinzhou Guan¹, Zhongxiang Xie¹, Haoxu Zhao¹, Hao Wu²

¹China Agricultural University, CHINA; ²National Geomatics Center of China，CHINA

The paper proposes the STF-Net (Spatial-Textural-Frequency Network) framework, designed to achieve a paradigm shift from traditional "change detection" to "suspected area identification," precisely identifying suspected abandonment areas and effectively suppressing pseudo-changes. The core of this framework lies in its fine-grained four-level annotation system and a three-stream parallel feature extraction architecture. The four-level annotation system includes "confirmed abandonment," "suspected abandonment," "non-abandonment change," and "no change," providing a robust data foundation for the model to learn the "suspected" concept, thereby compensating for the lack of "user-oriented" definitions in existing research. The three-stream parallel feature extraction architecture captures changes in geometric information (location, shape) via the spatial stream; quantifies the transition of surface texture from ordered to disordered, capturing structural degradation due to abandonment, through the textural stream; and analyzes periodic structural information in images, identifying the disappearance of periodic structures caused by cessation of cultivation, using the frequency stream. These three types of features are deeply fused, comprehensively utilizing information from different modalities, significantly enhancing the model's adaptability and identification accuracy in complex scenarios.

3:30pm - 5:15pm

WG II/1A: Image Orientation and Fusion
Location: 714B

3:30pm - 3:45pm

AI-based Camera Pose Estimation on mixed Aerial and Ground Images: A comparative Study

Zichao Zeng, June Moh Goo, Jan Boehm

University College London, United Kingdom

Estimating camera poses jointly from aerial and ground imagery remains difficult because large viewpoint changes reduce overlap, alter appearance, and weaken the geometric assumptions relied on by both classical photogrammetry and recent AI-based reconstruction models. This paper presents a controlled comparison between a classic photogrammetric approach represented by COLMAP and a cross-view fine-tuned end-to-end model based on Dust3R. Tests are carried out on a London building scene containing 10 aerial and 29 ground images. Fine-tuned Dust3R reconstructs the full image set, whereas COLMAP successfully registers 24 ground-level images. Because both reconstructions are defined only up to an unknown similarity transform and no ground-truth poses are available, we evaluate the shared subset through 7-DoF similarity transformation analysis rather than direct metric pose errors. After transformation, the translation RMSE of the shared camera centres is 10.0\% of the reconstructed scene diagonal in the fine-tuned Dust3R coordinate frame. We further compare pairwise geometric support using a unified fundamental-matrix RANSAC evaluation over 406 image pairs. The AI-based pipeline achieves substantially higher inlier ratios than photogrammetric pipeline under the same verification settings, indicating more successful cross-view orientation. The study contributes a clearer evaluation protocol for mixed aerial-ground pose estimation without ground truth, together with an empirical analysis of robustness, alignment behaviour, and current limitations of both pipelines.

3:45pm - 4:00pm

Epipolar Rectification of a Generic Camera

Marc Pierrot Deseilligny, Ewelina Rupnik

Univ Gustave Eiffel, Géodata Paris, IGN, LASTIG

We propose a generic method for epipolar resampling that is not tied to a specific camera model. We demonstrate the effectiveness of the approach on a central perspective, pushbroom and pushbroom panoramic camera models. We also devise an \textit{epipolarability index} that measures the suitability of an image pair for epipolar rectification, and provide a formal derivation of the ambiguity bound to epipolar resampling. An open-source implementation of the algorithm is available at github.com/micmacIGN/micmac

4:00pm - 4:15pm

ThermalAssist: Towards Efficient Annotation of Thermal Imagery

Jingwei Zhu^1,2, Manoj Biswanath^1,3, Benjamin Busam^1,3

¹Chair of Photogrammetry and Remote Sensing, Technical University of Munich, Germany; ²School of Geospatial and Artificial Intelligence, East China Normal University, China; ³Munich Center for Machine Learning (MCML), Munich, Germany

Thermal infrared (TIR) imaging provides surface temperature of the objects and reveals heat-transfer patterns of buildings, which supports applications such as insulation inspection, energy leakage, and thermal bridge detection. However, the TIR image dataset with reliable annotations for deep learning remains scarce, as the labeling process is time-consuming and tedious, and particularly challenging due to the low-texture and blurred features of TIR images. To address this challenge, we propose ThermalAssist, a geometry and gradient-aware framework designed to assist thermal anomaly labeling in TIR imagery. By combining sparse manual annotations with dense correspondence via flow-based propagation, the framework efficiently transfers labels across image sequences while preserving semantic consistency and boundary integrity. Experiments on the TBBR dataset demonstrate that ThermalAssist can transfer labels between images, achieving up to 21% higher F1-score and 35% higher precision compared to state-of-the-art tracking-based baselines. It also helps identify missing annotations and boundary inconsistencies for quality checks. This work provides a foundational tool for quality-assured thermal annotation pipelines and represents a key step toward more scalable, reliable, and intelligent labeling of thermal imagery.

4:15pm - 4:30pm

Evaluation of recent AI-based point matching algorithms applied on aerial images

Pablo d'Angelo, Franz Kurz, Alaa Eddine Ben Zekri, Reza Bahmanyar

German Aerospace Center, Germany

Accurate image matching is essential for the precise orientation of airborne imagery, yet modern feature matchers are rarely evaluated on real aerial data with great temporal, seasonal, and radiometric changes. For this study, we introduce the AerialRefMatch dataset, which comprises 51 challenging aerial images and corresponding true-ortho reference data. We benchmark classical and deep learning–based matching algorithms on AerialRefMatch, considering two scenarios: matching original images and matching approx-orthorectified images generated using GNSS/IMU orientations. For each method, image-based ground control points are derived and used for single-image pose estimation; accuracy is assessed via independent checkpoints. Results show that directly matching on original images is very difficult: fewer than 14\% of images can be oriented with pixel-level accuracy. When approx-orthorectification is used, performance improves substantially. JamMa, SIFT, and SuperPoint+LightGlue achieve pixel-level accuracy for up to 30\% of images, with JamMa being most robust on difficult cases and SIFT-based variants being more precise on the easier ones. Deep detector-free models such as ELoFTR and RoMa are less accurate but more robust to the original images than other models. Overall, state-of-the-art deep learning-based matchers still struggle with large rotations, scale differences, and semantic differences, and strongly benefit from prior image orientation knowledge and lack sub-pixel precision.

4:30pm - 4:45pm

Faster than Light: An Embedded-Efficient Matching Model with ReLU Linear Attention

Ziang Wang¹, Tao He¹, Wei Cui¹, Yu Duan¹, Kaimin Sun¹, Haoyun Miao²

¹State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan China; ²North Automatic Control Technology Institute. Taiyuan, China

Deep learning-based image matching faces a critical challenge when deployed on computationally constrained embedded aerial devices. Transformer-based architectures, particularly the scaled dot-product attention mechanism, incur high computational costs that limit inference speed for real-time applications. To address this bottleneck, we propose FastGlue, a sparse feature matching algorithm that adapts the LightGlue architecture through two targeted modifications: replacing the scaled dot-product attention with a ReLU-based linear attention module, and reducing the depth of the graph neural network. These changes reduce computational complexity while maintaining competitive matching performance. Evaluations on HPatches and MegaDepth-1500 benchmarks show that FastGlue achieves accuracy comparable to LightGlue while improving inference speed—from 20.05 ms to 17.05 ms on GPU, and from 840.45 ms to 665.85 ms on an RK3588 embedded CPU. Our work demonstrates that targeted architectural simplifications can yield meaningful efficiency gains for deep learning-based feature matching on resource-constrained platforms.

4:45pm - 5:00pm

SCOP: An Open-Source and Educational JAX-Powered Framework for Generic Photogrammetric Bundle Adjustment

Adrien Gressin

University of Applied Sciences Western Switzerland (HES-SO / HEIG-VD)

We present SCOP, an open-source and educational framework for generic photogrammetric bundle adjustment built in Python and powered by JAX automatic differentiation. SCOP removes the need for manual Jacobian derivation by expressing all projection models as pure mathematical functions with automatically computed exact derivatives.

The framework supports multiple camera geometries (pinhole, fisheye, equirectangular) and optimization methods (Gauss-Newton, Gauss-Newton-Armijo, Levenberg-Marquardt, Gradient Descent). Its modular architecture, separating cameras, images, and observations, allows easy extension to new sensors and constraint types, including GNSS positions, ground control points, and geodetic observations.

A hybrid computation pipeline combines JAX for differentiation with a Rust backend for sparse Schur complement elimination, achieving ~0.5 s per iteration on a real-world dataset with 79k unknowns and 181k observations. Following classical least-squares photogrammetry, SCOP provides rigorous uncertainty estimation through covariance matrices, normalized residuals, and reliability indices. With synthetic data tools and interactive 3D visualization, it enables transparent teaching and reproducible research.

5:00pm - 5:15pm

TriCo-Net: Learning Semantically Aware Local Features via Triple Consistency

Longze Zhu^1,2, Li Yan^1,2, Hong Xie^1,2, Hao Wu^1,2, Shan Su^1,2, Binbing Wang¹, Xiaoteng Yang¹, Junjie Yuan¹, Aoran Li³

¹Wuhan University, The School of Geodesy and Geomatics, Wuhan 430079, Hubei, China; ²Hubei Luojia Laboratory, Wuhan 430079, Hubei, China; ³Henan Normal University, The College of Software, Xinxiang 453000, Henan, China

Local feature matching in complex scenes is hindered by semantic ambiguity, where detectors often latch onto transient or repetitive patterns. We present TriCo-Net, which learns semantically aware and discriminative local features by enforcing a Triple Consistency (TriCo) principle across implicit semantics, scale, and spatial context. During training, an Implicit Semantic Strategy (ISS) distills cues from a segmentation teacher to modulate keypoint reliability and descriptor learning, while introducing no overhead at inference. A Scale-wise Semantic Harmonizer (SSH) aligns and fuses feature-pyramid levels to ensure cross-scale coherence, and a Global Context Propagator (GCP) broadcasts scene-level dependencies to resolve local ambiguities. On Aachen Day–Night v1.1, TriCo-Net achieves strong and consistent gains in visual localization, particularly under night conditions, and exhibits robustness to blur, noise, and large homographies. Ablations show complementary benefits from ISS, SSH, and GCP, with ISS contributing most at tight thresholds and at night. TriCo-Net narrows the day–night performance gap while maintaining mid-range throughput, offering a practical trade-off between robustness and efficiency.

3:30pm - 5:15pm

WG III/2A: Spectral and Thermal Data Processing and Analytics
Location: 715A

3:30pm - 3:45pm

Impact of Urban Surface Heterogeneity on Thermal Anisotropy: Perspective of Geometric Structure and Component Temperature

Jiaying Xun¹, Qiang Chen¹, Yixiao Huang²

¹Beijing University of Civil Engineering and Architecture, China, People's Republic of; ²HUAYUN Shine Tek Co., China Meteorological Administration, China, People's Republic of

Urban surface structure and component temperatures induce significant thermal anisotropy (TA), resulting in substantial differences in observed surface temperatures across varying viewing angles. Although previous studies have investigated the temporal dynamics of TA through observations and modeling, its spatial differentiation over heterogeneous surfaces remains poorly constrained. Resolving how surface heterogeneity influences TA is hindered by the coarse spatial resolution and limited angular sampling of current multi-angle satellite observations. Consequently, most mainstream thermal-anisotropy models were developed for simplified scenes and lack systematic evaluation of their applicability to complex urban environments. To address these challenges, we coupled the microscale 3D urban energy balance model (TUF-3D) with the state-of-the-art Discrete Anisotropic Radiative Transfer (DART) model. This approach allows for rapid and accurate TA modeling of hypothetical urban scenes with varying geometric structures and component temperatures, thereby quantifying the impact of surface heterogeneity on TA. Building height variability was used to represent geometric heterogeneity, while differences in building material properties were used to characterize component temperature heterogeneity. To evaluate , The results of a series of sensitivity experiments have validated the individual effects of geometric and component temperature heterogeneity on TA. From the perspective of component temperature, changes in average component temperatures result in a maximum TA difference of 7.29 K, while temperature variability alone contributes only 0.54 K. These findings suggest that assuming simplified scenes with uniform building heights or homogeneous component temperatures can introduce biases in TA simulations, potentially compromising the accuracy of models correcting for the angular effects of land surface temperature.

3:45pm - 4:00pm

GloSVeT: A Global Monthly Soil–Vegetation Component Temperature Dataset Generated using a Multi-source Fusion Framework

Xiangyang Liu

Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, China, People's Republic of

Understanding the thermal behavior of soil and vegetation separately is essential for interpreting land–atmosphere energy exchange, diagnosing ecosystem stress, and improving land surface modelling. However, conventional satellite LST products only provide a mixed radiometric signal, masking the distinct thermal responses of soil and canopy. This study introduces GloSVeT, the first global dataset that provides monthly surface soil and vegetation component temperatures at 0.05° resolution for 2003–2023. The dataset is generated using an enhanced multisource fusion framework that integrates multi-temporal MODIS observations with ERA5-Land skin temperature and vegetation structural information to retrieve physically consistent component temperatures. We summarize the data sources, modelling framework, and global implementation strategy, and present an independent evaluation using flux-tower networks with screened spatial representativeness. Validation results show strong agreement with in-situ measurements, with correlations typically above 0.9 and RMSE around 2 K for both soil and vegetation temperatures. Seasonal variations in performance reflect expected hydrothermal conditions, and a small cool bias is attributable to the temporal sampling of satellite observations. GloSVeT provides a new basis for studying surface energy partitioning, monitoring hydrothermal dynamics, and supporting ecosystem and climate model applications.

4:00pm - 4:15pm

Design and Field Validation of a MWIR Vicarious Calibration Framework with Controlled-Emissivity Targets

Sun Do Lim¹, DongHwan Cha², Doo Chun Seo², Jae-Keun Yoo¹, Seongchong Park¹

¹Korea Research Institute of Standards and Science (KRISS), Korea, Republic of (South Korea); ²2 Korea Aerospace Research Institute (KARI), Korea, Republic of (South Korea)

This study presents the development of a ground-based observation system designed for vicarious calibration of satellite sensors operating in the mid-wave infrared (MWIR) region. Conventional natural targets used in LWIR calibration lack spectrally stable emissivity in MWIR, motivating the need for dedicated reference targets and high-sensitivity measurement instruments. We introduce a thermally controlled ground reference target whose effective emissivity can be tuned by adjusting the ratio of water and metal surfaces using perforated plates of varying hole diameters. In parallel, an MWIR radiation thermometer employing lock-in detection was developed to enable accurate measurement of low-signal MWIR radiance from room-temperature targets. The system achieved measurement uncertainties down to 20–70 mK. A field campaign was conducted at the Goheung Aerospace Center using the integrated reference targets and radiation thermometer to validate performance under real environmental conditions. The results demonstrate the feasibility of applying controlled emissivity targets and lock-in-based MWIR radiometry to improve the accuracy of MWIR vicarious calibration frameworks.

4:15pm - 4:30pm

Research on Identification Methods of Industrial Heat Source Integrating Thermal Anomaly Features

Yu Liu¹, Xinyuan Gao¹, Lijuan Zheng¹, Hailun Dai², Lijing Xu¹

¹LASAC, China, People's Republic of; ²Beijing Satlmage Information Technology Co. Ltd.

A Method for Identifying Industrial Heat Sources

4:30pm - 4:45pm

A 3D Urban Solar Shortwave Radiation Transfer Model Incorporating Sky View Factor for Remote Sensing Applications

Shisong Cao, Youqiang Dong, Jingyi Chen

Beijing University of Civil Engineering and Architecture, Beijing, China

This study addresses the limitations of conventional urban shortwave radiation simulations in representing complex three-dimensional morphology. A parameterization approach for large-scale urban sky view factor was proposed, significantly improving computational efficiency and spatial adaptability. Based on this, an urban solar shortwave radiation transfer model was developed to quantitatively characterize the shading and reflection effects of building clusters. Furthermore, a novel remote sensing inversion method for urban surface reflectance and solar radiation parameters was introduced, enabling high-accuracy estimation of surface radiative properties and offering a new technical pathway for urban thermal environment and energy balance research.

4:45pm - 5:00pm

Dynamic regime-aware downscaling of MODIS land surface temperature using MODIS-internal predictors.

Marco Ventura, Gabriele Lo Grasso, Emanuele Mandanici, Gabriele Bitelli

University of Bologna, Italy

Urban Heat Islands (UHIs) emerge from reduced vegetation, impervious surfaces, and anthropogenic heat emissions, leading to elevated surface temperatures in urban areas. Monitoring UHIs at fine spatial and temporal scales requires thermal data capable of capturing both urban heterogeneity and daily variability—conditions not satisfied by the native 1 km resolution of MODIS Land Surface Temperature (LST).

This study presents a regime-aware machine learning workflow to downscale daily MODIS LST to the native spatial scale of MODIS NDVI (231 m) over Bologna (Italy), using only MODIS-internal predictors and meteorological forcing. The approach adopts a two-stage architecture: a Ridge regression model estimates a day-level atmospheric bias, while a Random Forest reconstructs pixel-level residuals to recover fine-scale thermal variability from vegetation, land-cover, topographic, and atmospheric predictors. To account for atmospheric control, the dataset is partitioned into three thermal regimes (COLD, MILD, HOT), with independent models trained for each regime.

Pre-processing and data integration were performed in Google Earth Engine using MODIS LST (MOD11A1/MYD11A1), NDVI, SRTM-derived terrain variables, and built-up fraction from ESA WorldCover. Experiments show strong predictive performance (RMSE < 1 K; R² ≈ 0.90) and spatial patterns consistent with Local Climate Zones.

The MILD and HOT regimes provide the largest enhancement in spatial detail compared to the original MODIS product, while the COLD regime shows reduced performance, likely due to weaker surface–atmosphere coupling. Results highlight that atmospheric conditions play a dominant role in downscaling accuracy, exceeding the impact of model architecture. The framework enables scalable, daily UHI monitoring and supports heatwave analysis and climate-resilient urban planning.

5:00pm - 5:15pm

A spatial and spectral Analysis of the Sentinel-2 nighttime Image

Merlijn Dingemanse¹, Daniele Cerra¹, Ferran Gascon², Tobias Storch¹

¹German Aerospace Center (DLR), Germany; ²European Space Agency (ESA), Italy

Nighttime optical remote sensing provides valuable insights into natural and, in particular, human activities. This study evaluates the nighttime imaging capabilities of the Sentinel-2 mission using the only available nighttime acquisition not limited to ocean observations for dark signal calibration, covering the United Arab Emirates with Dubai in 2015. We checked the detection limit using granules over the Persian Gulf, extracted radiance spectra for different regions of interest, and analysed lighting types and temperatures. Results suggest a conservative nighttime detection limit of approx. 0.37 W/m²/um/sr for visible/near infrared bands, and 0.08 W/m²/um/sr for short-wave infrared bands. Sentinel-2’s high spatial resolution and multispectral bands, although designed for daytime observations, were capable of detecting and classifying bright visible/near and short-wave infrared emitters. Comparisons with hyperspectral EnMAP imagery acquired in 2025 validated the classifications and revealed changes in urban lighting over a decade. While limitations apply, this study highlights S2’s potential for nighttime remote sensing and supports considerations of nighttime capabilities for future satellite missions.

3:30pm - 5:15pm

WG II/4C: AI/ML for Geospatial Data
Location: 715B

3:30pm - 3:45pm

DeepChoice: Learning View Weighting for Image-Guided 3D Semantic Segmentation

Antoine Carreaud^1,2, Digre Frinde¹, Shanci Li¹, Jan Skaloud², Adrien Gressin¹

¹University of Applied Sciences Western Switzerland (HES-SO / HEIG-VD); ²ESO lab, EPFL, Switzerland

Multi-view image-to-point label transfer is an effective strategy for 3D semantic segmentation, but its performance largely depends on how predictions from multiple image observations are fused for each 3D point. Most existing pipelines rely on hard voting or handcrafted weighting rules, which do not explicitly learn the reliability of each view under varying geometric and image-quality conditions. In this paper, we introduce DeepChoice, a lightweight view-weighting module for image-guided 3D semantic segmentation. For each visible observation of a 3D point, DeepChoice exploits a compact set of visibility cues, including incidence angle, range, contrast, sharpness, signal-to-noise ratio, and saturation, to predict normalized per-view weights used to aggregate 2D semantic class probabilities into final 3D point-wise predictions. The method is sensor-agnostic, requires no meshing, and can be integrated as a replacement for standard multi-view fusion rules. Experiments on the full GridNet-HD benchmark show that DeepChoice improves over hard voting by 3.85 mIoU points and over mean-probability fusion by 1.26 points, while reducing the gap with the AnyView oracle upper bound. The largest gains are observed on thin and difficult classes such as conductors, pylons, and insulators. Furthermore, a complementary evaluation on the Images PointClouds Cultural Heritage}dataset shows that the proposed weighting strategy remains beneficial under a very different acquisition context and scene structure, yielding a 1.55 mIoU point improvement over hard voting. These results show that learning how to weight views is a simple yet effective way to strengthen image-guided 3D semantic segmentation pipelines. Code is publicly available at: https://huggingface.co/heig-vd-geo/DeepChoice.

3:45pm - 4:00pm

Semantic Segmentation of Textured Non-manifold 3D Meshes using Transformers

Mohammadreza Heidarianbaei, Max Mehltretter, Franz Rottensteiner

Leibniz University Hannover, Germany

Textured 3D meshes jointly encode geometry, topology, and appearance, yet their irregular structure poses significant challenges for deep-learning-based semantic segmentation.

While a few recent methods operate directly on meshes without imposing geometric constraints, they typically overlook the rich textural information also provided by such meshes. We introduce a texture-aware transformer that learns directly from raw pixels associated with each mesh face, coupled with a new hierarchical learning scheme for multi-scale feature aggregation.

A texture branch summarizes all face-level pixels into a learnable token, which is fused with geometrical descriptors and processed by a stack of Two-Stage Transformer Blocks (TSTB), which allow for both a local and a global information flow.

We evaluate our model on the Semantic Urban Meshes benchmark and a newly curated cultural-heritage dataset comprising textured roof tiles with triangle-level annotations with damage types.

Our method achieves 81.9\% mF1 and 94.3\% OA on SUM, and 49.7\% mF1 and 72.8\% OA on new dataset, substantially outperforming existing approaches.

4:00pm - 4:15pm

Pothole Classification using Point Cloud Data: a Comparison between Machine Learning and Deep Learning

Kristin Eggen, Hongchao Fan

Norwegian University of Science and Technology, Norway

Automatic pothole detection is important for improving road maintenance and transportation safety. While image-based pothole detection often struggles under poor lighting and weather conditions, point cloud data provides a robust alternative by capturing detailed surface geometry. Machine learning has demonstrated strong performance in point cloud classification. While traditional machine learning is simpler and relies on handcrafted features, deep learning models are more powerful, as they learn complex, high-dimensional patterns directly from the input data. While most existing work relies on deep learning models, which are time-consuming to train and require extensive labelled datasets, potholes can be well described by geometric features, making pothole detection well-suited for feature engineering. This paper compares traditional machine learning and deep learning approaches for pothole classification using point cloud data, to evaluate whether the added complexity and data demands of deep learning models are justified, or if traditional machine learning techniques are sufficient for accurate classification. A dataset with labelled pothole instances is created to train both models. The machine learning approach uses manually engineered geometric features as input to an ensemble classifier, while the deep learning model is trained on sampled data. Experimental results show that the machine learning approach outperformed the deep learning model. These results suggest that for this particular task, where informative domain-specific features can be manually engineered, the machine learning approach offers a more practical and efficient solution for real-world deployment, where labelled data may be limited.

4:15pm - 4:30pm

From Canopy to Crown: High-Fidelity Tree Facade Synthesis from Nadir LiDAR data

Raghav Sharma¹, Frank Zhang¹, Jane Liu², Baoxin Hu³

¹University of Fraser Valley; ²University of Toronto; ³York University

Synthesizing realistic fac¸ade views of individual trees from nadir-view remote sensing data would transform large-scale forest

analysis, yet remains unsolved due to data scarcity and task ambiguity. We present the first conditional diffusion model to generate

structurally plausible fac¸ade views of individual tree crowns from single nadir-view LiDAR rasters, leveraging the FOR-species20K

benchmark dataset. Our approach integrates nadir projections with tree species and height within a U-Net-based denoising diffusion

framework. Experiments demonstrate that nadir imagery alone is insufficient, but conditioning on species and height enables

synthesis of visually realistic, species-specific fac¸ade views. The fully conditioned model achieves substantial gains in perceptual

(LPIPS: 0.184) and structural (SSIM: 0.576) similarity, outperforming nadir-only baselines by more than twofold. Our results

establish that ancillary attributes critically constrain the solution space, enabling diffusion models to infer plausible structures

from ambiguous nadir input. This work demonstrates a scalable path to enriching nadir-based forest inventories with synthesized

structural detail, reducing the need for resource-intensive ground surveys.

4:30pm - 4:45pm

Evaluation of Metric Monocular Depth Estimation Models Under Adverse Weather Conditions in Driving Scenarios

Nour Khalefa, Roberto Souza, Naser Elsheimy

University of Calgary, Canada

Metric monocular depth estimation has become increasingly important and is often used as a redundancy mechanism in autonom

ous driving, where accurate scene understanding is essential for safe decision-making. In this work, we evaluate three recently

proposed models that represent the state-of-the-art (Depth Anything, PackNet-SfM, and UnidDepth) using zero-shot testing on the

DrivingStereo dataset across diverse weather conditions, and benchmark their performance. Our analysis considers not only metric

depth accuracy metrcis but also each model’s ability to generalize under challenging environmental variations. While UniDepth

achieves notable improvements over Depth Anything and PackNet-SfM, our results show that substantial progress is still needed for

robust real-world deployment. To further assess its practical suitability for autonomous driving applications, we conduct a detailed

examination of UniDepth’s strengths, limitations, and failure modes.

4:45pm - 5:00pm

Out-of-Distribution Detection for Real-World Honey Bee Monitoring Using Simulated Permanent Laser Scanning

William Albert^1,2, Ronald Tabernig^1,2, Jannik S. Meyer¹, Bernhard Höfle^1,2

¹3DGeo Research Group, Institute of Geography, Heidelberg University; ²Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University

We present the first Open-Set Recognition (OSR) workflow for environmental monitoring for Permanent Laser Scanning (PLS) setups, using a Deep Neural Network (DNN) solely trained on simulated data. Such monitoring systems were previously only trained with real-world data and under the closed-set assumption, because they are commonly designed to observe a specific and predefined phenomenon (e.g., beach erosion, rockfall activity, vegetation change, animal behavior). The use of real-world data requires manual labeling, which is tedious given the great amount of point clouds. For this reason, we use Virtual Laser Scanning of Dynamic Scenes (VLS-4D) in a PLS setup to investigate how knowledge from synthetic data can be applied to real-world PLS monitoring systems in open-set settings. We introduce a novel framework that enables Open-Set Recognition (OSR) for animal monitoring (e.g. honey bees) using PLS data. The DNN is fine-tuned exclusively on a simulated LiDAR point cloud time series of flying honey bees, and integrates OSR to handle unknown classes during real-world deployment (e.g., butterflies, leaves, wren, and hare). By leveraging deviations in feature embeddings of the DNN, our method reliably distinguishes the known honey bee class from previously unseen classes, supporting robust monitoring under persistent distribution shifts. This approach reduces the dependence on extensive manual annotation of real-world point clouds, while maintaining reliable classification performance. It also highlights the potential of synthetic training data and OSR for environmental monitoring with PLS systems.

3:30pm - 5:15pm

WG IV/6: Human Behaviour and Spatial Interactions
Location: 716A

3:30pm - 3:45pm

Semantic-Enhanced Dynamic Spatial-Temporal Graph for Human Mobility Prediction

Samira Kamel, Songnian Li

Toronto Metropolitan University, Canada

This work proposes a semantic-enhanced dynamic spatiotemporal model that integrates temporal attention, dynamic graph learning, and semantic module to better capture the complexity of human mobility. By combining dynamic adjacency learning with geographic and semantic structures, the model identifies both physical and functional relationships between zones. Results on TELUS mobility data demonstrate that semantic-enhanced graph construction improves prediction accuracy and robustness, offering a more meaningful representation of urban mobility dynamics and providing a strong foundation for future mobility forecasting and city-scale analytics.

3:45pm - 4:00pm

Development of a Perception-based Urban Quality of Life Index using Street View Imagery and Deep Learning: the Case of Metro Manila, Philippines

Karl Adrian Vergara

Department of Geodetic Engineering, University of the Philippines – Diliman, Quezon City, Philippines

Urban quality of life (QoL) assessments often rely on objective spatial indicators such as infrastructure access, land use, and environmental conditions. However, these metrics may overlook how residents subjectively perceive their surroundings. This disconnect reflects a methodological gap in urban studies: the lack of frameworks that integrate both objective and perceptual aspects of urban quality. In response, this study introduces a Perception-Based Urban Quality of Life Index (PUQLI) derived from street view imagery and deep learning and compares it with a composite objective indicator built from 13 spatially measured indicators across seven QoL domains. Rather than replacing conventional QoL assessments, PUQLI is intended to capture the visual-perceptual or experiential dimension of urban quality as inferred from street-level imagery. Each indicator was normalized and spatially joined to a hexagonal grid system. Pearson correlation revealed only modest associations between PUQLI and the objective indicators, indicating that subjective and objective urban quality are related but not equivalent. A mismatch index was then computed to quantify perception–provision gaps, revealing statistically significant and spatially patterned divergences (t = –10.535, p < 0.0001). Positive mismatch clustered in mixed-use urban centers, whereas negative mismatch aligned with documented environmental and infrastructural stressors; together with the significantly negative mean mismatch, this indicates a structural perception–provision gap in which measurable provision does not always translate into favorable lived experience. These findings highlight the need to integrate subjective perception into urban quality assessment and position the mismatch index as a practical diagnostic tool for perception-informed urban planning.

4:00pm - 4:15pm

Detection and Modeling of Pedestrian Groups Based on Laser Sensor Trajectories

Toshihiro Osaragi¹, Tomona Takeuchi¹, Hiroyuki Kaneko²

¹Institute of Science Tokyo, Japan; ²Kajima Technical Research Institute, Japan

This research develops a pedestrian behavior model that incorporates the existence and dynamics of pedestrian groups. Using high-precision laser sensor data collected in the atrium of a hospital, the research first defines spatiotemporal parameters representing interpersonal distance, relative speed, and walking direction between pedestrians. Based on these parameters, machine learning techniques, including Support Vector Machine (SVM) and Random Forest algorithms, were employed. The SVM demonstrated superior accuracy and stability, successfully identifying groups even under complex walking conditions. Building on these results, the pedestrian behavior model described by psychological stress factors, such as stress from other pedestrians, obstacles, and group dispersion, is improved to account for the behavior of pedestrian groups. Model parameters were calibrated using laser sensor trajectory data with individual attributes (sex, staff, mobility aid usage). The proposed model accurately reproduced observed walking trajectories, with errors within 80 cm for approximately 80% of pedestrians. Finally, the model was applied to evaluate pedestrian spaces by mapping spatial distributions of psychological stress. Pedestrian stress was highest around reception areas, while group dispersion stress was greater in low-density zones where groups tend to spread out. These findings demonstrate that incorporating group behavior enhances the realism and applicability of pedestrian models for evaluating and designing public spaces. Future work will focus on applying the model to diverse facilities and pedestrian environments.

4:15pm - 4:30pm

From sensing to understanding: modeling pedestrian crossing behavior from LiDAR-derived trajectories

Zihan Liu¹, Jiaying Xue¹, Yu Feng², Liqiu Meng¹

¹Chair of Cartography and Visual Analytics, Technical University of Munich, 80333 Munich, Germany; ²Institute for Spatial Information and Surveying Technology, Mainz University of Applied Sciences, 55128 Mainz, Germany

This study presents a workflow that links roadside LiDAR sensing with the modelling of pedestrian crossing behavior. Using self-collected LiDAR data from an informal mid-block crossing in Munich, the workflow includes object detection, tracking, trajectory reconstruction, event extraction, and contextual feature engineering. Behaviour-based yielding and stepping-out moments are used to identify pedestrian decision moments, which are subsequently labelled as gap-accepted or gap-rejected according to gap-acceptance theory. For each decision moment, features describing pedestrian state, social context, and vehicle context are extracted from the reconstructed trajectories. A logistic regression classifier is applied as an interpretable baseline to estimate gap-acceptance decisions under varying traffic conditions. The preliminary results indicate satisfactory predictive performance and show intuitive coefficient patterns, highlighting the influence of vehicle time gaps, pedestrian standing position, and peer presence. Overall, the study demonstrates the effectiveness of LiDAR-derived trajectories as a behavioral sensing foundation for modelling pedestrian crossing decisions.

4:30pm - 4:45pm

Ring-based Spatial Transformer: Learning Non-linear Spatial Interactions between Building Distribution and Pedestrian Flow

Shun Nakayama^1,2, Takahiro Kanamori^2,3, Wanglin Yan²

¹Senshu University, Japan; ²Keio University, Japan; ³PASCO Corporation, Japan

This study proposes a ring-based SpatialTransformer to learn how building uses at different distances from a railway station interact to generate pedestrian flow. Concentric ring buffers at 100-meter intervals up to 800 meters were defined around 100 randomly selected stations in Tokyo, treating each ring as a spatial token. Self-Attention was applied to learn inter-zone interactions directly from data, without prior structural assumptions. GPS-derived walking trip counts served as the target variable and Geographically Weighted Regression as the baseline. Across 30 independent trials, the SpatialTransformer consistently outperformed GWR in predictive accuracy. SHAP analysis revealed that mid-to-outer distance zone features dominate pedestrian flow prediction, while features from the 0-100m zone contributed little. The attention matrix showed that each distance zone attends most strongly to spatially distant zones, demonstrating that pedestrian flow is regulated by structural interactions across the entire catchment area rather than by any single zone in isolation. These findings challenge the compact city assumption that station-proximate development maximizes pedestrian flow, and suggest that land use distribution across the full walkable catchment area deserves greater consideration in urban planning practice.

4:45pm - 5:00pm

Who Can Reach What? Travel-Time-Based Accessibility and Urban Inequality in Los Ángeles, Chile

Richard Weidlin, Guido Staub

University of Concepción, Chile

Urban accessibility is a key factor in understanding spatial inequality, as it conditions residents’ ability to reach essential services and urban opportunities. This study analyses accessibility in the intermediate city of Los Ángeles, Chile, characterized by a centralized concentration of services and expanding peripheral residential areas. Accessibility to educational, healthcare, and commercial facilities was evaluated using approximate travel times generated through the TravelTime API, considering walking, public transport, and private vehicle modes. Travel times were calculated from the centroids of residential census blocks, and opportunity-based accessibility was assessed using travel-time thresholds to identify the range of accessible commercial establishments.The results reveal marked spatial disparities. Central areas exhibit the highest levels of accessibility due to the density and diversity of amenities, with walking emerging as the most efficient mode for short distances. In contrast, peripheral neighbourhoods show limited access to healthcare and educational facilities and depend largely on private vehicles to reach central services, despite having higher population densities. Commercial accessibility in these areas is primarily restricted to small-scale neighbourhood establishments. These findings indicate that accessibility is influenced not only by travel time and transport networks but also by the spatial distribution and variety of urban functions. The study highlights the usefulness of routing APIs as an alternative methodological tool for accessibility analysis in contexts where official mobility data are outdated or incomplete, offering valuable insights for urban planning and policies aimed at reducing spatial inequalities.

5:00pm - 5:15pm

Perception-Oriented 3D Blue–Green–Grey Urban Landscapes: A Multi-Source Data and XGBoost–SHAP Analysis in Geo-information Town

Dehu Yang^1,3, Yin Gao^2,3, Jun Chen^2,3, Feng Yong^3,4, Yuan Yang³

¹Southwest Jiaotong University, Chengdu, China; ²National Geomatics Center of China, Beijing, China; ³Moganshan Geospatial Information Laboratory, Huzhou, China; ⁴China University of Mining and Technology, Xuzhou, China

Rapid urbanization is accelerating the fragmentation of blue–green spaces and the degradation of ecosystem services, while widening inequalities in environmental exposure and access to ecological benefits. Taking the “Geo-information Town” as a case study, this paper develops an integrated 3D framework linking urban form, human behavior and spatial interactions. First, UAV oblique images are semantically segmented to identify blue–green–grey features and to jointly assess and filter image quality. Second, multi-source spatial data, including Gaode POIs, nighttime lights, urban land use, OSM road networks, vector base maps and Baidu heat maps, are used to characterize urban functions and vitality patterns related to catering, sightseeing, shopping and cultural–educational services. Third, social media check-in data from Xiaohongshu and Weibo are incorporated to capture residents’ subjective evaluations and place preferences for different spatial units. An XGBoost–SHAP modelling framework is employed to quantify the relationships between these subjective evaluations and blue–green–grey indicators, and to interpret the marginal contributions of different environmental and functional attributes. The results reveal how perceived landscape qualities and service functions jointly shape spatial attractiveness and human–landscape interactions at the neighborhood scale. Finally, we discuss future research on 3D indicator systems, fine semantic segmentation of blue–green spaces, multi-source big data fusion and perception–behavior–function coupling, providing methodological support for perception-oriented assessment of residential environmental quality and optimization of blue–green urban landscapes.

5:15pm - 5:30pm

Active Mobility Accessibility Index - Assessing Local Transport Competitiveness

Tom Komar, Philip James

Newcastle University, United Kingdom

Active Mobility Accessibility Index (AMAI) quantifies the competitiveness of walking and cycling relative to driving using travel-time and distance ratios on identical sampled origin-destination pairs, reflecting network structure rather than destination choice. AMAI combines time parity and distance parity in a simple diagnostic score, using equal weights as a default specification for interpretation and policy use. Applied across the five Tyne and Wear local authorities, it demonstrates that cycling is more competitive than walking against driving. The median origin-level cycling AMAI is 0.820 and the median walking AMAI is 0.645. Parity remains limited where the share of origins at or above parity is 10.0% for cycling and 1.7% for walking. Initial API-based tests suggested that time-of-day effects are limited for the short local trips studied here, supporting development of a scalable in-house routing workflow for the main analysis. Validation against OA-level Census 2021 mode shares, with controls for terrain gradient and commute-distance composition, suggests that AMAI captures a relevant behavioural signal, while its main value lies in diagnosing local network competitiveness for policy and planning.

5:30pm - 5:45pm

Causal Discovery and Deep Learning-based Interaction-aware Pedestrian Trajectory Prediction

Wen-Xin Qiu, Takashi Fuse

The University of Tokyo, Japan

Understanding pedestrian behaviors is the foundation of simulation for space planning. However, conventional behavior modeling methods are insufficient for learning detailed interactions, and deep learning methods often lack interpretability. This study aims to develop a pedestrian trajectory modeling approach based on discovering causal relationships among pedestrians. The proposed method consists of two parts: analyzing causal relationships among pedestrians using statistical causal discovery methods and predicting trajectories using attention-based deep learning methods. The first part employs a semi-parametric method to identify the causal relationships underlying observed pedestrian behavior and construct a spatial-temporal graph based on these causal relationships. The second part primarily uses the graph attention network to learn interactions among pedestrians. The experimental results demonstrate that the proposed method achieves a good balance between prediction accuracy and interpretability, while also identifying limitations, including at low-density scenes and due to causal model assumptions.

3:30pm - 5:15pm

Forum4C: Hybrid Intelligent Geospatial Computing
Location: 716B

3:30pm - 5:15pm

Forum9B: Exploring the Role of DGGS and AI in Addressing Challenges of National Mapping & Remote Sensing Agencies
Location: 717A

3:30pm - 5:15pm

CATCON
Location: 717B

3:30pm - 5:30pm

P4: Poster Session 4
Location: Exhibition Hall "E"

Flood mapping using multi-temporal Sentinel-1 SAR images: A case study from Lower Tubarão River Sub-basin, Santa Catarina, Brazil

Liliana Osako

Federal University of Santa Catarina, Brazil

Floods are natural hazards triggered by intense rainfall and are particularly destructive in low-lying areas such as floodplains. In flood-prone regions, effective disaster management relies on prevention, monitoring, and emergency response strategies. In this context, remote sensing, especially Synthetic Aperture Radar (SAR), has become indispensable for flood mapping and monitoring due to its ability to acquire data under adverse weather conditions and persistent cloud cover. Multi-temporal SAR imagery processed into RGB composites allows rapid visualization of inundation patterns, while the Geographic Object-Based Image Analysis (GEOBIA) approach improves the classification of flooded areas through the integration of backscatter thresholds and terrain elevation data. This study investigates the spatial and temporal dynamics of flooding in the Lower Tubarão River Sub-basin (LTRSb), southern Brazil, following an extreme precipitation event that produced 260 mm of accumulated rainfall between May 24 and 25, 2019. The Sentinel-1B SAR images, acquired pre- and post-event, were used to map flooded areas with an overall classification accuracy of 88%. The results indicate that three days after the event, flooding covered 140 km² (29%) of the LTRSb, predominantly affecting agricultural (86.3 km²) and pasture areas (47.6 km²). The flooded extent decreased to 62 km² after 15 days and to 15 km² after two months, with agricultural land consistently accounting for 97% of the flooded area. Urbanized areas (≈1 km²) were also impacted, indicating significant risks to infrastructure and public health. These findings highlight the importance of SAR-based flood monitoring for risk assessment and disaster management in hydrographic basins.

Deformation Pattern Modifications Induced by 2021 Brentonico Earthquake: Insights from EGMS Ortho Products

Rasoul Eskandari, Nicola Genzano, Marco Scaioni

Dept. of Architecture, Built Environment and Construction Engineering, Politecnico di Milano, Via Ponzio 31, 20133 Milano, Italy

Post-seismic deformation reflects the crustal adjustment to stress perturbations induced by earthquakes and may lead to modifications of ongoing deformation patterns. While such effects are well documented for moderate-to-large events, their detectability and significance after low-magnitude earthquakes remain poorly understood. Here, possible deformation pattern modifications associated with the February 2022 ML 3.5 Brentonico earthquake in Northearn Italy are investigated using ground-deformation time series derived from the European Ground Motion Service (EGMS) Ortho products. The earthquake occurrence is treated as a temporal discontinuity, enabling the estimation of pre- and post-event vertical/horizontal differential velocities during 2019-2023. A dual-weighted interpolation scheme, accounting for both spatial proximity and time-series reliability, is applied to derive spatially coherent maps of kinematic modifications. The results reveal measurable and spatially organized changes in deformation patterns, including localized accelerations/decelerations and direction reversals. A clear spatial correspondence between differential velocity anomalies and mapped fault systems suggests that the earthquake acted as a localized stress perturbation, modulating pre-existing tectonic structures. This study demonstrates the capability of EGMS datasets to capture post-seismic deformation signals and highlights the importance of considering low-magnitude events in long-term deformation analyses.

Evaluating Metro Construction Impacts on Urban Ground Stability Using Multi-Temporal Sentinel-1 InSAR

Suat Coskun¹, Caglar Bayik², Fusun Balik Sanli³, Tolga Gorum⁴, Saygin Abdikan⁵

¹Ministry of Environment, Urbanization and Climate Change, Turkiye; ²Bulent Ecevit University,Turkiye; ³Yildiz Technical University, Turkiye; ⁴Istanbul Technical University, Turkiye; ⁵Hacettepe University, Turkiye

Underground transportation networks are essential for mobility in densely populated cities, addressing urbanization challenges such as traffic congestion, noise, and air pollution. Ensuring the safety and reliability of these infrastructures requires structural health monitoring (SHM) systems, which detect faults, deterioration, and damage. While traditional in-situ monitoring techniques provide real-time data, they are often economically restrictive. Synthetic Aperture Radar Interferometry (InSAR) offers advantages for large-area, long-term monitoring and has been successfully applied to various infrastructures, including dams, bridges, highways, and subways.

This study investigates surface displacement along a 15.4 km metro line with 11 stations in the Gebze district of Kocaeli, Türkiye, using multi-temporal InSAR. Sentinel-1 SLC IW data acquired between January 2019 and October 2025 were processed using MiaplPy, generating 504 interferograms with a 6-day temporal baseline. The phase-linking workflow utilized Persistent Scatterers (PS), Distributed Scatterers (DS), and Statistically Homogeneous Pixels (SHP), combined with Combined Phase Linking (CPL) algorithm and SNAPHU for phase unwrapping, to obtain reliable displacement time series and mean deformation velocities.

Results indicate line-of-sight displacements ranging from –10 to 10 mm/year, with the highest movements near the first station. Time series analysis shows stability from 2019 to 2021, a sudden displacement from 2021 to mid-2022, and stabilization until 2025. Monitoring these deformations provides insights into construction-induced dynamics and enables early detection of potential risks. Incorporating additional data, such as lithological, soil, and geotechnical information, can enhance data-driven monitoring. Long-term deformation monitoring ultimately supports the development of sustainable urbanization strategies and contributes to safe, resilient underground infrastructure management.

Global Coverage of Sentinel-1 and Spaceborne LiDAR: A Data-Driven Foundation for Forest Height Estimation

Hossein Aghababaei¹, Giampaoli Ferraioli², Jaan Praks³, Erkki Tomppo⁴, Gilda Schirinzi²

¹University of Twente, Netherlands, The; ²Universita degli Studi di Napoli “Parthenope,; ³Aalto University; ⁴University of Helsinki

While polarimetric interferometric SAR techniques provide a strong theoretical framework for forest height retrieval, their application using C-band Sentinel-1 data is challenging due to repeat-pass acquisition geometry and strong temporal decorrelation. In this study, we assemble a globally distributed dataset combining Sentinel-1 interferometric observations with spaceborne LiDAR forest height measurements from the GEDI and ICESat-2 missions. More than 1800 Sentinel-1 interferometric image pairs were processed and spatially matched with LiDAR observations across tropical, temperate, and boreal forest regions. Sentinel-1 Single Look Complex data were used to derive interferometric coherence and polarimetric–interferometric observables, enabling statistical analysis of their relationship with forest structural properties. The results reveal physical relationships between Sentinel-1 coherence and canopy height across multiple forest biomes, indicating that Sentinel-1 interferometric measurements, under near-zero spatial baseline conditions, retain measurable sensitivity to vegetation structure despite temporal decorrelation effects. These findings provide a conceptual basis for exploiting similar repeat-pass interferometric observations from new low-frequency SAR missions such as NISAR and and upcoming ROSE-L for forest height mapping. In addition, the assembled dataset provides a global benchmark for developing and evaluating data-driven approaches for forest height estimation using Sentinel-1 observation.

Integrating MTInSAR and Geoscientific Data for Subsurface Deformation Monitoring in The Epe Cavern Field, Germany

Chia-Hsiang Yang¹, Marcin Pawlik², Tobias Rudolph², Sebastian Teuwsen¹, Carsten Stemmler¹, Peter Goerke-Mallet², Andreas Müterthies¹

¹EFTAS Remote Sensing Transfer of Technology, Germany; ²Research Centre of Post-Mining, Technische Hochschule Georg Agricola (THGA), Bochum, Germany

This contribution presents an integrated monitoring framework that combines Sentinel-1 multi-temporal InSAR (MTInSAR) with geological, hydrological, and operational datasets to analyse long-term ground deformation in the Epe cavern storage field, Germany. Using the SBAS approach, vertical and horizontal deformation components are derived for the 30 km² storage area, revealing a bowl-shaped subsidence feature and distinct east–west deformation patterns associated with cyclic cavern operation. The analysis incorporates geoscientific information from GeoBasis NRW, AGSI+, ELWAS, and BÜK/BK50 into a harmonized GIS environment. This enables correlation of SAR-derived deformation fields with cavern-pressure cycles, soil characteristics, and groundwater variations, providing process-oriented interpretation of subsurface–surface interactions. Deformation results will be updated to include data through October 2025. The study demonstrates how MTInSAR, combined with geoscientific knowledge, supports transparent deformation assessment and contributes to the development of geospatial digital twins for subsurface infrastructure, improving operational resilience and environmental risk management.

Leveraging PolSAR Features and Machine Learning for Improved Land Cover Discrimination with ALOS-2

Melih Altay¹, Beste Tavus¹, Fatih Fehmi Simsek², Omer Gokberk Narin³, Saygin Abdikan¹

¹Department of Geomatics Engineering, Hacettepe University, Ankara 06800, Türkiye; ²TÜBİTAK Space Technologies Research Institute, Ankara 06800, Türkiye; ³Department of Geomatics Engineering, Afyonkocatepe University, Afyon, 06800, Türkiye

Accurate land-cover mapping in heterogeneous metropolitan regions requires robust methods capable of overcoming limitations of optical imagery, particularly under persistent cloud cover. This study investigates the potential of L-band ALOS-2 PALSAR-2 SAR data for operational land-cover classification over Istanbul, Türkiye, by integrating advanced machine-learning algorithms, feature-selection strategies, and hyperparameter optimization. Four classifiers Random Forest (RF), XGBoost, LightGBM, and a shallow Artificial Neural Network (ANN) were evaluated using full-polarization SAR observations and a reference dataset derived from Sentinel-2 composites, orthophotos, and LPIS parcel boundaries.

Pre-processing included radiometric calibration, Lee filtering, terrain correction, and extraction of GLCM texture metrics from HH and HV channels, yielding an initial 20-feature set reduced to 18 through correlation and variance filtering. A LightGBM-driven Recursive Feature Elimination (RFE) procedure identified an optimal subset of ten features. Model optimization employed Bayesian hyperparameter tuning (TPE) under stratified 5-fold cross-validation to ensure reproducibility and generalization.

Results show that LightGBM achieved the highest accuracy (OA = 85.1%, κ = 0.81), followed by XGBoost (83.6%), RF (81.4%), and ANN (78.9%). Water surfaces were consistently the most accurately classified class, while confusion primarily occurred between urban and bare surfaces. Hyperparameter tuning improved F1-scores across all models, and reducing the feature stack to ten variables enhanced performance without loss of class separability. HV-derived texture features, particularly Entropy and Contrast, provided the highest discriminative power.

The study demonstrates that optimized feature selection and systematic hyperparameter tuning significantly enhance SAR-based land-cover classification, offering a transferable workflow for large-scale metropolitan mapping.

Monitoring and Mapping of Fast and Slow Subsidence in Hard Rock Metal Mining Using SAR Interferometry Techniques on High Resolution TSX/TDX Satellite Data

Avinash Singh, Mohammad Soyeb Alam, Mohammad Khalid Umar

INDIAN INSTITUTE OF TECHNOLOGY (INDIAN SCHOOL OF MINES) DHANBAD JHARKHAND INDIA, India

Mining induced deformation in underground metal mines poses a threat to surface infrastructure, underground access and environmental safety, and needs reliable spatially continuous monitoring. In this contribution, a long-term interferometric SAR analysis over an underground hard rock metal mines (Mine-B) in Khetri Copper Belt, India using TSX/TDX high resolution SAR data is carried out. Coherent small baseline DInSAR time series (CSB-DTS), stacking DInSAR, and single reference PSI chain are implemented. Stacking DInSAR derived average LOS deformation velocity and single-reference PSI derived velocity are obtained for mine-B for dataset of January 2023- December 2024. The obtained results verify that Mine-B is substantially stable while having a persistent fast and slow deformation concentrated inside and around the trough in SoZ-2 in Mine-B. The workflow shows how the combination of CSB-DTS, stacked DInSAR and PSI can facilitate the operational subsidence monitoring and long-term stability evaluation in complex mining environments and gives and gives an indication for the future integration with in-situ measurements and numerical models.

Modelling Drought Codes using ALOS-2 L-Band Polarimetric SAR in Mountainous Forests of British Columbia

Marin Loncar¹, Brigitte Leblon¹, Ashlin Richardson², Laura Bourgeau-Chavez³, Dzhamal Amishev¹

¹Lakehead University; ²British Columbia Wildfire Service; ³Michigan Technological University

Spatially accurate fire danger information is critical for reliably predicting fire ignition probability, spread potential, and behaviour. However, Canadian fire management agencies mainly predict fire danger using weather stations, which only collect observations at explicit spatial points and cannot accurately model the fine-scale spatial variability of moisture across large and remote areas. This study predicts and maps the drought code, a variable representative of the moisture of deep, slow drying, compact organic matter across the landscape of British Columbia using ALOS-2 polarimetric SAR. A random forest model predicted the drought code of target areas with high accuracy to values derived from weather stations. The model was applied to forested areas across a time-series of ALOS-2 images on a grid-by-grid basis at a one square kilometer resolution and predicted the occurrence of fine-scale differences in drought code associated with differences in topography and elevation. The development of this drought code prediction model will allow fire management agencies to predict spatially accurate, fine-scale differences in drought code across the densely forested and highly mountainous landscape of British Columbia, improving fire behaviour and fire prediction systems.

High-resolution water level changes from SAR amplitude data: a new approach testing Sentinel-1 imagery

Lorenza Ranaldi¹, Valeria Belloni¹, Andrea Nascetti², Mattia Crespi^1,3

¹Geodesy and Geomatics Division, Sapienza University of Rome, 00184 Rome, Italy; ²Division of Geoinformatics, KTH Royal Institute of Technology, 11428 Stockholm, Sweden; ³Sapienza School for Advanced Studies, Sapienza University of Rome, 00161 Rome, Italy

Monitoring water levels in small and remote reservoirs is critical due to the climate crisis and rising water demands. Traditional in-situ gauge networks often provide sparse or inconsistent coverage, especially in remote regions. Satellite altimetry provides a global alternative, but it is frequently limited by long revisit times or coarse footprints unsuitable for smaller water bodies. Existing SAR-based methods face inherent limitations: amplitude-based approaches rely on accurate external Digital Elevation Models, whereas interferometric techniques are affected by coherence loss and phase-unwrapping ambiguities. To address these limitations, this research introduces a novel approach for estimating water level changes using SAR amplitude data without relying on prior morphological information. By modeling the coastal zone as a set of distinct planar slopes, the method relates the vertical water level change to the horizontal shoreline shift specific to each slope, observed as changes in the satellite range direction. The stack's standard deviation image is used to identify low-slope areas, where the horizontal response to water level variations is most pronounced. In these regions, area-based image matching is applied to quantify displacements within the coregistered stack. Finally, a least-squares estimation is used to determine temporal water level changes and local coastal slopes. The method was validated on Lake Trasimeno, Italy, using a stack of 30 Sentinel-1 images acquired in 2022. Comparisons with in-situ gauge data demonstrated high reliability, achieving an accuracy of 4 cm and a Normalised Median Absolute Deviation of 9 cm. The preliminary results are promising, while further experiments are currently underway.

Baseline Optimization Strategy for TomoSAR: Comparison Between X- and C- Bands

Xin Zhou^1,2, Lingxiao Peng^1,2, Bei Lin^1,2, Zhe Zhang^1,2, Jinfei Wang³

¹Aerospace Information Technology University, China; ²Suzhou Aerospace Information Research Institute, China; ³The University of Western Ontario

This paper investigates wavelength-adaptive baseline design for spaceborne repeat-pass SAR tomography (TomoSAR) through a controlled simulation framework comparing representative X-band and C-band configurations. The study focuses on how radar wavelength influences the trade-off among vertical resolution, temporal decorrelation sensitivity, sidelobe behaviour, and baseline sampling efficiency. Using a discrete TomoSAR forward model, several experiments are conducted to analyse reconstruction performance under identical aperture, varying coherence conditions, different baseline sampling strategies, joint aperture-spacing design scans, and noise perturbations. Quantitative results show that X-band provides a clear intrinsic resolution advantage under coherent conditions, particularly for closely spaced scatterers, but this advantage degrades more rapidly under temporal decorrelation. C-band, while offering lower nominal resolution, exhibits more stable performance across coherence loss, wider design-space tolerance, and stronger robustness in noisy conditions. The comparison of uniform, minimum-redundancy, and irregular baseline patterns further indicates that baseline optimization is more critical for X-band than for C-band. The study moves beyond the general statement that “X-band is higher resolution whereas C-band is more robust” by providing experiment-based and frequency-dependent baseline design guidance. The findings support practical acquisition planning for future repeat-pass TomoSAR missions and contribute to a more quantitative understanding of wavelength-dependent sampling design.

Why should you start projecting the Ground Range Data in the Slant Range while working with SAR Data, and how can you do it?

Lauriane Mousset^1,2, Laetitia Thirion², Régis Guinvarc'h², Xavier Dupuis¹, Frédéric Brigui¹, Thierry Koleck³

¹DEMR, ONERA, France; ²SONDRA, CentraleSupélec, Université Paris-Saclay, France; ³CESBIO, CNES, France

In order to preserve their quality, SAR data are usually used in their native plane, the slant range. However, it is sometimes necessary to link ground data and radar data. Today, ground range or terrain-corrected data are frequently used for this purpose. An alternative to this approach is to project the data into slant geometry, which allows both the superimposition and co-registration of data from different sensors and the preservation of the resolution and phase of the SAR data.

An Observational Definition of the Absolute Phase in Radar Interferometry

Simon Zwieback

University of Alaska Fairbanks, United States of America

An Observational Definition of the Absolute Phase in Radar Interferometry

The absolute phase in radar interferometry is required for topography and displacement estimation but lacks a general definition. The conventional definition states that absolute phase is proportional to the range difference between primary and secondary acquisitions. This definition is appropriate for simple targets such as point targets, but it cannot be directly applied to general targets.

Here, a universal observational definition of the absolute phase is proposed. It applies to any mode and does not require any assumptions about scattering mechanisms. For differential interferometry, the absolute phase is obtained by temporally unwrapping the phase as the intermediate secondary acquisition time progresses from the primary to the secondary acquisition time. This definition requires a continuous series of interferometric phase measurements along a pre-specified absolute path. The absolute phase matches the wrapped phase modulo 2π and agrees with the conventional definition for point targets.

This contribution discusses several implications of this general definition. The absolute phase of a complex target need not, and sometimes cannot, be proportional to the range difference. An example involving a permafrost landform demonstrates that the absolute phase following a cyclic change is nonzero and cannot be interpreted as a range difference. Another consequence is that the absolute phase in an interferogram can show 2π discontinuities, even when the interferogram itself is continuous and the coherence high.

This general definition enables thorough evaluation of InSAR processing chains and supports interpretation of observations.

Object Change Detection by Using Basis Derived from Multi-temporal PolSAR Images

Jiyong Lee, Junichi Susaki, Yoshie Ishii

Graduate School of Engineering, Kyoto University, Japan

Because of its high reproductivity, PolSAR is a suitable sensor for change detection in urban areas. Although many methods of change detection have been proposed, methods focused on polarimetric states transformation are rarely adapted. Through eigenvalue calculations, polarimetric basis which maximizes the polarimetric sensitivity can be calculated, and if this basis is fixed, quantitative change detection is available. The result from this method shows the obvious change in the target area, which is ‘Umekita project 2nd’ in Osaka city. However, changes outside the target area were also larger so that the change detection was not very effective from a relative viewpoint. To solve this problem, algorithms which surpass unnecessary changes in urban areas should be developed, and deeper understanding of scattering mechanisms in urban areas is needed.

Fluvial Dynamics Changes Driven by Illegal Gold Mining: A Land Use/Land Cover Analysis in the Ecuadorian Amazon

Isabel Adriana Chuizaca-Espinoza^1,2, Daniela Romero-Bermeo^2,3, Luciana das Dores de Jesus da Silva⁴, Andrés Velastegui-Montoya^2,5

¹Earth Observation and Geoinformatics Division, National Institute for Space Research (INPE); ²Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ³Faculty of Life Sciences, ESPOL Polytechnic University; ⁴Departament of Aquatic Systems, Concepción University; ⁵Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University

The Amazon region has experienced increasing pressure due to the expansion of mining, especially illegal alluvial mining, driven by rising gold prices and a lack of economic opportunities. In Ecuador, this activity has contributed significantly to deforestation and the alteration of water systems, affecting river stability and water quality. The increase in suspended sediments and the modification of river channels generate ecological, economic and social impacts, including production losses and increased vulnerability of riverside communities. In this context, monitoring through remote sensing and Geographic Information Systems has become an essential tool for assessing river dynamics and the effects of illegal mining in the Amazon biome. This study analyses changes in land use and land cover in the Nangaritza River, considering an intense rainfall event that occurred in 2025. Cloud-free mosaics were generated using Sentinel-2 images, spectral indices were calculated, and supervised classification using Random Forest was applied to establish seven coverage categories. The results show a notable expansion of mining areas and sand deposits, accompanied by a reduction in forest cover. The transition matrix revealed significant losses of forest transformed into mining soil and turbid water, as well as an increase in sedimented areas downstream. The analysis of river dynamics identified five critical areas of mining expansion associated with increased sedimentation, turbidity, and morphological alterations to the riverbed. These changes reflect the growing anthropogenic pressure on the river and the need to strengthen monitoring systems to mitigate environmental impacts.

Predicting LULC Transformations with Geospatial Intelligence for Sustainable Land Management

Dr. Syed Amer Mahmood, Muhammad Haseeb, Zainab Tahir, Areeba Amer

Institute of space science, university of the punjab, Lahore, Pakistan

This study investigates the rapid transformations in land use and land cover (LULC) within Lahore District, a phenomenon with profound implications for ecological sustainability and land-use governance. Analysing these dynamics is crucial for minimizing adverse environmental impacts and promoting sustainable urban development. The primary objective is to assess historical LULC patterns over 30 years (1994–2024) and to simulate potential changes for the years 2034 and 2044 using an integrated CA-Markov modeling approach supported by GIS techniques. Landsat imagery from multiple sensors (TM and OLI) was processed through supervised classification methods, achieving classification accuracies exceeding 90%. The temporal analysis revealed marked changes, notably a substantial increase in built-up areas by 359.8 km², alongside reductions in vegetation cover (198.7 km²) and barren land (158.5 km²). Water bodies exhibited minimal variation throughout the study period. Future LULC scenarios generated via the CA-Markov hybrid model demonstrated strong predictive performance, as evidenced by a kappa coefficient of 0.92. The projections indicate continued urban Expansion primarily at the expense of green and undeveloped areas. These findings emphasize the pressing need for sustainable land management practices and provide a robust decision-support framework for urban planners. By integrating predictive modeling into planning policies, this research helps align developmental objectives with environmental conservation in rapidly urbanizing regions like Lahore.

From Natural Land to Built-Up Areas: Monitoring Residential Expansion Using Sentinel-2 and Support Vector Machine

Karolayn Gaibor-Yambay¹, David Freire-Granda^2,3, Andrés Velastegui-Montoya^1,3, Marconi de Arruda Pereira⁴

This study evaluates the residential expansion of Guayaquil between 2016–2020 and 2020–2024 using Sentinel-2 imagery and supervised classification with the Support Vector Machine (SVM) algorithm. Given the rapid land transformation in tropical coastal cities, the research applied the Built-up Area Extraction Index (BAEI) to enhance the detection of built-up surfaces and distinguish them from vegetation and bare soil. The integration of BAEI and SVM allowed the development of an accurate, replicable, and low-cost approach to monitor the city’s urban growth.

Three periods of Sentinel-2 images with low cloud coverage were processed, re-projected, classified, and validated. Classification included four thematic classes (residential, vegetation, bare soil, and water bodies) using 1,000 training samples distributed across the city. All classifications achieved over 85% overall accuracy and a Kappa index of 1.0, confirming the model’s robustness in heterogeneous urban environments.

Spatial analysis of land-use transitions revealed that residential growth is concentrated in peripheral sectors such as Ciudad Santiago, Mucho Lote 2, Mi Lote, Trinipuerto, and areas near Narcisa de Jesús Avenue. Results indicate a strong tendency toward contiguous expansion, forming residential corridors along major road networks and the Guayas River. However, dispersed peripheral nuclei highlight challenges for service provision and environmental sustainability.

Overall, the combination of Sentinel-2 imagery, BAEI, and SVM proved highly effective for detecting built-up areas in tropical contexts, offering a scalable methodology for monitoring urban expansion in Latin American cities.

Assessment of automatic hedgerows detection using Pleiades Neo 30cm images and Foundation model

Léa Moyon, Marie Ballere, Noe Roure, Anne Jacquin, David Souveton, Vincent Varoquaux, Alexandre Mayerowitz, Laurent Gabet

Airbus Defence and Space, France

Hedgerows, a traditional agroforestry practice, are declining in Europe, threatening biodiversity and climate control. To support high-quality agricultural carbon credit certification, a method for automatic hedge detection using Pleiades Neo 30cm satellite imagery was developed.

Two methodological approaches were tested in three French study areas with varied landscapes: (i) a classic image segmentation using NDVI, Green Cover Fraction, and LiDAR-derived Digital Height Model, and (ii) a foundation model retrained on 150 annotated tiles with diverse landscape and satellite acquisition configurations. The methods were compared using quantitative (Intersection Over Union, Omission & Commission errors) and qualitative indicators.

The foundation model demonstrated superior hedge detection and robustness across different landscapes. A ground truth dataset based on stratified random sampling and equal allocation was created to allow the quantification of its accuracy using standard accuracy metrics. It achieved a precision of 0.89 and a recall of 0.83 for the hedge class. It effectively adapted to the morphological and ecological diversity of hedges, with few commission errors primarily due to confusion with isolated trees or linear vegetation, and omissions mainly in discontinuous or degraded hedges.

The study confirms the relevance of Pleiades Neo for detecting thin-scale elements like hedges, the effectiveness of foundation models with limited reference data, and their potential for large-scale hedge mapping. Future work aims to incorporate more spectral bands and expand the model's training to detect hedgerows across the European Union under various satellite acquisition contexts, paving the way for operational tools in agricultural carbon credit valuation.

Analysis of Spatiotemporal Changes in Land Cover of Wind Farms within County Areas

Ting Liu¹, Tao Zhang¹, Guanghui Wang¹, Yunjia Zou¹, Wei Zhang¹, Hailun Dai^1,2

¹Land Satellite Remote Sensing Application Center, MNR, Beijing, China; ²Beijing Satlmage Information Technology Co. Ltd., Beijing, China

This study focuses on county-level areas with high-density wind farm distribution in the Xing'an League of Inner Mongolia, China. Using high-resolution satellite imagery from 2016 to 2024, land cover information within and around wind farms was extracted through visual interpretation, and the spatiotemporal dynamics of land cover in these areas were analyzed. The results indicate that: (1) From 2016 to 2024, land cover change in the study area was primarily driven by wind farm expansion, which increased cumulatively by 130.87 km² (+260.35%) and exhibited the highest dynamic degree among all land cover categories (LK = +32.54%/yr). (2) Grassland was the most severely impacted land cover type, with 78.74 km² converted to wind farm land, accounting for 59.66% of the total newly established wind farm area, while cultivated land and forest land contributed 20.06% and 18.33%, respectively. (3) As wind power expanded, the land cover composition within wind farms shifted from a cultivated land–grassland balance toward grassland dominance. (4) Areas subjected to temporary disturbance from wind farm construction activities tended to recover progressively, with cultivated land exhibiting a faster recovery rate than grassland.

An Automated Approach based on Machine Learning for Tracking Urban Expansion: Case of Study in Gharbia Governorate, Egypt

Wael Ahmed^1,2, Islam Atef³, Moustafa Baraka⁴, Adel Elshazly¹, Ahmad M. Senousi^1,2, Walid Darwish^1,2

¹Geomatics Engineering Lab, Public Works Department, Cairo University, Giza 12613, Egypt;; ²NAMAA for Engineering Consultations, Dokki , Giza 12612, Egypt; ³Department of Civil Engineering, King Fahd University of Petroleum Minerals, Dhahran 31261, Saudi Arabia; ⁴Civil Engineering Program, German University in Cairo 11835, Egypt

Addressing the United Nations Sustainable Development Goals, particularly sustainable cities and communities (SDG 11), and

the protection of terrestrial ecosystems (SDG 15), is closely linked to understanding patterns of urbanization. Rapid urban growth

significantly influences ecosystem functions, including transportation, housing, and economic development. Monitoring this growth

and analyzing performance patterns are essential for supporting decision-making and guiding urban planning and management. This

study presents an automatic approach for monitoring urban expansion by applying the Random Forest machine learning classifier

from 2015 to 2025 using Google Earth Engine. The method exploits spectral indices not only for unsupervised classification but

also for training the Random Forest classifier, thereby ensuring a fully automated workflow. The proposed approach is applied

to Gharbia Governorate, a region which lacks surrounding desert margins and is instead entirely composed of fertile agricultural

land, to monitor urban expansion in three-year intervals. The proposed study, which achieved a kappa coefficient exceeding 0.96

across all study periods, revealed a gradual decline in agricultural land from 75.5% in 2015 to 72.7% in 2025. These outcomes offer

valuable insights to support evidence-based planning and promote sustainable land use management.

Detection of Cropland Abandonment through Multi-Temporal Landsat Data and Spatially Independent Machine Learning Validation

Luka Stojanović, Anka Lisec, Krištof Oštir, Bujar Fetai

Faculty of Civil and Geodetic Engineering, University of Ljubljana, Slovenia

Cropland abandonment (CA) is a major land-use change with important environmental and socio-economic implications. This study evaluates cropland abandonment detection using multi-temporal Landsat features and a spatially independent validation framework, comparing the performance and spatial behaviour of Random Forest and XGBoost classifiers. A set of temporally aggregated spectral indices (NDVI, BSI, NDBI, and MNDWI), including multi-year trends and variability measures, was integrated into a 56-band composite dataset. Training and validation samples were generated using 100 × 100-pixel windows centred on land-use parcels, with overlapping areas between different reference classes explicitly excluded to avoid label ambiguity. To reduce spatial autocorrelation, the data were split into separated training (1,582.6 km²) and testing (719.2 km²) areas within the Savinjska statistical region in Slovenia. Random Forest (RF) and XGBoost (XGB) classifiers were trained and evaluated using spatially separated validation data. Classification performance was assessed using overall accuracy, user’s and producer’s accuracy, and F1-score. Results indicate that XGB achieved a higher overall accuracy (0.705) compared to RF (0.670) and exhibited strong sensitivity in detecting cropland abandonment, while RF produced more conservative and spatially stable estimates of abandoned cropland area. Spatial error maps and area-based comparisons reveal systematic differences between the two classifiers, particularly in their tendency to overestimate abandonment extent. The findings highlight the importance of spatially explicit validation strategies, careful reference data preparation, and multi-temporal feature design for robust cropland abandonment mapping. The main contribution lies in the systematic assessment of model behaviour, spatial error patterns, and area estimates under strict spatial separation of training and testing data.

Assessment of different architectures based on 2D-UNet, 3D-UNet and UNet-ConvLSTM for land use land cover classification using multi-modal and multi-temporal satellite images

Fatemeh Saba^1,2, Pedro Achanccaray Diaz², Markus Gerke²

¹University of Hamburg (UHH), Institute of Geography, Hamburg, Germany; ²Institute of Geodesy and Photogrammetry, Technische Universität Braunschweig, Braunschweig, Germany

Land Use/Land Cover (LULC) classification is essential for understanding the spatial distribution of Earth’s surface and for supporting sustainable environmental and economic development. Recent extreme events in Central Europe have emphasized the link between LULC change and disaster vulnerability, highlighting the need for accurate monitoring. Advances in satellite technologies, particularly Sentinel-1 and Sentinel-2, combined with deep learning (DL) methods, have significantly improved LULC mapping. Convolutional Neural Networks (CNNs) excel at spatial feature extraction, Long Short-Term Memory (LSTM) networks capture temporal dependencies, and Convolutional LSTM (ConvLSTM) models integrate both spatial and temporal information.

This study evaluates the comparative performance of DL architectures for LULC classification in the Harz Mountains, Central Germany—a region experiencing notable forest cover loss. We assess 2D-UNet using two temporal processing approaches, examine the effect of attention mechanisms in 3D-UNets, and explore multiple integrations of ConvLSTM layers within UNet architectures. Our goal is to identify the most effective strategy for capturing spatio-temporal dynamics in LULC datasets, contributing to improved monitoring and management of vulnerable landscapes.

Assessing the Temporal Transferability of Random Forest Models for Land Use and Land Cover Change Detection

Beste Tavus¹, Melih Altay², Fatih Fehmi Simsek³, Omer Gokberk Narin⁴, Saygin Abdikan¹

¹Hacettepe University, Türkiye; ²Hacettepe University, Türkiye; ³TÜBİTAK Space Technologies Research Institute, Türkiye; ⁴Afyonkocatepe University, Türkiye

Monitoring land-use and land-cover (LULC) dynamics in rapidly urbanizing regions is critical for sustainable environmental planning. Dynamic metropolitan areas with rapid urbanization, such as Istanbul, Türkiye, are experiencing significant land-cover changes, among which deforestation is one of the most critical. This study presents a Google Earth Engine (GEE)-based framework to monitor LULC changes in Istanbul from 2016 to 2025 by fusing Sentinel-2 optical imagery, Sentinel-1 SAR and topographic data. From these datasets, a feature set—including spectral bands, vegetation indices, SAR backscatter metrics, and topographic variables—was derived and used to train a Random Forest (RF) baseline model on 2016 Land Parcel Identification System (LPIS) reference data. The baseline model was then applied across the time series to assess its temporal transferability, overcoming the limitation of up-to-date ground-truth data. The baseline model achieved an overall accuracy of 72%, calculated using a validation dataset derived from the LPIS reference data. Feature importance analysis revealed that structural variables—particularly DEM and SAR metrics—were the primary contributors to the classification, used in combination with optical features. Time-series results indicate a cumulative decline of 231 km² in agriculture and 379 km² in forest cover during the study period, inversely corresponding to urban growth. The results of the study highlight that, although applying a single-year model without independent annual validation data causes certain uncertainties—arising from methods, sensors, or topography (e.g., misclassifications)—the proposed framework is highly practical for monitoring deforestation and urbanization trends in complex landscapes.

Exploring land use mapping with multimodal data fushion and convolutional neural network

Juan Gu, Yongguo Wang, Yun Gong, Tingting Jin, Jun Wu

Beijing Institute of surveying and mapping, China, People's Republic of

Accurate and efficient land-use mapping provides intuitive spatial information, which helps to rationalize the planning and deployment of land resources, and provides a basis for urban planning, agricultural development, environmental protection and other aspects. This study utilizes the Google Earth Engine platform and the Resnet-50 method to explore the spatial distribution of land use in Daxing District, Beijing in 2023, by combining point of interest (POI) data, nighttime light data, Sentinel-1 data, and Sentinel-2 data. The results of the study show that the accuracy of land use mapping using different data is different, and the accuracy of the Resnet-50 method is better than that of the Random Forest method. Making full use of the band features and index features of Sentienl-1 data and Sentinel-2 data, nighttime light data and POI data can improve the accuracy of land use mapping results. Among them, the land use mapping accuracy of the proposed method is the highest, with an OA of 88.11% and a Kappa coefficient of 0.83. Ranking the importance of different features found that VH band in January-March has the most important effect on the land use mapping results and the residential land in the POI data has the least important effect on the land use mapping results. This study provides a feasible reference program for efficiently and accurately obtaining land use mapping data for a large study area.

Automatic Estimation of Building Construction Year and Height from Earth Observation Data for Urban Risk Assessment

Mateo Gašparović¹, Filip Radić², Iva Gašparović³, Mario Uroš⁴

¹Chair of Photogrammetry and Remote Sensing, Faculty of Geodesy, University of Zagreb, Kačićeva 26, Zagreb, Croatia – mateo.gasparovic@geof.unizg.hr; ²Chair of Photogrammetry and Remote Sensing, Faculty of Geodesy, University of Zagreb, Kačićeva 26, Zagreb, Croatia – filip.radic@geof.unizg.hr; ³State Geodetic Administration, Gruška 20, Zagreb, Croatia – iva.gasparovic@dgu.hr; ⁴Department of Engineering Mechanics, Faculty of Civil Engineering, University of Zagreb, Kačićeva 26, Zagreb, Croat – mario.uros@grad.unizg.hr

Reliable urban risk assessment requires accurate and up-to-date information on building characteristics, particularly construction year and height, which are often incomplete or unavailable in existing databases. This study presents a cloud-based methodology for the automatic estimation of these parameters using multispectral and very high-resolution Earth Observation (EO) data. The proposed approach integrates temporal analysis of multispectral satellite imagery (Sentinel-2 and Landsat) with photogrammetric processing of very high-resolution stereo imagery (Pléiades). Building construction year is estimated by detecting temporal changes in spectral indices using spline-based modeling and discrete-difference analysis, achieving an accuracy of better than ±3 years. Building height is derived from digital surface models generated from satellite stereo imagery, with a mean accuracy of less than 2 m relative to LiDAR reference data (~1.40 m). The methodology was implemented in a cloud computing environment (Google Earth Engine and Google Colab) and tested in the City of Zagreb, Croatia. Validation results show robust performance, with an F1-score of 0.819 for construction year estimation and strong agreement between EO-derived and LiDAR-based height values. The results demonstrate the potential of EO-based methods for scalable, reliable extraction of building information, thereby supporting improved urban risk assessment and decision-making.

Change Detection and Future Land Use Projections in Zhejiang Province, China: A Case Study

Mahdieh Shirmohammadi¹, Jie Shen², Saied Pirasteh³, Yang Li⁴

¹School of Geography, Nanjing Normal University, Nanjing, Jiangsu 210023, China; ²School of Geography, Nanjing Normal University, Nanjing, Jiangsu 210023, China; ³Institute of Artificial Intelligence, Shaoxing University, Shaoxing, 508 West Huancheng Road, Yuecheng District, Zhejiang Province, Postal Code 312000, China; ⁴School of Geography, Nanjing Normal University, Nanjing, Jiangsu 210023, China

Zhejiang Province is experiencing rapid land use/land cover (LULC) transitions driven by urban expansion, infrastructure development, and increasing environmental pressures. Understanding historical dynamics and future trajectories of these changes is essential for informed regional planning and ecological management. This study analyzes land use changes from 2000 to 2020 and forecasts future patterns for 2025 to 2040 by integrating multi-temporal land use data with key spatial drivers, including elevation, slope, aspect, Normalized Difference Vegetation Index (NDVI), and proximity to roads and built-up areas. Change detection results reveal substantial declines in croplands and green spaces alongside rapid urban expansion, particularly around Hangzhou and Shaoxing and along major transportation corridors, reflecting an early phase of accelerated urbanization from a relatively small baseline. Future land-use dynamics were simulated using a hybrid Convolutional Neural Network - Long Short Term Memory (CNN-LSTM)-Cellular Automata (CA)-Markov framework that captures complex spatiotemporal dependencies and neighbourhood interactions under physical and anthropogenic constraints. Model projections indicate a more moderate growth regime from 2025 to 2040, with urban land increasing by 1.7%, croplands decreasing by 2.2%, and modest gains in water bodies (1.9%) and forest cover (1.1%), suggesting landscape saturation and policy-influenced land management. Validation using the observed 2025 land use map demonstrates strong predictive performance, achieving an overall accuracy of 86% and a Kappa coefficient of 79%. The results provide spatially explicit insights to support balanced development and enhanced ecological resilience.

Comparison of Supervised Classification Algorithms for Land Use Land Cover in Guayaquil: An Assessment with Landsat and MapBiomas

Divar Castro-Rodas^1,2, André Pico Molineros³, Rahisha Thottolil⁴, Dimitri Bulatov⁵, Eder MS de Paula⁶, Andrés Velastegui-Montoya^2,3

¹Faculty of Mechanical Engineering and Production Sciences, ESPOL Polytechnic University; ²Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ³Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ⁴Department of Urban and Regional Planning, Faculty of Engineering and the Built Environment, University of Johannesburg; ⁵Fraunhofer IOSB Ettlingen; ⁶Faculty of Geography, Federal University of Pará

Land use/land cover change (LULCC) analysis is essential for understanding environmental transformation and guiding sustainable territorial planning. Remote sensing offers a valuable source of information for monitoring these changes, but the accuracy of thematic maps depends heavily on the classification algorithm applied. This study compares the performance of three widely used supervised Machine Learning (ML) algorithms (Random Forest (RF), Support Vector Machine (SVM), and Artificial Neural Networks (ANN)) for LULC mapping in the Greater Guayaquil region, a tropical area with persistent cloud cover.

A mosaic of Landsat-9 images from 2023 was processed in Google Earth Engine, followed by the selection of representative training and validation samples. The algorithms were implemented in R Studio, and accuracy was evaluated through confusion matrices and external comparison with MapBiomas Ecuador.

Four LULC classes were defined: Forest, Crops, Vegetation-free areas (urban/bare soil), and Water. Results indicate that SVM achieved the highest performance, with 93% overall accuracy and a Kappa coefficient of 0.91, followed by RF (92%; κ = 0.89) and ANN (90%; κ = 0.86). SVM also showed the highest spatial agreement with MapBiomas (>90%). Discrepancies were concentrated in rapidly changing urban–agricultural boundaries.

The superior performance of SVM is attributed to its capacity to model non-linear class boundaries in complex tropical landscapes. Despite expectations that RF would perform best based on previous literature, SVM proved more effective for this specific AOI. The study confirms that Landsat-9 combined with supervised ML models, particularly SVM, offers a robust and cost-effective approach for environmental monitoring and land-use planning in data-limited regions.

Analysis of Sentinel-2A orbital imagery for the detection of deforested areas caused by artisanal mining activities in the Tapajós Environmental Protection Area, northern Jacareacanga municipality, Pará State, Brazil

Julia Klock¹, Liliana Osako¹, Solange Costa², Amintas Rossete³

¹Federal University of Santa Catarina, Brazil; ²Federal University of Amazonas, Brazil; ³Mato Grosso State University, Brazil

This study analyzed the advance of deforestation associated with artisanal mining in the Tapajós Environmental Protection Area (APA), north of Jacareacanga, Pará State, Brazil, for 2017 and 2024. The National Institute for Space Research (INPE) monitors deforestation in the Amazon using remote sensing, and the Tapajós APA stands out among protected areas for high rates of mining-related deforestation. Sentinel-2A images were used to generate the Normalized Difference Vegetation Index (NDVI) and RGB composites, assigning the 2017 NDVI to the red channel and the 2024 NDVI to the green and blue channels. Auxiliary data, including active mining processes in 2024 and the hydrographic network, were integrated for analysis. This approach enabled the identification of two distinct spectral responses: (i) cyan areas corresponding to regions that were non-vegetated in 2017 but exhibited regenerated vegetation in 2024; and (ii) red areas corresponding to regions that were non-vegetated in 2024. The results enabled visualization of the spatial progression of deforestation, particularly along drainage networks and in relation to active mining areas, revealing a pronounced expansion associated with artisanal mining across multiple waterways, with an upstream progression consistent with alluvial gold and cassiterite deposits. The data corroborate deforestation alerts issued by the Real-Time Deforestation Detection System (DETER) and the Deforestation Alert System (SAD)/Imazon, indicating the continuity of artisanal mining pressure in the Tapajós APA. The methodology demonstrated efficiency in detecting environmental changes and can be replicated in other areas under mining pressure, contributing to territorial monitoring and environmental management

Modeling Wildfire Burn Severity in Canadian Megafires

Eleanore Krieger-Pottruff, Sophie Wilkinson

Simon Fraser University, Canada

Wildfire activity in Canada has increased significantly in recent decades, shifting to larger, more frequent fires and the emergence of megafires (>10,000 ha) across various ecozones. These events typically exhibit complex spatial patterns of burn severity, including larger and more homogeneous patches of high severity. The burn severity patterns and their drivers in megafires remain unclear, in particular, across diverse ecozones. Remote sensing indices such as the Relativized Burn Ratio (RBR) provide an effective means of quantifying burn severity at large spatial scales. This study uses RBR to evaluate nine megafires (each >50,000 ha) representing the 95th percentile and above of fire size within varying ecozones between 2016 and 2022. These fires were used to develop two random forest models: one predicting RBR and another predicting the within-fire z-score of RBR. Within-fire standardization of RBR was conducted to see whether it alters the relative importance of environmental drivers. In the RBR model (OOB R² = 0.75), regional variables such as ecozone and fire ID, along with drought code, were dominant predictors. In contrast, the z-score model (OOB R² = 0.68) emphasized fuel characteristics, including biomass and canopy closure, with additional contributions from elevation and drought-related variables. These results suggest that broad regional and fire-regime controls exert a stronger influence on burn severity than local fuel conditions at the megafire scale. Standardizing burn severity within fires reduces this regional signal but does not improve predictive performance, highlighting the importance of accounting for regional variability in large-fire dynamics.

Local Climate Zone Mapping of Bologna: The Key Role of Training and Validation Sites

Gabriele Lo Grasso, Marco Ventura, Emanuele Mandanici, Gabriele Bitelli

Alma Mater Studiorum - University of Bologna, Italy

Urban Heat Islands (UHIs) represent one of the most pervasive manifestations of human-induced modification of the land surface. They arise from the replacement of natural surfaces with impervious materials, reduced evapotranspiration, waste heat emissions, and altered aerodynamic roughness, collectively causing cities to exhibit elevated temperatures relative to surrounding rural areas.

The Local Climate Zone (LCZ) framework introduced by Stewart & Oke (2012) provides a standardized, physically based classification system for describing urban and natural landscape types according to their surface structure, cover, and thermal properties. Unlike traditional land-use/land-cover schemes, LCZs are explicitly designed for urban climate studies and allow for consistent comparison of urban form, function, and thermal behaviour across cities worldwide.

While LCZ maps of Bologna already exist within the WUDAPT protocol, they are characterized by a declared not high level of accuracy. So, the present work aims to produce the first detailed and reliable LCZ thematic map for the Municipality of Bologna, using higher quality, remotely sensed, input data.

To assess the impact of multi and hyperspectral imagery on the classification results, a Sentinel-2 and a PRISMA image were considered for the study.

Overall, this study provides for a first time a detailed and accurate LCZ map for the Municipality of Bologna and confirms the value of combining UCPs with both multispectral and hyperspectral data.

This study was carried out within the Space It Up project funded by the Italian Space Agency, ASI, and the Ministry of University and Research, MUR, under contract n. 2024-5-E.0 - CUP n. I53D24000060005.

Prediction of Urban Spatial Feature Change Using Parallel Computing-Simulation Model with Multimodal Remote Sensing Imagery

Rong Wang^1,2, Panli Tang¹, Yongjiu Feng^1,2

¹College of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, China; ²Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, Shanghai 201210, China

This study focuses on predicting the evolution of internal urban spatial features, a dimension often overlooked in research that prioritizes urban expansion. Using the Yangtze River Delta as the case study, the work integrates multimodal remote sensing data—including high-resolution optical imagery and SAR data—to capture detailed land-use patterns, structural textures, and functional–transportation relationships. These fused datasets support the Futureland model, which applies parallel computing and Generalized Logistic Regression to simulate future spatial configurations with high efficiency and accuracy.

The results suggest that from 2030 to 2050, major cities such as Shanghai, Suzhou, Wuxi, Changzhou, Jiaxing, and Hangzhou will develop into more connected and compact urban clusters. Transportation networks and functional areas are expected to evolve in general alignment, while localized deviations reflect the complexity of internal urban dynamics. Land-use types are projected to undergo new spatial combinations and reorganizations, indicating improved continuity and diversity within the urban structure.

By systematically revealing trends in land-use evolution, transportation–function coupling, and urban form transformation, this research provides a clearer understanding of future urban spatial development. The proposed predictive framework offers valuable guidance for urban planning, governance, and sustainable regional development.

Detecting Eucalypt Canopy Stress from ECOSTRESS Satellite Imagery and Airborne Remote Sensing in South Australia

Donna Fitzgerald¹, Stefan Peters¹, Gunnar Keppel¹, Andrew McGrath², Kerry Cawse-Nicholson³

¹Adelaide University, SA, Australia; ²Airborne Research Australia, SA, Australia; ³Jet Propulsion Laboratory, CA, USA

Global shifts in vegetation patterns highlight the need for effective monitoring as climate conditions intensify. Remote sensing provides valuable tools for detecting stress across landscapes. This study examines whether thermal satellite and airborne data can detect early stress linked to temperature, drought, and fire history. Within Australia, eucalypt species are increasingly vulnerable to climate-driven canopy dieback. Prolonged drought and extreme temperatures increase the risk of dieback. As eucalypts regulate leaf temperature through transpiration, when water is limited, leaf temperatures rise and can be detected using thermal imagery. Therefore, our research questions are:

• Can temperature-related stress patterns be identified?

• Does fire history affect long-term stress?

• Can thermal changes indicate dieback events?

• Does topography shape canopy stress?

Study Area:

Scott Creek Conservation Park in South Australia contains diverse native vegetation and steep terrain. The canopy is dominated by two stringybark eucalypt species, with areas burnt in the 2021 fire and unburnt controls.

Data and Methods:

Thermal satellite data from 2019–2025, including land surface temperature and water stress indicators, were analysed alongside local climate records. Airborne hyperspectral, LiDAR, RGB, and thermal imagery (50 cm) were processed to derive canopy structure, topography, and thermal patterns. A supervised classification was used to assess canopy condition.

Preliminary results:

Indicate that after a fire, high and moderate levels of vegetation stress increased and persisted into the following year. Vegetation in fire-affected areas showed no significant improvement in WUE during the recovery period. suggesting that fire-affected vegetation remained physiologically stressed despite visible regrowth.

Mapping environmental inequality through remote sensing: The afterlives of asbestos mining in Cyprus

Ewa Wilk, Malgorzata Krowczynska

University of Warsaw, Poland

The study investigates the long-term environmental and social impacts of asbestos mining in the Troodos Mountains of Cyprus, where chrysotile extraction between 1907 and 1988 left a lasting legacy of contamination and landscape degradation. Using multi-temporal aerial photographs, Sentinel-2 satellite data, and field observations, the research analyses land use transformations and vegetation recovery processes in the Amiandos mine area. A land use transfer matrix and Normalized Difference Vegetation Index (NDVI) were applied to assess ecological regeneration and detect spatial patterns of recovery.

To address ongoing environmental health risks, pre-trained deep learning models based on convolutional neural networks (CNNs) were used to identify asbestos-cement roofing in high-resolution aerial imagery. The results indicate measurable reforestation since the 1990s, but also reveal remaining asbestos waste deposits and deteriorated roofing materials posing persistent hazards to local communities.

The integration of remote sensing, vegetation indices, and deep learning methods provides a comprehensive approach to understanding environmental inequality in post-industrial landscapes. This framework supports the development of inclusive and data-driven restoration strategies consistent with the European Union’s environmental health goals. By combining spatial intelligence with machine learning, the study demonstrates the potential of remote sensing to monitor ecological recovery and mitigate asbestos-related risks in Cyprus and similar post-mining environments.

Mapping and Understanding the Synergy Between Land Surface Temperature and PM₂.₅ at 250 m Resolution in Wuhan: Implications for Climate Adaptation and Air Quality Management

Ruihan Qiu^1,2, Qingming Zhan^1,2, Shiqi Tu¹, Changling Li^1,2, Yilei Zhang^1,2

¹Wuhan University; ²Research Centre for Digital City

Urban heat and fine particulate matter (PM₂.₅) pollution are critical challenges for sustainable cities, but their high-resolution spatial and temporal patterns are not well understood. This study develops a multi-year 250 m downscaling framework to map the synergy between land surface temperature (LST) and PM₂.₅ in Wuhan, China. Using machine learning–based residual correction, annual, summer, and winter mean PM₂.₅ concentrations in 2015 and 2020 were downscaled from 1 km TAP data to 250 m grids. Correlation and spatial autocorrelation analyses were applied to reveal the spatial patterns of LST–PM₂.₅ interactions. The downscaled PM₂.₅ achieved high accuracy (R² > 0.80), and the heat–pollution relationship showed strong spatial heterogeneity. From 2015 to 2020, synergistic zones changed in the Urban area, consistent with the growth of impervious surfaces. These results provide a fine-scale spatial basis for understanding the coupled dynamics of urban heat and air pollution, supporting integrated strategies for climate adaptation and urban air quality management.

Impact of shoreline ecological restoration on suspended sediment concentration in Shanghai coastal waters

Tian Geng, YiWen Guo

Shanghai Surveying and Mapping Institute, China, People's Republic of

The coastal waters of Shanghai, situated at the confluence of the Yangtze River Estuary and the northern Hangzhou Bay, form a typical high-turbidity aquatic environment influenced by sediment discharge from the Yangtze River and strong tidal dynamics. Extensive urbanization and coastal development have led to the proliferation of hardened shoreline structures in this region, which have altered natural hydrodynamic conditions and sediment transport patterns, contributing to ecological issues such as wetland degradation. In recent years, Shanghai has initiated ecological restoration projects aimed at rehabilitating healthy coastal ecosystems. These restoration efforts, involving geomorphic reshaping, may directly disturb and modify sedimentary environments. However, their impact on suspended sediment concentration (SSC)—a key environmental parameter—across large spatiotemporal scales remains unclear. Traditional in-situ monitoring methods are inadequate for capturing such large-scale dynamic variations, whereas satellite remote sensing provides an effective alternative. This study utilizes multi-source remote sensing data to develop an inversion model for SSC suitable for Shanghai's coastal waters, systematically analyzing the influence of different shoreline types on sediment distribution. The findings illustrate how ecological restructuring of shorelines affects the spatial and temporal variations of SSC, thereby providing a scientific basis for optimizing coastal management strategies and assessing the effectiveness of ecological restoration efforts.

How Land Surface Temperatures Respond to Urban Morphological Block?

Ruirui Dong

Humboldt University Berlin, Germany

This study investigates the critical role of Urban Morphological Blocks (UMBs) in shaping Land Surface Temperature (LST) patterns across seasons and cities. Through a comparative analysis of Beijing, Wuhan, and Fuzhou, China, we integrated multi-source remote sensing and 3D building data to define UMBs based on building height and density. Employing robust statistical models (Geographical Detector and Random Forest Regression), we quantified the driving forces behind LST variations. Our results consistently identified Low-Rise, High-Density blocks as the primary heat contributors, while High-Rise blocks exhibited cooling effects. Crucially, we found a strong seasonality in dominant drivers: surface biophysical parameters (e.g., vegetation, impervious surfaces) governed LST in warm seasons, whereas 3D architectural morphology (especially building height) became paramount in winter. Furthermore, factor interactions revealed synergistic effects, with the combination of block type and vegetation yielding the highest explanatory power. These findings underscore the UMB as a vital unit for urban climate analysis. The study provides actionable insights for planners, recommending targeted mitigation in high-risk blocks, promotion of thermally efficient building forms, and the adoption of season-specific strategies to enhance urban resilience against heat stress.

Blending gauge, multi-satellite and atmospheric reanalysis precipitation products to facilitate drought monitoring

Shuai Cheng

Hohai University, Nanjing 210098, China

Accurate, long-term precipitation data is essential for reliable drought monitoring. This study addresses heterogeneous uncertainties in existing precipitation datasets across mainland China by developing two modifiable weighting schemes: a Cheng-Kling-Gupta Efficiency weighted-ensemble model (CWEM) and Bayesian Model Averaging (BMA). These methods were used to merge seven monthly precipitation datasets into new weighted products (BMAEP and CWEP). The precision and drought monitoring utility of these fused products were evaluated against the benchmark Multi-Source Weighted-Ensemble Precipitation (MSWEP) product using gauge data. Results show that the new weighted schemes outperform individual datasets and MSWEP. Specifically, BMAEP-2P achieved a superior composite CKGE index of 0.828, CWEP-4P attained a higher correlation coefficient (CC) of 0.905, and CWEP-2P excelled in relative bias (0.579%) and root mean square error (20.755 mm). Furthermore, BMAEP or CWEP performed optimally for drought monitoring across all sub-regions of China at multiple time scales (1-24 months), with average highest CC and probability of detection values reaching 0.919 and 0.844, respectively. Contribution analysis identified CPC as the dominant factor enhancing model performance. The study demonstrates that CWEM and BMA methods effectively generate superior precipitation datasets for drought monitoring applications.

Spatial Projection of PM₂.₅ under Mult-scenarios using the Futureland Model and Landsat Image Series in the Yangtze River Delta, China

Panli Tang¹, Yongjiu Feng¹, Mengrong Xi¹, Rong Wang²

¹College of Surveying and Geo-Informatics, Tongji University; ²Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University

Fine particulate matter (PM₂.₅) poses serious risks to environmental quality, human health, and sustainable development. However, existing studies seldom achieve long-term, pixel-level PM₂.₅ projections or comprehensively evaluate scenario-based predictions under different development pathways. This study proposes an integrated method for projecting PM₂.₅ distribution at a pixel-level under multiple scenarios by incorporating land-use simulations, land surface indices, and spatial dependence effects. Using multi-temporal Landsat series, ground-based PM₂.₅ observations, and socio-economic data, we generated land-use projections under the Shared Socioeconomic Pathways (SSPs) using the Futureland model. Corresponding land surface indices (NDVI, NDBI, NDWI) were derived and used within a spatial lag model to predict PM₂.₅ concentrations for the Yangtze River Delta (YRD) from 2010 to 2030. Results indicate that under SSP1, characterized by sustainability, forest area and NDVI increase, leading to a significant decline in PM₂.₅ levels. Conversely, under the fossil-fueled SSP5 scenario, urban expansion drives up NDBI and PM₂.₅ concentrations. These findings demonstrate that increased green space and reduced fossil fuel reliance are crucial for improving air quality. The proposed method provides decision-makers with actionable insights for policy formulation and highlights the environmental importance of land-use planning. Future work will integrate dynamic meteorological models and conduct uncertainty assessments. This scenario-based projection framework can be applied to support sustainable urban and environmental management in rapidly developing regions.

Vertical characterization and transport dynamics of UTLS aerosols over Hubei: a multi-year integrated analysis using CALIOP and MERRA-2

Jun LI, Jinjing Deng, Xin Huang, Chuan Zhan, Zuohua Miao, Qiaozhi Wang

Wuhan University of Science and Technology, People's Republic of China

The upper troposphere and lower stratosphere (UTLS) plays a vital role in the global climate system. Central China, particularly Hubei Province, is located directly downstream of the Tibetan Plateau, making it a key region for observing long-range aerosol transport. In this study, the vertical structure and transport mechanisms of UTLS aerosols over Hubei are investigated using a multi-year (2016-2018) satellite dataset. We applied rigorous Cloud-Aerosol Discrimination (CAD) scores and quality control procedures to nighttime CALIOP profiles to minimize cirrus cloud contamination. Our results show that the UTLS background over this region exhibits low aerosol loading during non-monsoon seasons. However, a substantial aerosol enhancement occurs in summer. The monthly mean extinction coefficient at 14-16 km reaches a peak of 8.05 × 10-3 km-1 in August, with a particulate depolarization ratio of ~0.25, indicating the presence of non-spherical particles related to the Asian Tropopause Aerosol Layer (ATAL). To investigate the physical drivers of this seasonal variation, we integrated MERRA-2 meteorological fields and HYSPLIT backward trajectories. The analysis reveals a dual transport mechanism: First, intense local deep convection driven by the East Asian Summer Monsoon (EASM) vertically pumps boundary-layer pollutants into the upper atmosphere. Second, the Asian Summer Monsoon Anticyclone (ASMA) and the westerly jet stream advect aged aerosols horizontally from the Tibetan Plateau to Central China. These findings provide direct observational evidence of how regional monsoon systems synergistically modulate stratospheric aerosol loading.

"Satellite Image Based Spatial Analysis of Urban Air Quality Index"

Darshana Rawal¹, Sindhu Ranganath², Neha Sharma³, Anjana Vyas⁴

¹Hochschule für Technik Stuttgart, Germany; ²George Washington University, DC,USA; ³ESRI , R&D Center, Aerocity, Delhi, India; ⁴CEPT University,Ahmedabad,India

All living organisms significantly impact air quality, which is vital for the Earth's ecosystems. Air pollution has increased in the Indian subcontinent, mainly due to harmful gases and particles from industrialization and urban development. CNN Health reported that as of February 25, 2020, 21 out of the 30 cities with the worst air quality globally are in this region, with six in the top ten. Urgent research and pollution control measures are needed, especially in urban areas where human health and the environment are most affected.

The Air Quality Index (AQI) measures pollution levels from key pollutants like PM10, PM2.5, ozone, sulfur dioxide, and others, using a scale from 0 to 500. While pollution control boards collect ground data, the limited number of sensors in large cities can be a challenge. Satellite imagery enhances coverage, although in situ data remains essential in many areas.

This research aims to connect satellite remote sensing with air quality monitoring by determining air quality indices for pollutants in Dobson units. In situ sensors measure concentrations in micrograms per cubic meter, while satellites use molecules per square meter (Dobson units). Different techniques, like regression analysis, are used to develop location-specific indices for urban and suburban areas.

The focus of this research is on methodology rather than final conclusions, highlighting the importance of accurate real-world representations through reliable data and atmospheric models.

Estimation and Prediction of PM2.5 and PM10 in Kathmandu District Using Satellite-Derived AOD, Meteorological Factors and Machine Learning

Salina Gurung¹, Nalina Maharjan¹, Sanjeevan Shrestha², Kundan Lal Shrestha³, Nikita Shrestha¹, Nista Shrestha¹, Namrata Shrestha¹

¹Department of Geomatics Engineering, Kathmandu University, Nepal; ²Ministry of Land Management, Cooperatives and Poverty Alleviation, Government of Nepal, Nepal; ³Department of Chemical Science and Engineering, Kathmandu University, Nepal

Air pollution remains a major environmental challenge in Kathmandu District, driven by rapid urbanization, increasing emissions, and meteorological influences. This study examines the spatial and temporal variability of PM₂.₅ and PM₁₀ from 2019 to 2024 by integrating satellite-derived Aerosol Optical Depth (AOD), ground-based measurements, and advanced statistical and machine learning techniques. Two regression approaches—a simple linear model using AOD and a multivariate model incorporating temperature, relative humidity, wind speed, wind direction, and planetary boundary layer height (BLH)—were evaluated using R² and RMSE metrics. The multivariate model consistently outperformed the simple linear regression, demonstrating improved predictive capability and was validated using PM data from the US Embassy monitoring station at Phora Durbar. Seasonal analysis showed pronounced pollution peaks in winter, with PM₂.₅ levels ranging from approximately 165–167 µg/m³, while summer exhibited the lowest concentrations (~51 µg/m³). PM₁₀ showed moderate seasonal variability with a notable decline during spring. The study also identified the influence of wildfire events and meteorological conditions on episodic pollution spikes. Despite limitations related to satellite resolution and uneven ground monitoring coverage, the integration of remote sensing, meteorological parameters, and machine learning proved effective for estimating particulate matter concentrations. Overall, the results highlight distinct seasonal pollution patterns and underscore the value of combined observational and modeling approaches for improving air quality assessment in Kathmandu District.

Combining Spectral and Texture Features of UAV-RGB, PlanetScope, and Sentinel-2 Images for Soybean Leaf Area Index and Aboveground Biomass Estimation and Model Transferability Across Spatial Extents and Resolutions

Michael Ekwe¹, Angela Kross¹, Heather McNairn², David Lapen², Mark Sunohara², Emilia Craiovan², Mark Edwards²

¹Concordia University; ²Agriculture and Agri-Food Canada

This study aims to systematically investigate the influence of spatial extent and spatial resolution on the estimation of soybean LAI and AGB and model transferability during the peak of the growing season. The research objectives are to: 1) assess and compare the predictive performance of Stepwise Multiple Linear Regression (SMLR) and Random Forest (RF) models for estimating LAI and AGB across different spatial extents and spatial resolutions; 2) evaluate the transferability of these models across spatial extents and resolutions to determine their robustness under varying scale conditions. Our results demonstrate that RF model outperformed SMLR and presented the highest LAI estimation accuracies across the three nested spatial extents with RMSE of 0.52m2/m2, 0.33m2/m2, and 0.31m2/m2, respectively, explaining 86%, 91%, and 91% of LAI variability at 1m2, 25m2, and 100m2 extents, respectively. Similarly, the RF model had the overall best accuracies with RMSE of 67.13g/m2, 76.98g/m2, and 58.03g/m2, respectively, explaining 83%, 86%, and 84% of soybean AGB variability at 1m2, 25m2, and 100m2 extents, respectively. Moreover, the results showed that the accuracies of both models increased for both LAI and AGB estimation at larger scales. We found that RF models outperformed SMLR in estimating soybean LAI and AGB at 3m resolution (LAI: R2=0.86, RMSE=0.39m2/m2, rRMSE=6.19%; AGB: R2=0.82, RMSE=59.09g/m2, rRMSE=18.09%) and 10m resolution (LAI: R2=0.92, RMSE=0.28m2/m2, rRMSE=4.36%; AGB: R2=0.80, RMSE=59.95 g/m2, rRMSE=18.35%), respectively. Further, the transferability of RF models showed weaker performance when applied to estimate soybean LAI and AGB at higher (or smaller) spatial extents and coarser (or finer) image resolutions.

Crop classification with random forest using fine-resolution synthetic aperture radar

Matthew Pineda¹, Aaron Berg¹, Qiaoping Zhang², Abbey Papadimitriou¹

¹University of Guelph, Canada; ²ICEYE, Finland

This study looks to use fine resolution Synthetic Aperture Radar (SAR) for crop classification of small scale fields. The study site is the University of Guelph's Elora Research Station and looks to conduct crop classification with a random forest on four plots divided into 28 fields of 7 x 14 m in size. Four crop types are planted in each field which include, alfalfa, corn, soybeans, and winter wheat. The datasets used for the analysis are 4 SAR scenes taken during the May to July growing season with two of the plots used as training sets, and the other two as testing. The dataset is provided by the Finnish microsatellite company, ICEYE, with the data products being 0.5-meter resolution VV images. Additional textural information known as Grey Level Co-occurrence Matrix (GLCM) are processed from the SAR scenes and added to the random forest. The analysis was conduced at the pixel level and a 70-30 training and test split is used, with the final output map being aggregated to display the most populated classes present in each separate field. Results of the study show that only 6 out of 56 fields were wrongly classified. Corn had a producer accuracy (PA) of 0.93 and a user accuracy (UA) of 0.97, and oats with a PA of 0.85 and a UA of 0.88. Soybeans had a moderate performance with a PA of 0.87 and a UA of 0.63, and alfalfa performed the worst with a PA of 0.54 and a UA of 0.88.

Differentiating Eelgrass and Kelps using Hyperspectral Satellite Imagery at the Eastern Shore Islands, Nova Scotia

Neve Foreman, Anders Knudby

University of Ottawa, Canada

The study of marine macrophytes is becoming increasingly important due to the threat of climate change to intertidal environments and the potential of macrophytes as nature-based climate solutions. Laminaria digitata (finger kelp), Saccharina latissima (sugar kelp), and Zostera marina (eelgrass) are three marine macrophytes whose habitats are known as blue carbon ecosystems due to their outstanding carbon sequestration capabilities. These species are found throughout the Eastern Shore Islands, Nova Scotia, an Area of Interest (AOI) for ecological and biological importance identified by Fisheries and Oceans Canada. Hyperspectral satellite imagery has been little explored as a solution to mapping marine macrophytes in comparison to other remote sensing data, including multispectral imagery and airborne hyperspectral imagery. To test the efficacy of hyperspectral satellite imagery for mapping marine macrophytes in cold temperate regions, we mapped finger kelp, sugar kelp, and eelgrass using a PRISMA image, near Sheet Harbour, NS, within the Eastern Shore Islands AOI. The results show that machine learning classifiers can use hyperspectral imagery to differentiate marine macrophytes, but it is more challenging to differentiate between species with very similar reflectance spectra, such as finger kelp and sugar kelp. The classification accuracy also decreases at deeper depths, where the benthos-reflected signal is diminished. Further investigation is needed to determine the value of narrow hyperspectral bands for species level mapping; initial results suggest that hyperspectral imagery can achieve improved discrimination of spectrally similar species of submerged aquatic vegetation compared to multispectral imagery of the same spatial resolution.

Detection of Phyllosphere Diseases and Damage Patterns in Norway spruce from UAV Multispectral High-resolution Images

Sara López Fernández¹, Louisa Eurich², Malin Elfstrand¹, Rosario García Gil³, Eva Lindberg²

¹Forest mycology and plant pathology dept., Swedish University of Agricultural Sciences, Sweden; ²Forest Resources Management dept.,Swedish University of Agricultural Sciences, Sweden; ³Forest Genetics and Plant Physiology dept., Umeå Plant Science Centre, Sweden

Forest damage is an increasing global concern, particularly as climate change intensifies the frequency and severity of both abiotic and biotic stressors. Early detection of stress-induced damage is essential for effective forest management, yet conventional methods remain labour-intensive and slow. A significant knowledge gap persists regarding how abiotic stress, such as drought, interacts with latent fungal pathogens that can shift from asymptomatic to aggressive under unfavourable conditions.

Multispectral imaging has demonstrated strong potential for detecting physiological disturbances in tree canopies, including pest outbreaks, but its capacity to identify pathogen-specific damage remains poorly explored.

In this study, we investigate whether UAV multispectral drone imagery can detect canopy damage linked to fungal pathogens in Norway spruce (Picea abies). Research was conducted in two contrasting trials in southern Sweden, representing optimal versus drought-prone growth conditions. Across the 2023–2024 growing sea-sons, tree vitality, needle condition, and phenology were monitored and paired with fungal community data to classify reference trees by pathogen type and stress response.

Weekly drone flights provided multispectral imagery that was radiometrically corrected, canopy-segmented, and processed to derive vegetation indices and individual-tree crowns. Using reference trees as training data, statistical models will assess damage patterns and vitality loss.

We expect to detect and distinguish stress signatures arising from combined biotic–abiotic interactions. And validate the Eurich et al. damage model in older trees.

Customized crop feature construction using genetic programming for early and in-season crop mapping

Miao Lu, Caiyun Wen, Yan Zha, Qian Song

Institute of Agricultural Resources and Regional Planning, Chinese academy of agricultural science, China, People's Republic of

Early- and in-season crop mapping provides vital information for precision agriculture. It is still a challenge for early- and in-season crop mapping because of the limited available images and similar spectral information. This study aims to enhance early- and in-season crop mapping by developing a Genetic Programming (GP) method to construct customized crop features. GP automatically generated candidate features for the target-crop using early- or in-season images, selected programs with substantial value disparities between target and non-target crops through the fitness function, and finally outputted the customized feature after the evolutionary process. These customized features were then compared with commonly used spectral bands and vegetation indices to evaluate their effectiveness for early- and in-season crop mapping. The results proved that the customized crop features had significant advantages in both early- and in-season crop mapping. The early-season accuracy in April after crop planting was 3.97% to 9.53% higher than spectral features and vegetation indices. Based on the classification for the in-season crop mapping, the customized crop features maintained the best performance. Advantages of customized crop features include the ability to automatically select effective bands of useful months without requiring expert knowledge, the ability to catch and enlarge the subtle spectral differences with the early- and in-season images, and the little information redundancy compared with spectral features and vegetation indices. It can be concluded that the customized crop features are outstanding for early- and in-season crop mapping.

In-Season Potato Nitrogen Prediction from Multispectral UAV Imagery

Ehsan Chatraei Azizabadi, Nasem Badreldin

University of Manitoba, Canada

Efficient nitrogen (N) management is a key factor for sustainable potato production, as over- or under-fertilization can significantly affect yield, quality, and environmental outcomes. This study explores the potential of unmanned aerial vehicle (UAV) multispectral imagery and machine learning (ML) to predict in-season potato nitrogen status under field conditions in Manitoba, Canada. A DJI Mavic 3M equipped with four spectral bands (green, red, red-edge, and near-infrared) was used to capture canopy reflectance at 15 m altitude during the 2023 and 2024 growing seasons. Vegetation indices (VIs) such as NDVI, GNDVI, CIgreen, TCARI, and SRRE were extracted from orthomosaics and evaluated for their relationships with petiole nitrogen concentration (PNC). Feature selection methods including Recursive Feature Elimination (RFE), Boruta, and Partial Least Squares Regression (PLSR) were applied to enhance model efficiency. Random Forest (RF), Support Vector Machine (SVM), and Gradient Boosting Regression (GBR) algorithms were compared for prediction accuracy. RF combined with RFE achieved the highest performance (R² = 0.57), confirming its robustness to multicollinearity and nonlinear relationships. The results highlight the strong relevance of CIgreen and red-edge indices to N variability and demonstrate the potential of UAV-based spectral sensing integrated with ML for precision nitrogen management in potato systems.

Joint Use of Super-resolution and Semantic Segmentation on Sentinel-2 and Sentinel-1 Image Stacks for Detailed Mapping of Mangrove Forests

Misganu Debella-Gilo¹, Eliska Sieglova¹, Richard Adade², Belachew Gizachew³

¹Norwegian University of Life Sciences (NMBU), Norway; ²University of Cape Coast (UCC), Ghana; ³Norwegian Institute of Bioeconomy Research (NIBIO), Norway

Satellite remote sensing remains central to global mangrove forest mapping, yet the effectiveness of existing products is often limited by coarse spatial resolution and insufficient locally representative training data. These constraints are particularly evident in many African coastal regions, where access to very high-resolution satellite imagery and field observations is scarce. Deep learning–based super-resolution offers a promising alternative by enhancing the effective resolution of freely available imagery. This study evaluates the utility of applying semantic segmentation to super-resolved Sentinel-2 and Sentinel-1 data for mangrove mapping in two ecologically distinct regions: Tanzania and Ghana. Using analysis-ready data from Digital Earth Africa, temporal median composites of Sentinel-2 VNIR, red-edge, and SWIR bands, together with Sentinel-1 VH and VV images, were generated at 10 m resolution. A modified ESRGAN model was trained to produce imagery with a five-fold increase in spatial resolution. Both the original and super-resolved datasets were used to train a U-Net–based binary segmentation model, supported by training labels derived from Global Mangrove Watch data, Google Earth imagery, drone surveys, and fieldwork.

Results indicate that super-resolved imagery substantially improves the accuracy and precision of mangrove classifications relative to the original-resolution images. The enhanced spatial detail supports the detection of small mangrove patches, complex shoreline features, and local degradation patterns, yielding more complete estimates of mangrove extent. Incorporating Sentinel-1 backscatter further improves mapping accuracy. The study demonstrates that deep learning–based super-resolution can overcome key limitations of open-access satellite data, enabling more reliable, fine-scale mangrove mapping.

Remote Sensing-Based System for Automated Quantification of Forest Aboveground Biomass

Modeste Meliho, Mathieu Varin, Emmanuelle Boulfroy, Jean-François Prieur, Jonathan Sabourin, Julien Tremblay, Benoit Desrochers, Batistin Bour

CERFO, Canada

Operational quantification of forest aboveground biomass remains one of the most demanding components of Verified Carbon Standard (VCS) project monitoring, largely due to the need for repeated large-scale field inventories. To reduce costs and enable automated updates, CERFO developed for Ecotierra a hierarchical modelling system integrating field measurements, drone photogrammetry, and Sentinel-1/2 imagery. The system is designed to generate biomass updates autonomously every 1–3 years.

In 2024, seventy-two field plots were installed, and plot-level biomass was computed using regional allometric equations. Drone acquisitions from RGB and multispectral sensors produced high-resolution structural and spectral predictors (>300 variables). A machine learning ensemble (AutoGluon), trained on a stratified split of 68 plots, achieved strong accuracy (R² = 78.5%, relative bias = 1.7%). Bias-corrected drone predictions (via empirical quantile mapping) were then used as pseudo-observations for the satellite modelling stage. The Sentinel-based model, combining optical and radar indices, reached R² = 67.8% with a relative bias of 0.79%, demonstrating the value of multi-sensor integration.

A comprehensive uncertainty analysis using one million Monte Carlo simulations confirmed the stability of aggregated results (T² = 84.2%, mean bias = –0.22%), meeting MRV requirements for carbon reporting. The final operational product is a fully automated processing chain that retrieves satellite images, performs preprocessing, computes predictors, applies the trained model, and exports biomass maps.

This approach provides a robust and scalable solution for continuous biomass mapping and forest carbon monitoring. Future improvements include expanding field sampling across broader ecological gradients to reduce uncertainty in underrepresented environments and further strengthen model generalization.

Lidar waveform reconstruction using multi-source remote sensing data for improved forest structure and agb estimation

Benyamin Hosseiny¹, Mahdieh Zaboli², Pouria Ramzi³, Milto Miltiadou⁴, Saeid Homayouni⁵

¹University of Bristol, United Kingdom; ²University of Tehran, Iran; ³Australia Government of Western Australia, Australia; ⁴University of Exeter, UK; ⁵Centre Eau Terre Environnement, Institut National de la Recherche Scientifique, Canada

Accurate estimation of aboveground biomass (AGB) is critical for understanding carbon dynamics and forest structure at regional and global scales. Waveform LiDAR, with its ability to capture detailed vertical profiles of vegetation, has proven highly effective for AGB estimation. However, spaceborne waveform LiDAR missions such as NASA’s GEDI face limitations due to sparse sampling, necessitating integration with complementary remote sensing datasets for continuous coverage. This study develops a comprehensive framework to evaluate the contribution of multispectral optical imagery (Sentinel-2) and dual-polarized SAR data (Sentinel-1 C-band and ALOS PALSAR L-band) in reconstructing forest structure across multiple canopy layers in a tropical forest in French Guiana. Using LVIS waveform LiDAR as a reference and an AGB map derived from LiDAR–SAR fusion, Random Forest models were trained to predict LiDAR waveform metrics at relative heights (RH10–RH98), followed by SHAP analysis to quantify feature importance. Results reveal that satellite data exhibit greatest sensitivity at mid-canopy levels (RH55–RH85), with SWIR bands outperforming other optical features, particularly during the dry season when canopy moisture is reduced. SAR features, especially cross-polarized channels, provide consistent contributions across biomass ranges, though their effectiveness declines in very dense forests (>350 Mg/ha).

Large-Scale Forest Structural Complexity Learning from GEDI WSCI Using Multi-source Remote Sensing Data

Yuqi Lei¹, Zheng Xu¹, Wen Xiao^1,2

¹School of Geography and Information Engineering, China University of Geosciences, 430074 Wuhan, China; ²National Engineering Research Center of Geographic Information System, China University of Geosciences, 430074 Wuhan, China

Forest structural complexity is a key component of forest ecosystems and generally reflects the combined characteristics of tree height, diameter at breast height (DBH), canopy cover, tree spacing, and species composition. While spaceborne LiDAR systems such as GEDI provide near-global full-waveform observations for deriving the Waveform Structural Complexity Index (WSCI), the discrete distribution of GEDI footprints limits their spatial continuity. This study addresses this challenge by integrating GEDI-derived WSCI samples with multisource remote sensing data to enable large-scale mapping of forest structural complexity. We developed and compared machine learning models (RF, SVR) and a deep learning architecture (ConvNeXt) to evaluate their ability to predict WSCI from multisource remote sensing data. The results show that the deep learning framework, supported by multisource remote sensing data, effectively overcomes the discrete footprint limitation of GEDI and enables spatially continuous mapping of forest structural complexity at the regional scale. The ConvNeXt model demonstrated clear advantages, reducing RMSE and MAE to 0.55 and 0.43 (compared with 0.60/0.49 for RF) and improving IoA and correlation to 0.73 and 0.61, thereby enhancing the reliability of regional-scale complexity mapping despite the sparse GEDI footprint distribution. This provides a practical and scalable pathway for large-scale forest structure characterization and supports regional forest monitoring and management. Future work will include expanding the study area, incorporating additional field measurements, integrating terrain-related variables for improved modeling under complex topography, and exploring multi-year datasets to assess temporal dynamics in forest structural complexity.

Soil-SSNet: A Spectral–Spatial Cross-Attention Network for Cropland Soil Salinity Inversion and Environmental Response Analysis

He Li

Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, China, People's Republic of

By integrating remote sensing observations, topography, and crop growth parameters, a multimodal deep learning model named Soil Spectral–Spatial Cross-Attention Network (Soil-SSNet) is proposed. Soil-SSNet includes a spectral sequence convolution module to capture dynamic spectral features, a spatial attention module to address surface heterogeneity and uncover the response relationship between salinity and natural environmental variables, and a multi-head cross-attention mechanism that uses spatial features as Query to guide the selection of spectral-index responses. Compared to traditional machine learning models such as Random Forest (RF) and Support Vector Regression (SVR), the overall accuracy of Soil-SSNet improves by approximately 40%. After incorporating multi-source covariates (water conditions, crop growth status, and topographic factors), the model’s accuracy further increases by about 25%. With the addition of the cross-attention mechanism, accuracy improves by another 35%, significantly enhancing the fusion capability of spectral and environmental information and achieving soil salinity inversion with higher accuracy and stronger generalization. Finally, spectral sensitivity analysis reveals that the 705–750 nm and 1580–2350 nm bands contribute the most to salinity inversion. Mechanism analysis further uncovers a significant coupling effect among salinity, crop growth, and topography: vegetation growth characteristics reflect the intensity of salt stress, topographic factors dominate the redistribution pattern of water and salt, and soil moisture dynamics determine the accumulation and dispersion patterns of salinity. In summary, Soil-SSNet not only improves the accuracy and interpretability of soil salinity inversion in saline-alkali farmland but also provides quantitative evidence for understanding the environmental processes and mechanisms of salinization.

Remote Sensing–Based Spatial Modelling of Avoided Deforestation in Tanzania’s Protected Areas

Belachew Gizachew

Norwegian Institute of Bioeconomy Research (NIBIO), Norway

Tanzania hosts one of Africa’s largest Protected Areas (PA), yet deforestation remains widespread in surrounding unprotected landscapes. Assessing the effectiveness of PAs requires analytical approaches that account for environmental and accessibility biases inherent in PA placement. This study presents a remote-sensing-based spatial modelling workflow that integrates Global Forest Change (GFC) forest-loss time series (2012–2022) with terrain, accessibility, and demographic covariates to quantify avoided deforestation attributable to protection.

Biophysical and anthropogenic variables influencing forest-cover change, including elevation, slope, distance to roads, settlement density, and population distribution, were harmonised to a 30 m grid and combined with protected area boundaries from the World Database on Protected Areas. To address spatial biases, Propensity Score Matching (PSM) was applied to match protected forest pixels with statistically similar unprotected pixels, reducing confounding effects and enabling a credible counterfactual baseline. A binomial logistic regression model was then fitted to the matched dataset to estimate the likelihood of deforestation under different conservation categories.

Results show that protected forests were, on average, about three times more likely to avoid deforestation than comparable unprotected forests. National Parks and Game Reserves demonstrated the strongest outcomes, being nearly ten times more effective at avoiding deforestation. Nature Forest Reserves were around three times more effective, while Forest Reserves and Game Controlled Areas showed more modest effects, being roughly twice as likely to avoid deforestation.

The analysis is transparent, reproducible, and scalable, demonstrating how Earth observation and spatial causal inference can strengthen national forest monitoring, support conservation planning, and inform policy processes.

Alfalfa Fractional Vegetation Cover Estimation Using Sentinel-2 Multispectral Imagery and Machine Learning

Hazhir Bahrami, Karem Chokmani, Saeid Homayouni

Institut national de la recherche scientifique (INRS), Canada

Climate change is increasingly disrupting agricultural ecosystems, particularly in Canada, where rising temperatures and altered precipitation patterns are impacting crop resilience. Alfalfa (Medicago sativa L.), a key forage crop valued for its productivity and nutritional quality, is especially vulnerable to winter stress due to reduced cold tolerance and increased damage from thaw cycles. This study presents a remote sensing-based framework for estimating fractional vegetation cover (FVC), a critical indicator of crop health and ecosystem stability. By integrating Sentinel-2 satellite imagery with high-resolution UAV data, the approach leverages machine learning algorithms, including random forest (RF) and gradient boosting (GB), to efficiently predict alfalfa FVC. UAV-derived RGB orthoimages provide detailed spatial reference data, minimizing the need for extensive field surveys. The proposed method demonstrates the potential of combining multi-source remote sensing with ML to capture complex vegetation dynamics and improve monitoring accuracy. Although both models showed high potential in estimating the alfalfa FVC, RF outperformed GB in terms of all evaluation criteria. The resulting FVC maps provide actionable insights for early spring field assessments, enabling the timely identification of damaged areas and supporting informed crop rotation decisions. The proposed framework was tested across multiple alfalfa fields in 4 provinces of Canada, including Quebec, Ontario, Nova Scotia, and Manitoba, demonstrating robust performance under varying environmental conditions. Its scalability and adaptability make it suitable for broader applications in precision agriculture and climate-resilient crop monitoring.

From Ravaged to Regreened: Declining Gully Erosion in the India’s Chambal Badlands

ATHIF SAYYAF, Ali P Yunus

Department of Earth and Environmental Sciences, Indian Institute of Science Education and Research Mohali, Punjab - 140 306, India

The Chambal–Yamuna Badland Zone (CYBZ) is among the most severely degraded semi-arid landscapes in India, where persistent gully development reduces soil productivity and threatens long-term environmental stability. This study examines how vegetation dynamics and surface deformation have evolved in the CYBZ over the past 25 years by integrating long-term optical and SAR-based remote sensing observations. MODIS NDVI data (2000–2024) processed through Google Earth Engine were used to track vegetation greening and browning patterns, while Sentinel-1A/B SAR datasets (2017–2024) were analysed using Persistent Scatterer Interferometry (PSI) and the Small Baseline Subset (SBAS) approach to quantify deformation linked to active erosion.

The NDVI time series shows a clear and statistically significant rise in vegetation cover across the region, with the strongest greening occurring in the eastern badlands, particularly after 2015. This widespread improvement aligns with increasing rainfall and indicates a gradual transition from highly eroded terrain to more vegetated and potentially stabilised surfaces. InSAR results reveal minimal ground deformation within major gullies, suggesting that gully erosion during the study period has been low. Seasonal fluctuations observed in PS displacement curves correspond to vegetation cycles rather than ongoing surface lowering. Negative deformation signals are predominantly associated with agricultural zones adjacent to the badlands.

Overall, the combined use of MODIS NDVI and Sentinel-1 InSAR provides a robust framework for monitoring ecological recovery and erosion dynamics in geomorphically fragile landscapes. The findings highlight increasing vegetation stability and reduced gully activity, offering new insights into the contemporary evolution of the Chambal badlands

Winter Wheat Yield Prediction Using Machine Learning Algorithms Based on Climatological and Remote Sensing Data

Syed Amer Mahmood, Hafiza Sayyeda Farah Batool Batool, Shaiza Farooq

Institute of space science, university of the punjab, Lahore, Pakistan

Accurate prediction of wheat yield is crucial for ensuring food security through the use of machine learning techniques. This research aims to forecast wheat yield in Pakistan by integrating five remote sensing indices, including Green Normalized Difference Vegetation Index, Normalized Difference Vegetation Index, Enhanced Vegetation Index, Soil Adjusted Vegetation Index, and Atmospherically Resistant Vegetation Index, with five climatic variables: maximum Temperature, Minimum Temperature, Rainfall, Soil Moisture, and Windspeed alongside the drought index, Standardized Precipitation Evapotranspiration Index. Ten model combinations are created within two wheat season scenarios: Full Seasonal Mean and Peak Seasonal Mean. Employing two nonlinear ML algorithms, Random Forest and Support Vector Machines, as well as two linear models, LASSO and Ridge, the study aims to determine the most effective combination and ML algorithm in both scenarios. Results indicate that in SC1, the RF model combination (GNDVI + SPEI + WS + SM) outperformed other models (R2 = 0.75, RMSE = 2.40, MAE = 1.98). Similarly, in SC2, the RF regression surpassed SVM, with the model combination demonstrating the highest performance, achieving R2 = 0.78, RMSE = 2.25, and MAE = 1.88, followed by (NDVI + Tmax + Tmin + PPT + PET + WS + SM; R2 = 0.75). Notably, the linear LASSO model also exhibited comparable performance to RF, achieving R² values of 0.74–0.69 in both scenarios. The findings support SC2 for yield prediction, underscoring the significance and potential of ML methodologies in timely crop yield prediction, establishing a robust foundation for ensuring regional food security.

Investigating AlphaEarth Embeddings for Wetland Mapping: a case study in the Stockholm Region

Ioannis Ioannidis¹, Andrea Nascetti², Jenny Lindblad¹, Lina Suleiman¹

¹Division of Urban and Regional Studies, KTH Royal Institute of Technology, Stockholm, Sweden; ²Division of Geoinformatics, KTH Royal Institute of Technology, Stockholm, Sweden

This study evaluates the utility of AlphaEarth Foundation (AEF) embeddings, a pre-trained geospatial foundation model, for regional wetland mapping in Stockholm County, Sweden. Accurate and up-to-date spatial information is crucial for planning, but traditional methods are challenged by the heterogeneity and variability of wetland environments.

Our research assesses how AEF's 64-dimensional feature vectors, summarizing multi-sensor satellite time series (Sentinel-1, Sentinel-2, Landsat) at 10m resolution, perform when integrated with established remote sensing variables (topographic, hydrological, and LiDAR derivatives) within standard machine-learning workflows (MLP). The methodology employs a two-step hierarchical classification based on the BIOTOP SE inventory: a Level-1 land-cover prediction (Huvudklass) followed by binary wetland identification within suitable classes.

Preliminary results demonstrate the potential of this approach. The Level-1 classification showed strong performance for certain classes (e.g., klass6 F1-score: 0.98). For the binary classification within klass4, the model achieved a robust F1-score of 0.87 for the target Wetland subclass (Precision: 0.90, Recall: 0.88). This work highlights the possibility of adapting global pre-trained satellite embeddings with traditional remote sensing inputs using light machine learning models for practical, policy-relevant environmental applications, such as updating national biotope inventories.

GLSTM-MLP: a deep learning framework for crop type classification in smallholder farms with PlanetScope images

Valinho António¹, Eric Umuhoza², Catherine Nakalembe³, Moise Busogi²

¹African Centre of Excellence in Internet of Things, University of Rwanda, Rwanda; ²Carnegie Mellon University Africa, Rwanda; ³Department of Geographical Sciences, University of Maryland, USA

Food insecurity remains a major challenge in Rwanda, particularly in rural regions where stunting and anemia rates remain high. Because agriculture is dominated by smallholder farms (0.1–0.5 ha) with fragmented fields and frequent intercropping, accurate crop type mapping is both essential and difficult. Traditional machine learning approaches struggle to model the spatial–temporal variability of such landscapes, whereas CNN-based models require large annotated datasets that are costly to obtain.

We introduce GLSTM-MLP, a hybrid framework that integrates LSTM and MLP classifiers with precomputed Haralick descriptors to efficiently encode spatial context. By combining spectral bands (SB), radiometric indices (RI), and elevation data, the model decouples spatial and temporal dependencies, enabling robust crop type classification even with limited training samples.

Using 3 m PlanetScope time series imagery and drone-based ground-truth data from two Rwandan villages, we evaluated GLSTM-MLP against MLP, RF, and SVM across three feature scenarios: (i) SB + RI; (ii) SB + Haralick features; and (iii) SB + RI + Haralick + elevation. We further compared performance with 2DCNN-LSTM and 3DCNN.

GLSTM-MLP consistently outperformed all baselines, achieving F1-scores of 91%, 91%, and 93%, compared with 87–91% for MLP, 89–91% for RF, and 86–89% for SVM. While 2DCNN-LSTM and 3DCNN underperformed in this data-scarce setting (F1 < 85%), highlighting the advantage of integrating domain-driven feature engineering with sequential modeling.

These results demonstrate that combining temporal dynamics with engineered spatial context provides a practical, data-efficient pathway for accurate crop type classification in heterogeneous, smallholder-dominated farms in SSA, even under limited ground-truth availability.

Tree Health Assessment Using Mask R-CNN on UAV Multispectral Imagery over Apple Orchards

Mohadeseh Kaviani¹, Brigitte Leblon¹, Thangarajah Akilan², Dzhamal Amishev¹, Armand LaRocque³, Ata Haddadi⁴

¹Faculty of Natural Resource Management, Lakehead University; ²Department of Software Engineering, Lakehead University; ³Faculty of Forestry and Environmental Management, University of New Brunswick; ⁴Atkinsrealis, Woodbridge, Ontario, Canada

Accurate monitoring of tree health is important for ensuring sustainable and efficient orchard management in precision agriculture. We evaluated a modified Mask R-CNN deep learning framework for assessing apple tree health using multispectral UAV imagery. The model was tested with four backbone architectures (ResNet-50, ResNet-101, ResNeXt-101, and Swin Transformer) on three image combinations: RGB, 5-band multispectral imagery, and three principal components (3PCs) derived from five spectral bands and twelve vegetation indices. Among all configurations, the Mask R-CNN with a ResNeXt-101 backbone trained on 5-band multispectral imagery achieved the highest performance, reaching an F1-score of 85.70%. In comparison, PCA-based 3-component inputs performed lower (F1 = 82.75%), indicating that while dimensionality reduction reduces computational cost, it may also discard critical information relevant to vegetation health.

Testing the Suitability of Portable SLAM LiDAR to derive Structural Traits of Holm Oaks (Quercus ilex)

Leon Vehlken¹, Arnaud Carrara², M. Pilar Martín³, Gerardo Moreno Marcos⁴, Jan Wolf¹, Georg Bareth¹, Christoph Hütt¹

¹University of Cologne, Germany; ²Fundación Centro de Estudios Ambientales del Mediterráneo; ³SpecLab, Spanish National Research Council; ⁴Universidad de Extremadura

Holm oaks (Quercus ilex) are a keystone species of the Mediterranean savannas in the southwest of the Iberian Peninsul, which are of high ecological and socioeconomic value. This ecosystem is increasingly threatened by Seca, a decline process of oaks driven by abiotic factors and the pathogen Phytophthora cinnamomi. Monitoring tree vitality is therefore essential, and structural traits such as diameter at breast height (DBH) provide early indicators of stress-related growth reduction.

LiDAR remote sensing enables efficient derivation of these metrics, but existing methods involve trade-offs: terrestrial laser scanning offers high detail but limited coverage, while airborne and UAS-LiDAR cover larger areas but often lack sufficient point density. Portable SLAM (Simultaneous Localization and Mapping) LiDAR systems offer a promising alternative, providing flexible, high-resolution data collection across broad areas.

This study assesses the potential of a portable SLAM system to derive holm oak structural attributes. In July 2025, approximately 450 trees across 17 ha in Majadas de Tiétar (Spain) were scanned. In a first attempt, based on an Outer Hull Model, DBH was derived for 9 trees by fitting convex hulls to point cloud stem slices extracted at 1.3m height. Initial validation against field measurements showed strong agreement (R² = 0.971; RMSE = 3.33 cm).

These first results demonstrate that portable SLAM LiDAR can reliably capture stem structure and support large-scale monitoring.

Application of PRISMA Hyperspectral data for Improving Landcover Mapping in Kenya’s Dryland Forest Stratification Zone

Merceline Awuor Ojwala¹, Maria Antonia Brovelli², Polychronis Kolokousis³

¹Sapienza University of Rome, Italy, Politecnico di Milano; ²Politecnico di Milano; ³Politecnico di Milano

A study focusing on investigating how hyperspectral data can be used towards enhancing landcover mapping accuracy in the Drylands ecosystem, so as to support evidence-based decision-making, strengthen restoration planning, promote conservation efforts and enhance national and international reporting frameworks.

The study looks at spectral separability analysis to quantitatively show how best PRISMA hyperspectral data can distinguish land-cover classes being mapped in the study area as compared to Sentnel-2.

The study also presents supervised classification analysis results performed using Random Forest classifier on Sentinel-2 and PRISMA original image, PCA transformed image and MFN transformed image and comparing their accuracy levels.

Finally, the study looks at spectral un-mixing to be able to quantify in terms of abundance, which landcover class is present in each pixel and to what proportion.

Towards Operational Grapevine Cultivar Discrimination Using Hyperspectral Data: From Proximal Analysis to Satellite-Based Mapping

Kyle Loggenberg¹, Albert Strever², Zahn Münch¹

¹Stellenbosch University; ²South African Grape and Wine Research Institute

This study advances precision viticulture by developing a scalable hyperspectral and GeoAI framework for grapevine cultivar discrimination. Using proximal spectrometry and satellite hyperspectral imagery, the research demonstrates the methodological and feature-level transferability of spectral information from in-field spectrometry to spaceborne data. Ten machine learning and deep learning algorithms were evaluated, with Support Vector Machines (SVM) and a 1D Convolutional Neural Network (1D CNN) achieving the highest accuracies. A novel Partial Least Squares (PLS) ensemble feature selection approach reduced data dimensionality by 95%, identifying key red–NIR, Green and SWIR spectral regions for cultivar mapping. Transferring these features to pansharpened PRISMA hyperspectral satellite imagery yielded high classification accuracies (>80%) at 5 m resolution, confirming the operational potential of hyperspectral GeoAI for vineyard characterisation. The findings establish a foundation for scalable, satellite-driven cultivar mapping to support site-specific management and digital viticulture practices.

A Tree Crown Delineation Method Based on a Gradient Feature-Driven Expansion Process

Jiaxuan Jia, Lei Zhang

Aerospace Information Research Institute, Chinese Academy of Sciences, China

TBA ...

Spatial Analysis of Mining Intensity in Buffer Zones of Protected Areas of the Ecuadorian Amazon

Henrry Lojan-Tenesaca¹, Isabel Adriana Chuizaca-Espinoza^2,3, Andrés Velastegui-Montoya^1,3, Paulo Escandón-Panchana^4,5, Fabiola D. Yépez-Rincón⁶, Kevin Rodriguez González⁶

¹Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ²Earth Observation and Geoinformatics Division, National Institute for Space Research (INPE); ³Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ⁴Departamento de Ingeniería Cartográfica y Topografía, Universidad Politécnica de Madrid (UPM); ⁵Escuela de Ciencias Ambientales, Universidad Espíritu Santo; ⁶Universidad Autónoma de Nuevo León, Faculty of Civil Engineering, Department of Geomatics, San Nicolás de los Garza, Nuevo León, México

The Ecuadorian Amazon faces growing socio-environmental pressure from gold mining, which threatens biodiversity and ecosystem integrity. In Zamora Chinchipe, a province that hosts more than 600,000 ha under protection (18% of Ecuador’s continental protected areas), mining expansion reveals a critical tension between conservation and extraction. This study evaluates the spatial distribution and intensity of gold mining in the buffer zones of six protected areas, using data from MapBiomas Ecuador and Geographic Information System (GIS) techniques.

Mining areas within 5 km of each protected area were extracted from MapBiomas LULC maps and analysed through Kernel Density Estimation (Epanechnikov function, 500 m cell size, 2500 m radius). The results reveal heterogeneous mining pressure, with hotspots concentrated in Cerro Plateado, Podocarpus, and El Zarza, often within 1.6–5 km of official boundaries. Spatial correlation shows that 89% of hotspots lie within 500 m of watercourses and 78% in slopes between 15°–35°, highlighting the geomorphological and hydrological dependency of mining activities.

Conversely, areas such as Yacuambi and Tiwi Nunka show minimal pressure, where local governance and indigenous territorial control have effectively limited extractive expansion. These results demonstrate that governance factors are as critical as physical conditions in determining conservation outcomes.

The integration of MapBiomas data and KDE offers a replicable, low-cost tool for monitoring mining dynamics, providing spatial evidence to strengthen protected area management and inform sustainable territorial planning in the Amazon region.

Approaches to atmospheric modelling and multi-Source Data collection and processing for the FINCH CubeSat

Noam Tal-Siegel^1,4, Kemal Emirhan Uygun^2,4, Shuo Chen^1,4, Fedor Pavlov^1,4, Logan Norton^3,4, Zoe Augspach^1,4, Zara Graham^1,4, Andrei Akopian^1,4

¹University of Toronto Faculty of Arts and Science, Toronto, Canada; ²University of Toronto Scarborough, Toronto, Canada; ³University of Toronto Mississauga, Toronto, Canada; ⁴University of Toronto Aerospace Team Space Systems Division, Toronto, Canada

We are presenting an atmospheric modeling and inversion pipeline for the FINCH (Field Imaging Nanosatellite for Crop residue Hyperspectral mapping) hyperspectral imaging CubeSat, built by the University of Toronto Aerospace Team Space Systems division. Such a pipeline will allow us to validate our spectral unmixing pipeline under possible FINCH imaging conditions, and permit scientifically useful data collection. FINCH, through innovations in crop residue cover mapping, will further enable sustainable agricultural practices.

Estimation of diurnal hydraulic status of spruce trees using drone-based hyperspectral images and green shoulder indices

Chiara Zabeo¹, Luiz Cosimo¹, Leonie Schönbeck², Jose Lopez³, Langning Huo¹

¹Department of Forest Resource Management, Swedish University of Agricultural Sciences, Umea, Sweden; ²Southern Swedish Forest Research Centre, Swedish University of Agricultural Sciences, Alnarp, Sweden; ³Department of Forest Ecology and Management, Swedish University of Agricultural Sciences, Umea, Sweden

Tree hydraulic functioning varies strongly over the diurnal cycle as transpiration, stomatal conductance, and xylem tension shift with radiation and vapor pressure deficit (VPD). Capturing these within-day dynamics remotely is essential for interpreting stress signals from airborne or satellite sensors, yet most remote-sensing studies treat forest condition as static. For conifers such as Norway spruce (Picea abies), even brief midday hydraulic limitation can contribute to long-term drought vulnerability and bark-beetle susceptibility, underscoring the need for diurnal monitoring.

Optical indicators such as the Photochemical Reflectance Index (PRI) track rapid photosynthetic-efficiency changes via xanthophyll-cycle activity but often saturate in dense canopies and are sensitive to geometry and structural effects. PRI’s two-band design also underrepresents the broader green-shoulder region (520–550 nm), where carotenoid–chlorophyll interactions more reliably reflect stress in evergreen species.

To address these limitations, we apply a family of green-shoulder indices (GSCR) that integrate information across multiple narrow bands in the 520–550 nm plateau. These indices capture both rapid xanthophyll dynamics and slower pigment adjustments linked to declining water status. Recent UAV studies show that GSCR metrics are highly sensitive to early physiological stress in spruce.

This study extends GSCR use from static health detection to tracking diurnal hydraulic processes. We (1) test whether spectral indices can reproduce the within-day trajectory of spruce physiological activity, including midday depression, and (2) quantify relationships between spectral indices, leaf water potential, and sap-flow velocity. By combining drone-based hyperspectral imaging with high-frequency hydraulic measurements, we establish a framework linking pigment dynamics to diurnal hydraulic status at the crown scale.

Spatiotemporal Modelling of Ground-Level Air Temperature in an agricultural context: Rigorous Evaluation of LST Modis and Landsat-8 Imagery Data

Marica Teresa Rocca¹, Andrea Bergamaschi², Christian Massimiliano Baldin¹, Fabio Dell'Acqua², Vittorio Marco Casella¹

¹Dept. of Civil Engineering and Architecture, University of Pavia, Italy; ²Dept. of Industrial and Information Engineering, University of Pavia, Italy

Ground-level air temperature (Tair) is an essential variable for climate monitoring, agricultural management, and hazard prevention. Conventional ground-based measurements often fail to capture the fine-scale spatial variability, especially in regions with complex terrain. Land Surface Temperature (LST) remote sensing offers a complementary solution, providing spatially continuous and temporally frequent observations. This study evaluates the potential of MODIS and Landsat-8 LST products to estimate Tair in a heterogeneous agricultural landscape. We developed spatiotemporal regression models linking satellite-derived LST to ground observations from meteorological stations over five years (2018–2022). MODIS data provided high temporal coverage through 8-day composites, while Landsat-8 offered higher spatial resolution LST via the Statistical Mono-Window algorithm. The models were validated using Leave-One-Out Cross-Validation, achieving high predictive accuracy for MODIS-based Tair estimation (R² = 0.981, RMSE = 1.1 °C), whereas Landsat-8 captured finer spatial variability (R² = 0.859, RMSE = 3.4 °C). Our results demonstrate that integrating multi-resolution LST products enables accurate, dense mapping of Tair, supporting operational forecasting for precision agriculture. The study also discusses limitations related to land-cover heterogeneity, temporal representativeness, and potential extensions using spatial correlation methods or radar-derived crop-structure information.

Ontario Ministry of Agriculture, Food and Agribusiness

Sam De La Mothe¹, Brian McLaren¹, Brigitte Leblon¹, Armand LaRocque¹, Christine O'Reilly²

¹Lakehead University; ²Ontario Ministry of Agriculture, Food and Agribusiness

This study uses UAV imagery and videos to monitor cattle behaviour in a rotational grazing system in Thunder Bay, Ontario, Canada.

Assessing effect of droughts and heatwaves on Indian tropical forests using time-series meteorological and vegetation biophysical parameters

Hitendra Padalia, Lipsa Pradhan, Rohan MS

Indian Institute of Remote Sensing, India

Heat waves and droughts are recurring global phenomena that profoundly influence terrestrial ecosystems. This study assessed the impacts of drought and heat waves on the functioning of tropical evergreen (Kerela, South India) and moist deciduous (Barkot, North India) forest sites using meteorological and satellite vegetation products, including the FAPAR, SIF, GPP, and ET during 2007– 2018. The SPI, calculated from ERA5 daily data, was used to identify drought occurrences in space and time, while the Mann–Kendall test detected historical trends. Heat waves were characterized using hourly maximum temperature data from ERA5, following the criteria of IMD. Temporal anomalies were quantified using Z-scores, along with the Mann–Kendall trend test and Theil–Sen’s slope analysis. Moist deciduous showed consistent and pronounced declines in productivity and moisture related variables during droughts, particularly during monsoon season, indicating strong sensitivity to water stress. Evergreen forests exhibited more variable responses, with weaker and less consistent drought signals during the pre-monsoon season and mixed responses in vegetation variables even during monsoon drought conditions. Heat wave impacts also varied across forest types. Evergreen forests showed contrasting responses depending on the timing of heatwave events, with early and mid-heatwave phases associated with reductions in productivity, while late heatwave events showed relatively positive or mixed responses among vegetation indicators. In moist deciduous, heatwaves resulted in more consistent negative anomalies in productivity-related variables, although some years exhibited contrasting behaviour in SIF relative to other indicators. The findings underscore the heterogeneous response of forest ecosystems to extreme climatic events.

A single Hyperspectral image for predicting Soil Organic Matter in saline semi-arid lands: insights from Random Forests and optimal band selection models

Soufiane Hajaj^1,2, Ahmed Laamrani^1,2,3,4, Jamal-Eddine Ouzemou^1,2, Nadir El Bouanani^1,2, Ali EL Battay^1,2, Abdelghani Chehbouni^1,2

¹Center for Remote Sensing Applications (CRSA), Mohammed VI Polytechnic University (UM6P), Ben Guerir 43150, Morocco; ²College of Agriculture and Environmental Sciences (CAES), UM6P, Ben Guerir, Morocco; ³Department of Geography, Environment & Geomatics, University of Guelph, Guelph, ON N1G 2W1, Canada; ⁴Institut de recherche sur les forêts (IRF), Université du Québec en Abitibi-Temiscamingue (UQAT), 445 boul. de l’Université, Rouyn-Noranda, Québec J9X 5E4, Canada

Accurate prediction of soil organic matter (SOM) in saline semi-arid regions is vital for sustainable land management. This study evaluates EnMAP hyperspectral imagery combined with machine learning (ML) for SOM estimation using Random Forest (RF) regression applied with three feature selection (FS) algorithms. Embedding RF-FS, recursive feature elimination (RFE), and Competitive Adaptive Reweighted Sampling (CARS), alongside five spectral pre-processing techniques have been analysed. The first derivative (FD) transformation significantly enhanced model performance, outperforming other processing methods. FD combined with RF-FS demonstrated the highest accuracy (PCCC = 0.697, R² = 0.446, RMSE = 0.825) compared to other FS methods. In contrast, all feature-selection approaches applied to the original reflectance showed substantially lower performance, with OR-RFE, OR-RFFS, and OR-CARS yielding PCCC values of only 0.617, 0.610, and 0.547, respectively, and consistently higher RMSE values near 0.90. Frequency analysis identified key informative bands in the SWIR-2 (2207–2445 nm) and visible–NIR (418–801 nm) regions, aligning with known organic matter absorption features. These results demonstrate that integrating derivative spectroscopy with robust feature selection substantially improves SOM prediction in challenging semi-arid environments, providing a effective framework for operational remote sensing of soil fertility.

Should UAV-lidar be collected at night? Impacts of Solar Illumination on Intensity, Penetration, and Ground Surface Detection Over Dense and Wet Vegetation

Cristian Iranzo¹, Nicholas Pontone², Fernando Cabello¹, Jorge Angas³, Koreen Millard²

¹Institute of Research into Environmental Sciences of Aragón (IUCA), Universidad de Zaragoza, C/ Pedro Cerbuna, 12, Zaragoza, 50009, Zaragoza, Spain; ²Department of Geography and Environmental Studies, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, K1S5B6, Canada; ³Department of Ancient Sciences and Institute of Heritage and Humanities (IPH), ARAID–University of Zaragoza, 50009 Zaragoza, Spain

UAV-based lidar systems operate at relatively low power compared to airborne platforms, making them especially sensitive to environmental conditions that influence return strength, canopy penetration, and ground-surface detection. Daytime acquisitions are affected by solar illumination, which introduces photon noise and reduces signal-to-noise ratio, while nighttime missions eliminate this interference and should theoretically enhance intensity and penetration. However, nighttime humidity, dew, and shallow fog can attenuate or scatter the laser pulse, complicating the expected benefits. This study evaluates the combined influence of illumination and atmospheric moisture on UAV-lidar performance across dense and wet vegetation.

We conducted paired day and night flights (Hesai XT32M2X lidar) at 40 m, 75 m, and 100 m over two contrasting peatland sites in eastern Canada: the humid, densely vegetated Alfred Bog (Ontario) and the drier Quinces Bog (Nova Scotia). Independent GNSS checkpoints were used to assess positional accuracy without constraining point-cloud processing.

Nighttime flights at the dry site yielded the clearest benefits, including increased third-return frequencies, stronger intensity values, and slightly improved ground-surface accuracy. In contrast, nighttime flights at Alfred Bog were affected by fog and canopy-level moisture, which produced increased scattering, duplicate points, and reduced ground detection, particularly at lower altitudes. These results show that while nighttime acquisition can improve data quality, its effectiveness depends strongly on humidity and surface wetness.

Overall, both illumination and environmental conditions must be considered when planning UAV-lidar missions. Nighttime flights offer advantages under dry conditions but may degrade outcomes in humid environments where fog or dew is present.

Geospatial Technology for Natural Resource Management in K-J Watershed North India amid Changing Climate

Bhagwan Singh Chaudhary

Kurukshetra University, Kurukshetra-136119, INDIA

Koshalya‑Jhajhara (K-J) watershed in north India, tributaries of Ghaggar River, covering an area of 134.92 km2 in north India was assessed using multisource geospatial datasets. Satellite imagery, ASTER digital elevation model, Survey of India topographic maps and ancillary thematic data were processed to derive key maps including land use and land cover, geology, geomorphology, drainage density and slope. A temporal comparison of land use for 1999–2000 and 2015–2016 reveals rapid urbanization and shrinking forest cover. Built‑up area expanded from 7.12 km2 to 24.84 km2 while forest area contracted from 109.35 km2 to 96.78 km2, indicating substantially increased water demand and reduced natural recharge capacity. Groundwater potential was assessed by integrating geology, geomorphology, drainage density, slope and land use/land cover through an Analytic Hierarchy Process based multi‑criteria evaluation. The resulting Groundwater Potential Zone map shows that the majority of the watershed is stressed: 61.83 km2 falls in the poor category and 37.87 km2 in the very poor category for groundwater availability. These findings highlight critical zones where recharge and demand‑management interventions are urgently required. Surface water enhancement options were identified through geospatial suitability analysis. Fourteen sites were delineated for check dams and fifteen sites for percolation tanks, selected to maximize recharge potential, minimize sedimentation risk, and complement existing drainage patterns. When implemented, these structures will increase local infiltration, raise groundwater tables, and reduce peak surface runoff, thereby improving water security for domestic, agricultural and ecological needs. Measures have been recommended for sustainable land and water management in the watershed.

Synthesizing hyperspectral Data using generative Models to train spectral Unmixing Methods for low-cost Crop Residue Cover Mapping

Ege Artan^1,3, Shuo Chen^2,3, Andrew Peng^2,3, Sammuel Aldrich Karya^2,3, Kyaw Thiha^1,3

¹University of Toronto Scarborough, University of Toronto, Toronto, Canada; ²Faculty of Arts and Science, University of Toronto, Toronto, Canada; ³Space Systems, University of Toronto Aerospace Team, Toronto, Canada

The spectral range of the Field Imaging Nanosatellite for Crop residue Hyperspectral Mapping (FINCH), developed by the University of Toronto Aerospace Team’s Space System Division, poses significant challenges for hyperspectral unmixing to determine crop residue cover fractional abundances. The severe lack of standard indices necessitates the use of complex, data-driven unmixing models. Complex unmixing models require dense, well-distributed manifolds, whereas ground-truth datasets are sparse and expensive. To better leverage the information content in existing ground-truth datasets, this study presents a framework that decouples interpolation and abundance-mapping estimation via an intermediary conditional data generator, contrasting with traditional single-model unmixing pipelines. This decoupling enables the inclusion of physical prior knowledge about spectra, which is not natively accessible to unmixing models, thereby artificially expanding the dataset to serve as a Monte Carlo approximation of the population risk. To test this hypothesis, we have proposed two generator models: a 1D Conditional U-Net with Conformer Layers (GD-Streamline) and a Dual-Path Transformer Conditional Variational AutoEncoder (TCVAE), and two unmixing models: Multi-Layered Perceptrons (MLP) and Fourier Neural Operators (FNO). The results of the study indicate a need for rigorous integration and introduction of spectral priors within the scope of the proposed decoupling framework to prevent domain and mode collapse and hallucination by the generator models; otherwise, this leads to falsely approximated data manifolds, ultimately resulting in unsatisfactory and out-of-distribution unmixing performance.

Advancing Species-Level Mapping of Savannah Woody Vegetation with Multitemporal EnMAP and Sentinel-2 data

Christina Karakizi^1,2,3, Eleni Sofikiti^1,3, Akpona Okujeni^4,5, Vasileios Tsironis³, Athina Psalta³, Konstantinos Karantzalos³, Patrick Hostert⁴, Elias Symeonakis²

¹Hellenic Space Center, Greece; ²Department of Natural Sciences, Manchester Metropolitan University, UK; ³Remote Sensing Laboratory, National Technical University of Athens, Greece; ⁴Geography Department, Humboldt-Universität zu Berlin, Germany; ⁵Helmholtz Center Potsdam, GFZ German Research Center for Geosciences, Germany

Savannahs are vital ecosystems whose sustainability is endangered by the spread of woody plants. This research targets the accurate mapping of fractional woody cover (FWC) at the species level in a South African savannah, using EnMAP hyperspectral data combined with Sentinel-2 multispectral imagery. Field annotations were intergrated with very high-resolution multispectral drone data to produce land cover maps that included three woody species. The high-resolution labelled maps were then used to generate FWC samples for each woody species class at the 30-m spatial resolution of EnMAP. Several machine learning regression algorithms were tested for FWC mapping on multi-seasonal and/or multi-annual EnMAP and Sentinel-2 imagery. Highest accuracy rates were achieved when incorporating data from both the dry and wet seasons, and for most experiments, data acquired across more than one year. The achieved results demonstrated the suitability of our approach for accurately mapping FWC at the species level and highlighted the synergistic potential of EnMAP and Sentinel-2 data for monitoring savannah ecosystems.

Sensitivity of Spaceborne LiDAR, Optical, and SAR Features for Forest Biomass Modeling: A GEDI–Sentinel-2–SAOCOM Analysis

Eren Gursoy Ozdemir¹, Omer Gokberk Narin², Saygin Abdikan³

¹Bartın University, Türkiye; ²Afyon Kocatepe University, Türkiye; ³Hacettepe University, Türkiye

This study investigates the sensitivity of LiDAR, optical, and L-band SAR features for estimating forest above-ground biomass (AGB) in Istanbul’s Belgrade Forest by integrating GEDI lidar measurements with Sentinel-2 and SAOCOM 1A data. Using GEDI L4A AGBD footprints as reference, the authors derived 27 multisource predictors, including spectral indices, SAR backscatter, and polarimetric decomposition parameters. Four machine-learning models—MLP, Kernel Ridge, Lasso, and Elastic Net—were trained and evaluated using a 70/30 train–test split and repeated k-fold cross-validation. Results show limited but notable predictive capability, with R² values ranging from 0.15 to 0.20. The MLP achieved the highest accuracy (R² = 0.20), which can be attributed to its ability to model nonlinear relationships between biomass and multispectral features. Feature-selection experiments reveal that Sentinel-2 red-edge and SWIR bands, along with vegetation indices such as NDVI_red, LSWI, and Cire, consistently provide the strongest contribution to biomass estimation. SAR features contributed less significantly. The study concludes that optical features currently outperform L-band SAR for biomass modelling in this environment and recommends incorporating topography, multitemporal SAR, and additional machine-learning approaches to improve AGB prediction accuracy in future work.

Water Quality Inversion and Spatiotemporal Analysis of Changshu City Based on Multi-source Remote Sensing Data

Siao Liu, Ye Tian, Jianyu Wang, Lei Chen, Ning Wang, Yihuan Li

Satellite Communications Branch China Telecom Co. Ltd.

Rapid urbanization in the plain river network of Changshu City has led to prominent water quality degradation and eutrophication risks. Traditional in-situ monitoring is constrained by sparse sampling and high costs, while conventional remote sensing approaches struggle with accurate water body extraction and stable parameter inversion in turbid, fragmented rivers. This study establishes a targeted remote sensing monitoring framework using 2024 multi-source data (Gaofen-2, Sentinel-2) and field measurements. An optimized modified Normalized Difference Water Index (mNDWI) combined with a spatially weighted adaptive threshold algorithm is adopted to precisely extract complex river networks. Based on Pearson correlation analysis, sensitive spectral bands and band combinations are screened for four key indicators: Chlorophyll-a (CHL-a), Total Nitrogen (TN), Total Phosphorus (TP), and Secchi Depth (SD). Statistical regression and weighted Principal Component Analysis-Random Forest (PCA-RF) models are developed for quantitative inversion, and their accuracy is verified using cross-validation with R² and RMSE. A weighted modified Carlson Trophic State Index suitable for plain river networks is applied for eutrophication assessment, and the single-factor pollution index method combined with the worst-factor principle is adopted to conduct comprehensive water quality evaluation in accordance with the national surface water environmental quality standards. The integrated inversion–evaluation–mapping workflow realizes spatially continuous water quality analysis, providing a reliable and region-adapted technical solution for remote sensing monitoring and scientific management of water environment in plain river network areas.

Mowing Event Temporal Localization on Dense Satellite Time Series using Foundational Models

Tilemachos Moumouris, Vasileios Tsironis, Athena Psalta, Konstantinos Karantzalos

National Technical University of Athens, Greece

Mowing event temporal localization in dense satellite time series is crucial for monitoring agricultural practices and supporting sustainable land use policies. This study presents an innovative approach using a foundational model (FM), Prithvi-EO-2.0, tailored for Earth Observation time series, to precisely detect and temporally localize mowing events in grassland parcels. Unlike traditional methods that primarily identify mowing occurrence or frequency, this work advances the temporal pinpointing of individual mowing events, addressing challenges related to sparse annotations and diverse agro-climatic contexts. The methodology leverages high-resolution Harmonized Landsat and Sentinel-2 (HLS) optical data, treating time series as sliding temporal windows to capture rapid vegetation changes. The FM backbone is combined with a trainable localization head to predict the precise timing of mowing events, supported by a postprocessing step to reduce false detections. The dataset used includes over 450 newly annotated parcels from Central Greece, enabling robust training and comprehensive evaluation with metrics such as F1-Score, Precision, Recall, and Temporal Distance between predicted and actual events. Preliminary results demonstrate a significant improvement in localization performance, with a 6% increase in F1-score and an average temporal deviation of 2.7 time steps from ground truth. Ablation studies validate the impact of temporal window length, model architecture, and postprocessing on performance. The study highlights the strong generalization capabilities of FM-based approaches despite limited fine-tuning data, paving the way for enhanced agricultural monitoring using multi-temporal satellite data.

Assessing flowering dynamics from a remote sensing perspective in macadamia orchards, South Africa

Simone Kriel¹, Angelique Kritzinger¹, Abel Ramoelo^1,2, Philemon Tsele¹

¹University of Pretoria, South Africa; ²South African National Space Agency (SANSA)

Macadamias are the fastest-growing fruit tree crop in South Africa, but the industry is met with challenges due to changing environmental conditions exacerbated by climate change. One of the challenges increasing in frequency and intensity is out-of-season flowering events. These events result in serious problems for orchard management, harvesting practices, and orchard sanitation. Understanding macadamia phenology is, therefore, important and should be investigated as timely phenology changes are crucial in the agricultural sector, particularly the timing of flowering phenology.

Multispectral remote sensing has been successful in quantifying flowering using conventional vegetation indices. However, based on the canopy distribution of macadamia flowers occurring predominantly below the dense evergreen canopy, the use of multi-spectral vegetation indices needs to be complemented to ensure the dependability of the phenology assessments. Synthetic aperture radar data could potentially address these limitations by facilitating the monitoring of within-canopy structural changes throughout the phenology evolution. Furthermore, to ensure validation of flowering phenology signals captured by satellite sensors, the use of unmanned aerial vehicles offers an intermediate level of observation. Therefore, this study aimed to investigate macadamia phenology through the integration of multi-sensor, multi-scale remote sensing data to advance the detection of flowering dynamics in macadamia, located in Barberton, Mpumalanga, South Africa.

This study highlights that macadamia flowering can be detected from a remote sensing perspective, despite the limitation of the flowers being inconspicuous, underscoring the value of integrating optical and synthetic aperture radar data to improve flower detection.

Mapping Perennial Crops in Complex Tropical Landscapes with Harmonized Landsat Sentinel Time Series

Victória Beatriz Soares¹, Édson Luis Bolfe², Taya Cristo Parreiras¹, Danielle Elis Garcia Furuya², Katia de Lima Nechet³

¹State University of Campinas, Brazil; ²Embrapa Agricultura Digital; ³Embrapa Meio Ambiente

Mapping perennial crops in tropical regions remains challenging due to high spectral complexity, frequent cloud cover, and phenological overlap between different types of vegetation. This study evaluated the potential of the Harmonized Landsat-Sentinel (HLS) dataset to identify perennial crops in the municipality of Jacupiranga, São Paulo, Brazil, an area representative of the Atlantic Forest mosaic. A hierarchical classification was applied using the Random Forest (RF) algorithm on temporal compositions of 2024 NDVI, NDWI, and BSI indices, structured into three analytical levels: (1) natural vegetation versus anthropic areas, (2) perennial crops versus other uses, and (3) banana versus peach palm. Accuracies ranged from 0.86 to 0.98 and F1 ranked between 0.86 and 0.95. The most influential variables were concentrated in transitional periods of the annual cycle, reflecting subtle changes in canopy moisture and vegetative vigor rather than a clear distinction between dry and wet seasons, which are not well-defined in this tropical humid environment. The final maps indicate that approximately 25% of Jacupiranga is agricultural land, of which 4,320 ha correspond to perennial crops, with 80% occupied by banana plantations. The results demonstrate the potential of HLS open data to generate accurate multiscale mapping of perennial crops in complex tropical landscapes, supporting digital agriculture and sustainable management in family farming regions.

Land Cover Change and its Drivers in Chile: The Roles of Infrastructure, Population, Topography and Climate Factors

Lissette Cortés, Valentina González, Danniela Espinoza, H. Jaime Hernández

Geomátics and Landscape Ecology Lab, Forestry and Nature Conservation Faculty, Universidad de Chile, Chile

This study identifies the primary drivers of natural land conversion in Chile's 343 municipalities from 1999 to 2024. Using comprehensive spatial data, the analysis reveals that urban and road density are the most significant drivers of this change. In contrast, higher human development acts as a mitigating factor. The strength of these drivers is also shown to be highly dependent on local precipitation patterns. These findings provide a critical, evidence-based foundation for targeted land-use policy and ecosystem conservation in Chile.

Research on inversion technology of empirical models for water chlorophyll concentration based on sentinel-2 images

Jiqiang Tan¹, Yilong Gong¹, Fuzhong Yin², Zhenfa Zhou², Song Bai¹, Hengfei Liu¹

¹Heilongjiang Geomatics Center of MNR; ²Heilongjiang Administration of Surveying, Mapping and Geoinformation

To address the limitations of traditional fixed-point sampling for monitoring water chlorophyll concentration and improve the inversion accuracy of complex inland waters, this study took the Naoli River Nature Reserve as the research area and conducted research based on Sentinel-2 images and measured chlorophyll point data. First, the study performed combined calculations on multispectral bands to generate multiple derived bands, and selected "B4+B5+B6" as the optimal band combination through the coefficient of determination (R²). Then, using this combination as input, various empirical models were constructed and evaluated using multiple indicators. The results showed that the univariate cubic function model had the highest R² and the smallest multiple error indicators, which was significantly better than other models, and successfully realized the spatial inversion of chlorophyll concentration in the study area. This study reveals the complex nonlinear relationship between chlorophyll and band combinations, provides a high-precision inversion technical scheme for water chlorophyll, offers data support for algal bloom early warning and water quality fluctuation tracking, and provides scientific references for the optimization of river basin management measures and ecological protection decisions.

Casting a Neural Net: Satellite-based Coastline extraction with Neural Networks across diverse coastal Environments in British Columbia, Canada

Piper Steffen, Mohsen Ghanbari, Maycira Costa

University of Victoria, Canada

As climate change is increasingly affecting marine and terrestrial ecosystems, researchers, resource managers, and coastal communities are using satellite-based remote sensing, such as Sentinel-2 multispectral imagery, to monitor coastal environments at large scales. The ability to automatically define the position of the coastline from imagery, referred to as “coastline extraction”, is a valuable tool in extending monitoring of coastal ecosystems, such as kelp forests and eelgrass meadows, to regional and provincial scales. In this work, we present a new dataset for water segmentation, and thus coastline extraction, consisting of manually annotated Sentinel-2 images acquired at low tide, specific to the Pacific coast of British Columbia (BC), Canada. We then evaluate three methods for coastline extraction: an adaptive thresholding method, and two convolutional neural networks trained on firstly, a global dataset and secondly, our newly created BC dataset. The model trained on the BC dataset achieved the highest accuracy across standard image segmentation metrics and in coastline positional error measured relative to a manually defined reference coastline. Additionally, very-high resolution unmanned aerial vehicle data collected at validation sites with comparable tide levels to the Sentinel-2 dataset imagery showed that training on BC specific data decreases pixel misclassification, and therefore coastline positional error, due to the presence of subtidal and intertidal algae and vegetation at various validation sites in the study area.

An Explainable Climate-Aware Generative and Predictive Modelling Framework for Simulation of “What-if” Plausible Climatic Scenarios across Multiple Crops

Jayati Vijaywargiya, Omer Waqar

University of the Fraser Valley, British Columbia, Canada

Climate fluctuations influence many aspects of agriculture including crop growth, soil conditions, distribution of fertilizer and water resources. These climatic fluctuations thereby pose significant challenges for agricultural productivity worldwide. However, the availability of agricultural datasets to study the impact of various adverse climatic conditions on different crops remains limited. To address the data availability limitation for agro-climatic impact study, this paper introduces an Explainable Climate-Aware Generative AI framework (XCA-GenAI). The framework combines a Conditional Tabular GAN (CTGAN) to generate realistic synthetic datasets, a Random Forest (RF) regressor to predict crop yield and stress-related parameters, and a SHAP-enabled “what-if” simulation module that evaluates and explains crop responses under varying temperature and rainfall conditions. The proposed framework is employed to generate synthetic representations of ten climatic variations ranging from Very Hot–Dry to Very Cool–Wet using SF24 dataset. Crop-specific predictive models then estimate how change in climatic condition alters crop density, pest pressure, and frost risk. Further, explainability analysis provides interpretable insights of climate impact across multiple crops represented in the dataset. Comprehensively, this work introduces a climate-aware agricultural decision-support framework to aid farmers and agronomists for informed decision making under varying climatic conditions.

A Comprehensive Framework for Remote Sensing and AI-Driven Real-Time Cotton Health Monitoring and Disease Detection

Rohma Ilyas¹, Muhammad Shahzad Sarfraz¹, Ahmad Ali Tabassam², Hafiz Muhammad Danish³

¹National University of Computer and Emerging Sciences, Chiniot-Faisalabad Campus, Pakistan; ²National University of Computer and Emerging Sciences, Multan Campus, Pakistan; ³Harbin Engineering University, Harbin, China

Effective monitoring of cotton fields, especially at the regional level, while also detecting diseases in individual plants, remains a significant problem in precision agriculture. This paper presents a combined framework for monitoring cotton in Pakistan, using satellite remote sensing and artificial intelligence-based leaf image classification. Multi-temporal Sentinel-2 imagery from the 2022 kharif season was used to map cotton fields and evaluate canopy condition during the growing season.

Cotton fields were mapped using a Random Forest classifier with an overall accuracy of 93% and a Kappa coefficient of 0.82. The estimated cotton acreage of 65,269 ha nearly matched official figures. The crop state inside the mapped cotton area was then evaluated using a Fused Health Index constructed from NDVI, EVI, NDMI, NDRE, and SAVI. The results showed geographic variability in canopy condition, with 24.5% of the region falling into the low-health class, 50.9% in the moderate-health class, and 24.6% in the high-health class. A Vision Transformer model achieves 97% accuracy in classifying RGB images of cotton leaves into eight diseases and conditions. The satellite analysis identifies where stress is concentrated at the district scale, while the image-based model gives symptom-level diagnostic help. Together, these results suggest that combining remote sensing and artificial intelligence can improve timely cotton monitoring and allow more targeted field management.

From Pixels to Policy: Using Geospatial Technologies to Assess Sand Mining Regulations

Har Amrit Singh Sandhu, Rajan Dabral

Punjab Engineering College, Chandigarh, India

Sand mining is the practice of removing sand from lakes, rivers, and streams using various techniques, including dry and wet pool mining, bar excavations, skimming, and scalping. Excessive and illegal mining activities can lead to severe environmental hazards, including deforestation, water pollution, soil pollution, and air pollution. Fallacious mining practices can also lead to the exhaustion of resources on open lands and riverbeds. In this study, geospatial techniques have been utilised to investigate and audit sand mining practices surrounding a city in North India, where resources are extracted for the city's development. The present study was conducted with the objective of identifying unauthorised mining activities in allotted sites as well as nearby areas, mapping and verifying mining operations in relation to approved mining plans and environmental clearances, and estimating the minor minerals extracted at the mining site. Satellite images from Sentinel-2 and Google Earth, along with the coordinates of the lease area and EIA reports for the site, served as the data sources for the research. Violations of the rules, such as flow obstruction, mining along riverbanks, and mining outside the lease area, were observed through the use of remote sensing images. Furthermore, it can be concluded from the present study that satellite-derived analysis offers a time and cost-effective means of inspecting mining areas.

Repeated Airborne Laser Scanning for Analyzing Drought-Related Crown Dynamics in Mature Norway spruce

Alice Penanhoat, Langning Huo

Swedish University of Agricultural Sciences, Sweden

Repeated airborne laser scanning (ALS) offers new opportunities to quantify individual-tree structural dynamics over time. In this study, we analysed annual to multi-year changes in height growth and crown structure of mature Norway spruce in southern Sweden using repeated ALS acquired in 2016, 2017, 2019, and 2022. Individual trees were delineated from normalized point clouds, and tree height increment, maximum crown radial extension, crown projection area, and crown-boundary metrics were derived to evaluate temporal structural change. To interpret the results in relation to recent climate extremes, the observation period was divided into pre-drought (2016–2017), during-drought (2017–2019), and post-drought (2019–2022) phases, and structural changes were expressed on a yearly basis. Tree height increased significantly in all periods. Annualized median height growth was 24.9 cm yr⁻¹ before drought, 26.5 cm yr⁻¹ during drought, and 16.8 cm yr⁻¹ after drought. In contrast, annualized maximum crown radial expansion was limited before drought (2.4 cm yr⁻¹), peaked during drought (19.0 cm yr⁻¹), and was nearly absent after drought (0.17 cm yr⁻¹). Crown-boundary metrics further suggested an upward shift of the upper crown and a reorganization of the middle crown over time, although lower-crown estimates were more uncertain. Driver analyses showed that height growth was mainly related to tree size before and during drought, whereas post-drought growth became more influenced by stand competition. Overall, this study shows that repeated ALS can be a useful tool for analysing crown structural dynamics during and after drought, providing a promising basis for monitoring how tree architecture responds to stress.

Assessment model of the soil fertility potential of Yatsuda rice fields based on humus derived nitrog en balance using UAV hyperspectral sensor

Kyohei Sato¹, Shintaro Goto², Yuya Tanaka¹

The objective of this study is to build an assessment model of nitrogen balance from reservoir to valley fields linked to reservoirs by combining spatial information processing, chemical analysis and bioanalysis. To this end, the biomass of rice and weeds was detected from hyperspectral images mounted on UAVs, the growth processes of rice and weeds were understood with GIS, and the biomass of rice and weeds was estimated separately to estimate the amount of nitrogen fixation in rice.

This study showed the possibility of determining the distribution of humus from the distribution of carbon and nitrogen in a paddy field by using a UAV hyperspectral sensor, a random forest. Furthermore, by qualitatively assessing the contribution of soil micro-organisms to nitrogen fixation in rice using soil microbial diversity and activity values (BIOTREX), a model was constructed to enable an assessment of the nitrogen cycle derived from organic nitrogen supplied by the reservoir, and the nitrogen balance was estimated.

We showed that the nitrogen balance can be evaluated from the balance of soil-derived nitrogen and humus-derived edible nitrogen in reservoirs, rainfall, and rice paddies by chemical analysis. By combining this with BIOTREX, it was shown that when humus-derived edible nitrogen is high and BIOTRX values are high, the change of organic nitrogen in humus to inorganic nitrogen is promoted. It was shown that this could be used as an indicator of the need for fertilizer inputs and as a method for assessing the potential of agricultural land.

Integrating Multi-Source Agricultural Data with Machine Learning to Improve Crop Mapping Accuracy: A Case Study of the Navajo Nation

Varatharajaperumal Thangavel¹, Sudhagar Nagarajan¹, Madasamy Arockiasamy¹, Pias Kumar Biswas², George Sklivanitis¹

¹Florida Atlantic University, Florida, United States of America; ²Navajo Technical University, New Mexico, United States of America

Accurate crop maps are important for agricultural monitoring in water-limited regions because they provide spatial information for crop inventory assessment, land management, and resource planning. In the Navajo Nation, crop classification is challenging because agriculture is influenced by arid environmental conditions, limited water availability, and unevenly distributed cultivated land. This study evaluates a crop-classification workflow for a selected agricultural Region of Interest (ROI) within the Navajo Nation using Sentinel-2 imagery, the USDA/NASS Cropland Data Layer (CDL), and the CDL confidence layer in Google Earth Engine. High-confidence CDL pixels (confidence ≥ 95%) were used to construct pseudo-reference samples for the 2017 and 2022 growing seasons, and a 3 × 3 neighborhood homogeneity filter was applied to reduce local label uncertainty. Spectral predictors derived from Sentinel-2 imagery included the Normalized Difference Vegetation Index (NDVI), Enhanced Vegetation Index (EVI), Green Chlorophyll Vegetation Index (GCVI), and Land Surface Water Index (LSWI). A Random Forest classifier was implemented separately for each year using an 80% training and 20% testing split. The resulting classifications achieved overall accuracies of 87.30% for 2017 and 90.88% for 2022. These results show that confidence-screened CDL samples combined with multi-temporal Sentinel-2 features can support reliable crop classification within the selected ROI under limited reference-data conditions and provide a practical basis for agricultural monitoring in the Navajo Nation.

Remote Sensing and AI-Driven Sustainable Cotton Farming for a Resilient Future

Ahmad Ali Tabassam¹, Muhammad Shahzad Sarfraz², Saad Rafiq³, Muhammad Fayyaz²

¹National University of Computer and Emerging Sciences, Multan Campus, Pakistan; ²National University of Computer and Emerging Sciences, Chiniot-Faisalabad Campus, Pakistan; ³GIS LAB, Forestry and Wildlife Department, Govt. of Punjab, Lahore, Pakistan

The remote sensing (RS) and geographic information systems (GIS) technologies combined with artificial intelligence (AI) enable more efficient and sustainable agricultural ecosystems. In recent years, the use of the machine learning and the deep learning models trained over the geospatial data have emerged as a pivotal catalyst for sustainable and smart agriculture initiatives. The unmanned aerial vehicle (UAVs; drones) has become as a transformative force in the context of crop health monitoring, disease detection and yield predictions combined with supervised and unsupervised machine learning that helps to revolutionizing the motoring, deep analysis and timely decision making.

This study presents the integration of remote sensing technologies (e.g., UAVs, drones) combined with data-driven artificial technologies to help the farmers in precision agriculture for cotton framing for plant health monitoring against pest infestation, micro irrigation and crop yield prediction. A UAV-based dataset for cotton crops is prepared from a region in Pakistan. The prepared dataset is evaluated through multiple experiments using classical supervised machine learning algorithms for the classification; Naïve Bayes (NB), Support Vector Machine (SVM), and Random Forest (RF). These classification algorithms helped to classify the cotton crop health; healthy or unhealthy. The experimental results indicate that the RF algorithm outperforms the other applied machine learning methods, in terms of its accuracy and precision.

Mapping urban green spaces using an analysis of vegetation indices

Joel Aldrighettoni, Maria Grazia D'Urso

Department of Engineering and Applied Sciences, University of Bergamo

The research focuses on the use of advanced remote sensing techniques to fight the effects of climate change in urban areas, with particular reference to heat islands. The proposed methodology, applied to the metropolitan city of Naples, is based on the analysis of very high-resolution satellite images from the WorldView-3 constellation, combining panchromatic and multispectral data through pan-sharpening to obtain detailed maps of urban vegetation, including smaller green spaces such as flower beds, tree-lined avenues, private gardens, green roofs, which are often overlooked because they are difficult to map in a sustainable and widespread manner. Through the calculation of spectral indices (NDVI, MSAVI2, GNDVI, NDRE), the study has enabled not only the precise geolocation of photosynthetically active areas, but also the monitoring of their health status by comparing satellite datasets acquired in June and September 2023. The results highlight marked water stress during the summer period, manifested by a reduction in the average values of the indices. These results constitute a valuable decision-making tool for resilient urban planning and the implementation of Nature-based Solutions, and demonstrate the sustainability and replicability of the methodology in other territorial contexts.

Transferable Remote Sensing Prediction of Aboveground and Belowground Carbon Consumption from Boreal Wildfires across North America

Chinyere Ottah¹, Lucas Diaz², Zilong Zhong¹, Spencer Christensen³, Sander Veraverbeke^2,4, Alemu Gonsamo¹

¹School of Earth, Environment & Society, McMaster University, Hamilton, ON, Canada; ²Faculty of Science, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands; ³Faculty of Science, University of Amsterdam, Amsterdam, The Netherlands; ⁴School of Environmental Sciences, University of East Anglia, Norwich, United Kingdom

Accurate estimation of wildfire-driven carbon loss in boreal forests requires spatially explicit prediction of both aboveground and belowground combustion, yet most existing approaches remain region-specific and are rarely evaluated for transfer across fires or geographic domains. Here, we develop and evaluate a transferable modelling framework for plot-level aboveground and belowground carbon combustion using field observations linked to remotely sensed burn severity, vegetation structure, biomass, climate, terrain, peat occurrence, and fire-weather predictors.

Models were trained in western Canada and evaluated using fire-wise hold-out data within the training region and independent transfer domains in Alaska and Québec. To avoid optimistic performance estimates, all tuning and validation were conducted using grouped cross-validation at the fire-event level. Predictor formulations were defined a priori to represent alternative ecological hypotheses about combustion controls. Predictors included Canadian Fire Weather Index components (FFMC, DMC, DC, BUI and FWI), calculated using MODIS-derived burn dates and 7-day antecedent means. After recursive feature elimination, compact non-collinear predictor subsets were retained for modelling.

Predictive performance varied more strongly among predictor formulations than among model families, indicating that ecological representation exerts greater influence on transferable combustion modelling than algorithm choice. For aboveground combustion, the strongest model achieved R² = 0.31 and RMSE = 682.7 g C m⁻². Belowground combustion was more difficult to predict and was best represented by a climate-augmented nonlinear structure. Transfer to Alaska was weakest for both responses, and high-combustion observations were systematically underpredicted, highlighting uncertainty associated with rare extreme burns.

Spectral Unmixing and Design Requirements for a low-cost Crop Residue Cover Mapping Nanosatellite

Zoe Augspach^1,3, Andrei Akopian^1,3, Zara Graham^1,3, Fedor Pavlov^1,3, Ege Artan^1,2, Noam Tal-Siegel^1,3, Kemal Emirhan Uygun^1,2

¹Faculty of Arts and Science, University of Toronto, Toronto, Canada; ²University of Toronto Scarborough, University of Toronto Toronto, Canada; ³Space Systems, University of Toronto Aerospace Team, Toronto, Canada

FINCH is a student-led satellite mission whose novel sensor and cost effective form seek to provide crop residue mapping at a much lower cost and aid in smart-agriculture initiatives. To achieve this, crop residue must be accurately quantified using the limited reflectance range of 900 nm to 1700 nm. Hence, novel unmixing methods must be developed. Two datasets were evaluated: a laboratory-acquired dataset and a simulated, atmospherically propagated dataset. Multiple unmixing methods were tested, including Linear Regression, a Bayesian Linear Dirichlet model, K-Nearest Neighbors, Random Forest, and deep learning approaches such as a Multi-Layered Perceptron. Strong performance was achieved on the laboratory dataset, with the Multi-Layered Perceptron achieving an R2 for crop residue of 0.8436, total R2 of 0.8935, and an RMSE of 0.0909 when plotting true to predicted abundances, demonstrating the feasibility of accurate unmixing in controlled conditions. However, performance decreased substantially on the atmospherically propagated dataset, likely due to nonlinearities and other stark differences between datasets that limit transfer learning. These findings indicate that while the lab results are highly promising, additional atmospheric measurements and model adaptations are necessary to achieve full confidence in FINCH’s predictions. Further testing and validation will be critical to establish robustness and guide the development of operational unmixing methods for determination of optical design and imaging requirements.

Annual variability in phenological responses of natural vegetation in Indus river watershed of Ladakh

Kartik Sharma, Dr. Subrat Sharma, Dr. M Surabuddin Mondal

University of Ladakh, India

Understanding vegetation phenology in high-altitude regions is critical for assessing ecosystem responses to climate variability (Cleland et al., 2007). The Indus River Watershed in Ladakh (69,548 sq.km) spans elevations from 953m to 8,546m with diverse vegetation types adapted to extreme conditions. This study addresses the research question: How does vegetation phenology vary annually across 2018–2023? We employ satellite-based NDVI analysis to quantify phenological patterns, map spatiotemporal vegetation dynamics, and identify climate-driven changes in this data-sparse, high-altitude region.

LiDAR vs. SfM: Which is better for analysing habitat of the harvest mouse (Micromys minutus)?

Naoko Miura¹, Susumu Yamada²

¹The University of Tokyo; ²Tokyo University of Agriculture

The harvest mouse, Micromys minutus (Pallas, 1771) is the smallest rodent in Japan and now listed in the Red Data Books of Tokyo, 2 prefectural capitals, and 28 prefectures in Japan due to drastic decline of grasslands. For the harvest mouse, the height and density of the tall grass species where nesting occurs are considered particularly important. However, it has been difficult to continuously and extensively acquire information on the three-dimensional structure of herbaceous vegetation. With recent development of UAV technology, UAV data are beginning to be applied to the analysis of herbaceous vegetation. For acquiring three-dimensional information via UAV, methods include using LiDAR sensors or generating 3D point cloud data from aerial photographs using SfM. This study evaluates whether UAV LiDAR or UAV SfM is more suitable for estimating the height of tall grass species such as Japanese silver grass (Miscanthus sinensis), which serve as important nesting sites for the harvest mouse. As a result of analysis, the proposed method was found to be effective to estimate grass height regardless of whether UAV LiDAR or UAV SfM is used. However, when comparing the accuracy of canopy height estimation using UAV LiDAR data alone, UAV SfM data alone, and combined UAV LiDAR and SfM data, combined UAV LiDAR and SfM data found to perform best. Maximum canopy height was found to be best estimated using the combination of median of hand-measured five maximum canopy height values and maximum height calculated using the combined UAV LiDAR and SfM data.

Lights that Extinguish Nature in Protected Forests: A Look at the Impact of Light Pollution

David Freire-Granda^1,2, Sheyla Cevallos-Misco³, Gina Peña-Villacreses⁴, Julian Perez-Correa³, Julia Nieto³, Eder MS de Paula⁵, Andrés Velastegui-Montoya^2,6

¹Faculty of Mechanical Engineering and Production Sciences, ESPOL Polytechnic University; ²Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ³Faculty of Life Sciences, ESPOL Polytechnic University; ⁴Laboratório de Oceanografia Costeira e Estuarina, Instituto de Estudos Costeiros, Universidade Federal do Pará; ⁵Faculdade de Geografia, Belém, Universidade Federal do Pará; ⁶Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University

Light pollution is an emerging global environmental issue driven by the intensified use of artificial light at night, with emissions increasing by nearly 50% in the last decades. In Ecuador, the rapid urban and industrial expansion of Guayaquil has led to a significant rise in nighttime radiance, raising concerns about its effects on nearby protected forests such as Cerro Blanco, Papagayo, Estero Salado, and Prosperina.

This study evaluates the impact of light pollution on local flora and fauna using VIIRS nighttime satellite imagery for 2014, 2019, and 2024. Average radiance values were processed in Google Earth Engine and classified into low, medium, and high pollution levels. Species occurrence data from iNaturalist and GBIF were integrated to identify taxa exposed to elevated light levels.

Results reveal a marked increase in light radiance, especially in areas adjacent to urban growth. In Cerro Blanco, radiance has intensified since 2017, disrupting natural light–dark cycles. Nocturnal and endemic species (such as Engystomops guayaco and Sylvilagus dauleensis) were identified among the most exposed, with potential alterations in reproductive, foraging, and behavioural patterns.

The study demonstrates that artificial light is encroaching upon protected ecosystems, threatening biodiversity and compromising ecological processes. The findings underscore the urgent need for conservation strategies that reduce light emissions, promote sustainable lighting technologies, and preserve natural darkness in nocturnal habitats. This work provides critical insights for the management of protected areas in Guayaquil and contributes to the broader understanding of light pollution impacts in megadiverse regions.

Fusing Satellite Remote Sensing and Argo Float Data for Enhanced Monitoring of Microplastic Concentrations in the West Pacific (2018–2020)

Ting On Chan, Zhijie Chang, Wing Sum Yan

School of Geography and Planning, Sun Yat-sen University, China, People's Republic of

With the continuous intensification of marine plastic pollution, monitoring the transport and dispersion of microplastics has become a critical global concern. However, predicting microplastic concentrations remains highly challenging due to the lack of direct satellite signatures and the complex non-linear physical mechanisms governing their dispersion. This study develops an interpretable machine learning framework to monitor surface microplastic concentrations in the West Pacific from 2018 to 2020. We profoundly integrated Japanese AOMI in-situ microplastic observations, ERA5 meteorological/wave reanalysis, and Euro-Argo subsurface profile data utilizing a 3D Inverse Distance Weighting (3D-IDW) spatiotemporal interpolation algorithm. A Random Forest (RF) model was subsequently trained, achieving robust predictive accuracy (R² = 0.64, 0.76, and 0.87 for 2018, 2019, and 2020, respectively). Crucially, we incorporated SHapley Additive exPlanations (SHAP) to overcome the "black-box" limitations of traditional ensemble models. The SHAP analysis explicitly revealed a distinct, year-by-year regime shift in dominant environmental drivers: microplastic distribution was primarily governed by stable hydrographic and biological conditions in 2018; by dynamic wave forcing (e.g., long-period swells and Stokes drift) in 2019; and by extreme meteorological events (e.g., typhoon-induced terrestrial flushing) in 2020. Ultimately, this physics-informed framework successfully elucidates the dynamic transition of microplastic transport mechanisms between hydrographic–biological dominance and meteorological–physical forcing, providing vital scientific support for targeted pollution mitigation and coastal resilience planning.

Seasonal Variability between Major Air Pollutants and Physical Landscapes in the Greater Nairobi Metropolitan Region

Patricia Mwangi

Kenyatta University, Kenya

Satellite data is crucial in regions lacking ground monitoring stations and is helpful in identifying areas likely to have high concentrations of pollutants harmful to human health. As these cities expand and grow, the quality of life and conditions will also be changing. The study seeks to determine the correlation between land surface temperature (LST), elevation, enhanced vegetation index (EVI), rainfall, particulate matter (PM2.5), nitrogen dioxide (NO2) and ozone (O3) both day and night during the months of October-December, January-February, June-August in the year 2019, 2020, 2024 and 2025. The results indicate a varied strength in relationship between variables in each season, day and night. During the day the highest negative correlation is obtained between elevation and carbon monoxide, while during the night the highest negative correlation is obtained between elevation and LST in all periods analysed. Results from the study thus indicate that it is critical to study the spatial and temporal variations of aerosols and temperature over varied conditions in a geographical region.

Geospatial Exploration of Urban Heat Island Behaviour and Thermal Discomfort Patterns in Pune

Vinayak Bhanage^1,2, Han Lee^1,2,3

¹1Transdisciplinary Science and Engineering Program, Graduate School of Advanced Science and Engineering, Hiroshima University, Hiroshima 739-8529, Japan; ²Centre for Planetary Health and Innovation Science (PHIS), The IDEC Institute, Hiroshima University, Hiroshima 739-8529, Japan; ³Smart Energy, Graduate School for Innovation and Practice for Smart Society, Hiroshima University, Hiroshima 739-8529, Japan

This study analyses the spatio-temporal dynamics of urban heat island (UHI) in Pune, focusing on the impact of Land use land cover (LULC) changes on the thermal environment. Using satellite imagery from the summer and winter seasons of 2015 and 2024, LULC, land surface temperature (LST), and geospatial indices were analysed at citywide and ward levels. Results indicate that from 2015 to 2024, the urban area increased by 12.77 km2, with the highest urbanization over Hadapsar. Between 2015 and 2024, Pune's mean LST and UHI increased by 8.18°C and 2.65°C in summer but dropped by 5.19°C and 0.54°C in winter. At a ward scale, during both seasons, the highest alterations in LST (UHI) were experienced at Sangam wadi ward. Among geospatial indices, NDMI was the most significant regulating LST across both seasons and years. Ward-level analysis for 2024 shows that a 1% rise in latent heat can lower UHI by 0.3°C in summer and 1°C in winter. Human thermal discomfort in the city is in the less comfortable category, with wards like Sangam wadi showing an increased discomfort across both seasons. The outcomes of this research can serve as a basis for decision-making to improve the resilience and sustainability of the region.

Mapping the 21st-century Global Wetland Dynamics by Seamless Data Cube and Deep Learning

Shuai Yuan¹, Jie Wang², Peng Gong¹

¹Dept. of Geography, The University of Hong Kong, Hong Kong, China; ²Pengcheng Laboratory, Shenzhen, China

Wetlands are among the most dynamic and ecologically important ecosystems, yet they remain one of the least temporally monitored environments globally. Existing wetland datasets provide only static or low-frequency snapshots, making it impossible to track rapid hydrological fluctuations, disturbance events, and long-term degradation processes. To bridge this gap, we introduce GWD30, the first-ever global wetland dynamics dataset with near-daily temporal frequency (4-day interval) and 30-m spatial resolution, covering the period 2000–2024.

GWD30 is generated using a seamless data cube and a dynamic sample generation approach that converts static training labels into full time-series dynamic labels using temporal–spectral pattern embedding. A two-stage classification system combining machine learning and knowledge-guided refinement produces a globally consistent wetland taxonomy with 14 detailed classes.

This dataset enables unprecedented monitoring of wetland ecosystem behaviour across regions, timescales, and climate zones. GWD30 opens new opportunities for ecological modelling, biodiversity monitoring, hydrological analysis, climate research, and global conservation planning.

High-fidelity Planetary Simulation Environment for Rover Evaluation

Long Chen¹, Dong Li², Ruilin Wang¹, Dujun Nie¹, Xiaotong Zhang¹, Yuchen Li³

¹Institute of Automation, Chinese Academy of Sciences, Beijing, China; ²WAYTOUS Inc., Beijing, China; ³Department of Information, Technische Universit¨at M¨unchen, Munich, Germany

Deep-space exploration depends heavily on remote sensing as its primary data source, with the Moon and Mars serving as the main targets for scientific investigation and future human expansion. In these harsh planetary environments, rovers have become the essential platforms for surface exploration and sample acquisition. To support the development of next-generation rover systems, high-fidelity simulation environments are crucial. They enable safe, efficient, and repeatable testing of rover mechanics, perception, localization, and mapping algorithms under realistic planetary conditions, reducing mission risks and accelerating system development. This paper provides a comprehensive comparative analysis of existing lunar and Martian simulation environments, assessing them in terms of scene fidelity, rendering engines, supported robotic platforms, and intended application tasks. Building on this analysis, we introduce a generalized and reproducible workflow for constructing high-fidelity planetary simulation environments grounded in authentic remote sensing data products. Finally, we demonstrate the fidelity and practical utility of a state-of-the-art planetary simulation environment through a set of targeted validation experiments, followed by a discussion of key findings and future directions for the development of next-generation planetary simulation platforms.

An Online Semantically-Rich 3D Information System for Collaborative Exploration of Planetary Surfaces

Sau Yuen Chan, Bo Wu, Qichen Fan, Jiapei Hu, Ran Duan

Poly U, Hong Kong S.A.R. (China)

The ability to perceptually interpret complex planetary surface environments is essential for successful robotic or crewed exploration. In this research, we present an online semantically-rich 3D information system that offers an immersive, high-fidelity simulation environment, accurately reproducing lighting and terrain conditions to support multi-disciplinary investigation of planetary surfaces. Built for both desktop and VR environments, it allows users to transition from conventional desktop analysis to fully immersive exploration, where spatial perception and cognitive engagement are significantly enhanced. Using candidate landing sites at the lunar south pole as case studies, we evaluate the performance of the proposed online semantically-rich 3D information system. Preliminary results indicate that the system enables users to interpret complex surface data more efficiently and intuitively than conventional observation methods.

DNN-Based Lichen Mapping Using AVIRIS-NG Hyperspectral Imagery and UAV Images in a Rocky Canadian Shield Landscape

Shahab Jozdani¹, Dongmei Chen¹, Robert. H. Fraser², Wenjun Chen²

¹Department of Geography and Planning, Queen's University, Canada; ²Canada Centre for Remote Sensing, Natural Resources Canada

Forage lichen fractional cover mapping using multi-spectral remote sensing (RS) data is challenging, especially over rocky landscapes where there is a high spectral correlation between lichens and non-lichen features. Given this, it is deemed that the use of airborne or satellite hyperspectral imagery may improve lichen mapping. In this study, we report the first results of using AVIRIS-NG hyperspectral imagery and UAV images to estimate forage lichen fractional cover (Cladonia spp.) in a rocky Canadian shield landscape where non-lichen bright features were prevalent. To estimate forage lichen fractional cover, we conducted a regression approach based on deep multi-layer perceptron (MLP) models whose number of hidden layers and neurons were determined using exhaustive grid search procedures. The three MLP models were trained and tested on four scenarios with different hyperspectral compression AVIRIS-NG band images and WorldView-3 (WV3) data of three sites. Our experiments showed that mapping lichen fractional cover using the 5 m AVIRIS-NG surface reflectance imagery was more accurate (i.e., higher R2 and lower RMSE values) than the one using a 4-band WorldView-3 (WV3) image with a spatial resolution of 2 m in most cases.

Toward early warning of tailings dam failures through InSAR, surface moisture, and deep learning: insights from the Brumadinho disaster

Ramin Farhadiani, Sayyed Mohammad Javad Mirzadeh, Saeid Homayouni

Institut national de la recherche scientifique (INRS), Québec, QC, Canada

The catastrophic failure of the Córrego do Feijão Tailings Dam I in Minas Gerais, Brazil, on January 25, 2019, resulted in approximately 270 fatalities, underscoring the potential risks posed by tailings dams and the necessity for stringent monitoring of these structures. We utilized Sentinel-1 SAR data to generate deformation time series via Interferometric Synthetic Aperture Radar (InSAR) and to retrieve surface soil moisture (SSM), enabling pre- and post-failure analyses. InSAR analysis revealed significant pre-failure deformation within the tailings impoundment behind the dam crest, with a line-of-sight velocity reaching −69 mm/yr. In contrast, the post-failure period showed no significant ongoing deformation, indicating a relative stabilization. SSM showed a progressive increase in near-surface moisture, peaking on January 22, 2019, three days before the collapse. Although increased near-surface moisture alone cannot be a sign for liquefaction, continuous saturation might foster conditions prone to failure. We also proposed a spatiotemporal Graph Attention Network–Gated Recurrent Unit (GAT-GRU) technique to predict deformation time series derived from InSAR. The proposed GAT-GRU technique exhibited efficacy in predicting deformation trends by modeling spatial and temporal dependencies within the InSAR-derived time series. Overall, this study emphasizes the potential of InSAR, soil moisture analysis, and predictive models as reliable and complementary tools for managing tailings dam safety.

Mapping wildfires in seconds

Simon Jones¹, Karin Reinke¹, Konstantinos Chatzopoulos Vouzoglanis¹, Nur Trihantoro¹, James Hilton²

¹RMIT University, Australia; ²Covey Pty Ltd

This paper presents a method and results for detecting, mapping and modelling the progression of wildfire in Australasia, Europe and North America within seconds. Initial detections are achieved using the BRIGHT algorithm (Engel et al., 2020, 2021). BRIGHT uses 10-minute Geostationary satellite observations, to dynamically threshold the satellite observation time stamp comparing it to the 28-day bioregion average at each respective timestamp. This produces a wildfire location (hotspot) and an estimate of FRP (Fire Radiative Power), Engel et al., 2022; Chatzopoulos-Vouzoglanis et al., 2022, within 20-45 seconds. Once detected, grouped fire locations are passed onto a fire behaviour simulator Spark / Inferno (Miller et al., 2015), to deliver a comprehensive bushfire analytics model framework which predicts fire behaviour.

At present this product is available in real time for Australia and is available on-demand for Europe and North America.

Mapping urban flood risk under the combined effects of climate change and urbanization

Qinke Sun^1,2, Xianglong Yang¹, Yabin Zhang¹, Liang Zhou¹

¹Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou, 730070, China; ²Department of Coastal and Urban Risk & Resilience, IHE Delft Institute for Water Education, Delft 2601DA, the Netherlands

Low-lying and densely populated coastal cities are not only crucial areas for human survival and rapid development but also highly vulnerable regions sensitive to climate change. In recent years, tropical cyclone-induced flooding has emerged as a major hazard threatening the sustainable development of coastal cities. At the same time, rapid land use changes in these urban areas have significantly altered the original landscape structures and land use patterns, becoming key drivers of escalating flood risks. Therefore, when mapping and assessing urban flood risks, it is essential to comprehensively account for the combined effects of climate change and urbanization. This study uses Shanghai, a typical coastal city, as a case study to propose an integrated framework for simulating and evaluating coastal flood hazards while incorporating land use changes. The framework realizes the numerical simulation of flood disasters in coastal cities based on physical processes by coupling the SFINCS fast flood inundation model, the land use change model and the Delft 3D storm surge numerical nested model. The results indicate that by 2100, urban land use changes will expand the inundation area of a 1,000-year tropical cyclone flood by 4.91% to 34.00%. Neglecting future urban land use changes would underestimate the inundation extent of storm surges. Moreover, the findings highlight the critical need to account for the long-term impacts of land use changes on urban flood risks in coastal areas. The proposed methodology is applicable to coastal regions worldwide that are susceptible to tropical cyclones.

Exploratory Study on Using Deep Learning for Monitoring Vertical Ground Displacement

Felix Enyimah Toffah^1,2, Ruben Cartuyvels³, Patrick Ebel³, Francesco Pirotti^2,4

¹Department of Civil, Building and Environmental Engineering, Sapienza University of Rome, Via Eudossiana 18, 00184 Rome, Italy; ²Interdepartmental Research Centre of Geomatics (CIRGEO), University of Padova, Viale dell’Università 16, 35020 Legnaro, Italy; ³ϕ - lab of the European Space Agency, Via Galileo Galilei, 1, 00044 Frascati, Italy; ⁴Department of Land, Environment and Agro-Forestry (TESAF), University of Padova, Viale dell’Università 16, 35020 Legnaro, Italy

Advances in artificial intelligence have opened new frontiers in Earth observation, particularly in modeling complex geodynamic phenomena such as Vertical Ground Displacement (VGD). VGD is driven by numerous environmental and hydro-climatic factors, making prediction inherently challenging. This study develops a novel CNN-ConvLSTM hybrid deep-learning architecture that seamlessly integrates static soil characteristics and dynamic spatio-temporal features to predict VGD across the Italian territory. The model achieved an R2 value of 0.59 and a Mean Absolute Error (MAE) of 4.35 mm on the validation dataset, effectively capturing approximately 60% of the VGD variations. Additionally, Explainable AI (XAI) using SHAP (SHapley Additive exPlanations) values was incorporated to interpret the model's predictions. The analysis confirms that while hydro-climatic factors (such as drought and temperature) are the primary drivers of VGD temporal variability, static soil properties (including bulk density and volumetric water content) are the most globally influential predictors, dictating the overall spatial susceptibility of the medium. These findings provide a framework for identifying the key environmental drivers of VGD, which is essential for resource allocation, hazard management, and the development of effective early warning systems in geologically sensitive regions.

Hydrological Modelling and Flood Vulnerability Assessment of the Yola South Watershed Using GIS and HEC-HMS

Temitope Paul, Adedayo Adeleke

University Pretoria, South Africa

This study presents an integrated GIS-based and hydrological modelling assessment of flood vulnerability in the Yola South watershed of Adamawa State, Nigeria—an area experiencing increasingly severe flood events due to rapid urban expansion, land degradation, and intense rainfall. Using high-resolution spatial datasets, the watershed was delineated from a 30-m DEM, and land-use and soil information were utilized to compute Curve Numbers (CN) using the SCS-CN method. A composite CN of 65.33 was derived, indicating moderate infiltration capacity and substantial susceptibility to runoff generation during heavy storms.

The HEC-HMS hydrological model was used to simulate the July 31, 2025, rainfall event across delineated sub-basins. Model outputs revealed peak discharges ranging from 5.3 to 6.0 m³/s, direct runoff volumes of approximately 19 mm, and lag times between 205 and 266 minutes. Sub-basins with increased imperviousness and exposed soils generated faster and higher runoff responses, identifying hydrological hotspots that contribute disproportionately to downstream flooding.

The study demonstrates the utility of combining GIS with HEC-HMS simulation to evaluate watershed behaviour under current land-use conditions. Findings provide actionable guidance for flood risk planning, including targeted drainage improvements, land-use regulation, and nature-based solutions such as vegetation restoration. This research highlights the value of geospatial technologies in supporting climate resilience and aligns with ISPRS priorities on sustainable environmental management and hazard mitigation

Landslide Dating and Activity Mapping using Transformer-Based Multi-Sensor Time-Series Framework

Tsung-Han Wen, Tee-Ann Teo

National Yang Ming Chiao Tung University, Chinese Taipei

This study presents a Transformer-based framework for estimating landslide occurrence dates using fused Sentinel-1 SAR and Sentinel-2 optical time-series data. By leveraging multivariate temporal features and long-range attention, the model substantially improves dating accuracy compared with single-sensor methods, with over 80% of events dated within a 0–15-day offset. The derived occurrence dates enable the creation of landslide activity maps at daily temporal resolution, offering a major advancement over conventional annual assessments. The resulting landslide activity index highlights spatial and temporal variations in slope activity, supporting more precise identification of highly dynamic landslides. The framework offers a valuable tool for monitoring slope hazards and enhancing landslide risk assessment at regional scales.

A Multi-Temporal SAR–DEM Integrated Framework for Flood Dynamics Assessment and Recurrent Flood-Zone Identification in Sri Lanka

Sasitha Supun Abeysinghe, Sanka N. Perera

Department of Remote Sensing and GIS, Faculty of Geomatics, Sabaragamuwa University of Sri Lanka

Floods remain one of the most frequent and disruptive natural hazards in Sri Lanka, particularly within low-lying monsoon-driven basins where cloud-covered conditions hinder optical monitoring. This study develops a multi-temporal flood-mapping framework that integrates Sentinel-1 C-band SAR data with DEM-derived terrain information to assess flood dynamics and identify recurrent inundation zones in the Attanagalu Oya Basin. Three major flood events in 2016, 2017, and 2018 were analysed using pre- and peak-flood SAR acquisitions processed through a standard workflow of orbit correction, radiometric calibration, speckle filtering, and terrain correction. Threshold-based segmentation of VV backscatter (−15 to −13 dB) was applied to delineate inundation, followed by hydrologically guided refinement using slope (<1–3°) and elevation constraints to reduce false positives associated with built-up areas, vegetation, and radar shadow. The results illustrate distinct spatial variations across the three years, with 2016 showing the most extensive inundation and 2018 presenting spatially concentrated flooding. DEM integration significantly improved classification accuracy by eliminating physically implausible detections. Validation against the Survey Department–Sri Lanka Navy inundation map for 2016 produced a spatial agreement of 72.18%, demonstrating the reliability of the SAR–DEM fusion approach. Multi-year overlay of flood layers revealed a persistent high-risk corridor stretching from Gampaha to Katunayake, reflecting entrenched drainage limitations and ongoing floodplain encroachment. The proposed framework provides an operational, scalable method for flood monitoring in cloud-prone environments and offers essential insights for risk-sensitive land-use planning, hazard zoning, and infrastructure design. The approach also forms a basis for future automated, machine-learning–enhanced flood early-warning systems.

Spatiotemporal trends of extreme precipitation in Caraguatatuba (SE Brazil) from CHIRPS data (1981–2024) using GEE and climate indices

Yasmim Carvalho Guimarães¹, Andrés Velastegui-Montoya², Gabriela Alves Carreiro¹, Luana Albertani Pampuch Bortolozo^1,3, Enner Alcântara^1,3

¹Graduate Program in Natural Disasters (UNESP/CEMADEN), São José dos Campos, Brazil; ²Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University, Guayaquil, Ecuador; ³Institute of Science and Technology, Environmental Eng. Dept., Unesp, São José dos Campos, Brazil

Climate change has intensified the variability and frequency of extreme events, particularly affecting vulnerable coastal urban

areas. In Caraguatatuba, located on the north coast of S˜ao Paulo, the impacts are exacerbated by social inequalities and inadequate

infrastructure. However, detailed analyses of climate indices in the region are still scarce. This study analyzes precipitation patterns

in Caraguatatuba, calculating climate indices to identify trends and support risk mitigation. CHIRPS daily precipitation data at

0.05° spatial resolution were processed using Google Earth Engine and R to compute the ETCCDI indices PRCPTOT, CDD, CWD,

R95p, R99p, Rx1day, and Rx5day. Statistical tests were applied to detect significant trends. The results indicate an increase in

consecutive dry periods (CDD, p=0.0065) and at the intensity of daily extreme rainfall (Rx1day, p=0.0044) in the southwestern

area, while other indices did not show significant trends. These findings highlight the city’s increasing climate vulnerability and the

urgent need for adaptation strategies. By offering a replicable framework based on open-access remote sensing and cloud platforms,

this study supports policy development to enhance urban resilience and monitoring climate-related disasters in coastal cities.

Geospatial Assessment for Agricultural Drought Management in the Semi-arid Regions of Southern India using Remote Sensing Time-series Data

Suresh Mondal, Arun Prasad K

Department of Geography, School of Earth Sciences, Central University of Tamil Nadu, Tamil Nadu, India

Agricultural drought is a recurring and complex hazard that significantly impacts food security, water resources, and rural livelihoods, particularly in vulnerable regions such as the southern agro-climatic region of Tamil Nadu, India. Over the past few decades, the region has experienced numerous drought-related phenomena. In this context, the primary objective of this study is to monitor the dynamics of agricultural drought across the region between 2015 and 2023, with the aim of promoting sustainable agricultural management and climate-resilient practices.

MODIS NDVI and LST products were utilized to derive drought-related indices. Rainfall data from CHIRPS and temperature data from TerraClimate (1982–2023) were used to calculate the SPEI, enabling the identification of representative dry and wet years. The analysis uses the Normalized Vegetation Supply Water Index (NVSWI), derived from remote-sensing time-series datasets. Finally, a correlation analysis was conducted between monthly NVSWI, one-month SPEI, and VHI during the primary growing season of Rabi crops (October to December).

Results reveal that NVSWI identified spatial and temporal patterns of acute water stress in vegetation. Ramanathapuram, Sivaganga, Virudhunagar, Thoothukudi, and Pudukkottai were consistently drought-prone, experiencing moderate to severe drought intensity in multiple years. In 2021, drought conditions were minimal. A strong positive correlation was observed between monthly NVSWI and both SPEI and VHI, confirming its suitability for drought monitoring.

The findings highlight the effectiveness of NVSWI and multi-source satellite data for drought detection, supporting the development of early warning systems and climate-resilient agricultural planning in drought-prone regions.

Analysing vulnerable GLOF sites in High Mountain Asia using geospatial techniques for disaster early warning: Northern Pakistan

Syeda Saleha Fatim Ali¹, Maria Ali¹, Raja Umair Arshad², Syed Waqar Hussain Shah¹, Muhammad Bilavel Raza³

¹Institute of Space Technology, Islamabad, Pakistan; ²COMSATS University, Abbottābād, Pakistan; ³University of Bremen, Germany

The deglaciation due to global warming and changing climate has given rise to the formation and expansion of numerous glacial lakes, particularly in the High Mountain Asia region. Many of these glacial lakes are susceptible to experiencing Glacial Lake Outburst Floods (GLOFs) events which can release millions of cubic meters of water and debris, leading to widespread impacts on lives, property, infrastructure, agriculture and livelihoods amongst remote downstream communities. The research investigates the potential of multi-source data, focusing on District Chitral in Northern Pakistan, with elevated potential implications for GLOF and associated risk. A total of 12 vulnerable sites are identified, out of which 5 are highly susceptible to GLOF. A spatio-temporal analysis of the vulnerable sites have been carried out in Google Earth Engine (GEE). The maps were generated in the GIS environment of ArcMap considering key contributing factors with high impact potential including, lake area change, elevation, slope, aspect, temperature and precipitation, LULC, change in snow and glacier cover area, distance from fault line, and proximity to impact area, among others. A pronounced decline in the snow and glacial cover, and an increase in land surface temperature (LST) retrieved from satellite data could be responsible snow/glacial melting resulting to higher frequency of GLOFs and flash floods. The potential implications on population, infrastructure, schools, forest and agriculture, and water quality of Chitral have been estimated. The findings are of great significance for policymakers and disaster management authorities, providing valuable insights to formulate efficient and effective measures for mitigating the risks.

Analysing Surface Dynamics and Polynomial Trend Patterns: A Case Study from the HKH Region, Northern Pakistan

Shaiza Farooq¹, Dr. Syed Amer Mahmood²

¹University of Bayreuth, Germany; ²Institute of space science, university of the punjab, Lahore

Remote sensing and GIS-based geomorphometric mapping are powerful tools for analyzing neotectonic activity. This study focuses on the Nanga Parbat Syntax (NPS) and its adjoining regions, among the fastest uplifting zones of the Himalayas, rising at 8–10 mm/year. Using SRTM DEM, Trend Analysis of Polynomial Surfaces (TAPS), Local Base Level (LBL), and Vertical Dissection (VD) maps were produced to interpret surface dynamics. The study area, located along the Main Mantle Thrust (MMT) and below the Main Karakoram Thrust (MKT) in Gilgit-Baltistan, encompasses five key geomorphometric zones—two defined by drainage dissection, two by relative relief, and the expansive, relatively flat Deosai plateau in the southeast with prominent VD signatures. A residual elevation map was derived by subtracting a 12th-order polynomial trend surface from the DEM, highlighting spatial elevation anomalies. This trend surface effectively captures the NE–SW and NW–SE uplift patterns across the Nanga Parbat Haramosh Massif Zone (NPHMZ). Elevated anomalies align with the Sassi Raikot Fault Zone (SRFZ), NPHMZ, MKT, and northwest of Jaglot toward the Hindukush. In contrast, areas such as Deosai Plateau, Skardu, Kachura, Gorikot (Astore Valley), Jaglot, and Gunar exhibit negative elevation anomalies. LBL maps generated from 2nd- and 3rd-order Strahler streams yielded insightful correlations with tectonic structures. Both isobase and VD analyses indicate significant dissection and elevation in regions near the NPHMZ, MKT, and upper Astore Valley. The spatial alignment of high residuals with these structural features underscores active tectonic uplift, reaffirming ongoing neotectonic processes across the NPHMZ and its surroundings.

Integrating Machine Learning and Classification methods for Wildland Fire Danger Mapping

Saeideh Sahebi Vayghan, Tarmo K. Remmel

Faculty of Environmental and Urban Change, York University, Toronto, Ontario, M3J 1P3

The frequency of wildland fires are increasing due to warmer and drier conditions resulting from climate change. Identifying fire prone areas is essential for planning and mitigating potential impacts. This study aims to create a wildland fire danger map using Random Forest (RF) and competing classification methods for Ontario’s Managed Forest (MF). The critical role of the classification method in wildland fire danger mapping motivated us to evaluate and compare the effectiveness of three classification methods, including Natural Break, Geometric Interval, and Standard Deviation Interval. A total of 42 key static and dynamic variables were analyzed, covering the period from 2020 to 2022. The static variables: distance from roads, railways, settlements, rivers, and water bodies and topographic features like elevation and various indices derived from the NASA Digital Elevation Model (NASADEM). To capture these dynamic environmental conditions, several environmental variables and indices as well as key meteorological parameters were incorporated into the modelling from MODIS and ERA5 land. We used Recursive Feature Elimination and Cross-Validation (RFECV) to select optimize features for the model. To address the opacity inherent in machine learning models, SHapley Additive exPlanations (SHAP) were utilized to quantify the marginal contribution of each variable to the predicted distance from fire. Our results showed that the GI classifier provided the most consistent and well-balanced performance and reliable predictions across all evaluation metrics. The resulting fire danger map highlights high-risk areas, supporting targeted management, prevention, and resource allocation to reduce future wildfire impacts.

Assessing hydrometeorological disaster impacts through spectral change detection: Insights from the 2025 flash flood in Dharali, Uttarkashi

Harshita Tiwari, Aditya Kumar Thakur, Rahul Dev Garg

Indian Institute of Technology Roorkee, Haridwar, India, India

Dharali, a Himalayan village in Uttarkashi, Uttarakhand, lies along the narrow Kheer Ganga valley, a terrain marked by steep slopes, high relief, and complex topography- conditions that render it highly susceptible to geomorphological and hydrological hazards. On August 5, 2025, a catastrophic flash flood and debris flow, triggered by a sudden cloudburst or upstream slope failure, caused extensive destruction across 0.54 km² of the settlement. Despite the increasing frequency of such high-magnitude events in the Himalaya, quantitative assessments of recent localized geomorphic and hydrological impacts remain limited, particularly in small, high-altitude villages like Dharali. This study addresses that gap by employing high-resolution PlanetScope imagery (3 m) from pre-event (July 19, 2025) and post-event (August 22, 2025) periods to detect and quantify surface alterations using spectral thresholding and spatial change detection methods. The analysis revealed pronounced spectral shifts, a 39% increase in surface water extent, and topography-driven hydrological redistribution. The statistical association (𝜑 = 0.35; ϗ = 0.34) indicates moderate spatial agreement between the temporal datasets. The findings demonstrate the utility of fine-resolution satellite data in capturing rapid, small-scale landscape transformations and emphasize the urgent need for systematic, event-based monitoring frameworks to improve disaster preparedness and resilience in fragile Himalayan environments.

PICANTEO: A Modular Change Detection Framework for Remote Sensing Applications

Dimitri Lallement¹, Christian Hümmer^1,2, David Youssefi¹, Paul Mauduit³, Agnès Verzeni¹

¹CNES (French Space Agency); ²DLR (German Aerospace Center); ³Thales Services SAS

This paper presents PICANTEO, a modular and multi-modal change detection framework designed for remote sensing applications in natural disaster response. The framework aims to support damage assessment during both the rapid mapping phase, which occurs in the immediate aftermath of a disaster, and the longer recovery phase. PICANTEO provides automated, reliable disaster-related change detection maps and associated impacted areas to support a wide range of disaster monitoring activities. The integration of uncertainty and ambiguity concepts ensures reliable and qualified results.

PICANTEO handles multi-modal remote sensing data, including very high-resolution optical imagery, Digital Surface Models, and Synthetic Aperture Radar (SAR) data. Its modular architecture enables users to apply ready-to-use pipelines or implement their own workflows. The provided scalable components can be combined or extended by custom methods to define new applied pipelines. Several real-world case studies demonstrate PICANTEO’s ability to address various disaster scenarios across diverse geographic contexts. Source code is available at: https://github.com/CNES/picanteo.

Integrated Coastal Vulnerability Index (ICVI) for Kerala state, India using Multi-criteria Spatial analysis approaches

Chinni V Naga Kumar Kommireddi, Noobiya C, Naveena K, Venu Prasad H D

Centre for Water Resources Development and Management (CWRDM), India

Coastal regions are the foci of intense economic activity, but, these dynamic and ecologically sensitive low-lying lands are increasingly threatened due to climate change induced eustatic sea level rise, and anthropogenic activities at regional and local scale leading to relative sea-level rise, thereby necessitating to understand the vulnerability of a coast for their protection and sustainable development. In this background, the present study is focused along the densely populated ~590 km long coastal stretch of Kerala state, India to build an Integrated Coastal Vulnerability Index (ICVI) by using the i) physical vulnerability index (PVI) variables such as a) Geomorphology, b) Coastal Slope, c) Bathymetry, d) Shoreline change history, e) Spring tide range, and f) Significant wave height, and ii) socio-economic vulnerability index (SVI) variables like a) population density, b) land use/land cover, c) number of household, d) fisher-folk population density, e) literacy rate, f) occupation, g) road density, h) railway network and i) tourist spots integrated in Geographic Information System (GIS) environment, and through Analytic Hierarchy Process (APH) technique. Kerala state is located on the south-western margin of the Indian Peninsula, with nine administrative districts i.e., Kasargod, Kannur, Kozhikode, Malappuram, Thrissur, Ernakulam, Allapuzha, Kollam and Thiruvananthapuram districts (from North to South). The Integrated Coastal Vulnerability Index (ICVI) along the Kerala coast revealed that 96.82 km (16.4%) under very low vulnerability, 105 km (18%) under low vulnerability, 145.75 km (24.6%) under moderately vulnerable, 123 km (20.9%) under highly vulnerable and the remaining 119.27 km (20.2) under very highly vulnerable.

Post-Fire Urban Runoff Assessment in a Mediterranean Basin Using Integrated UAV–SWMM–HEC-RAS Modelling

Alaa Kadora¹, Anna Brook², Lea Wittenberg³

¹Spectroscopy and Remote Sensing Laboratory, School of Environmental Science, University of Haifa, Israel; ²Spectroscopy and Remote Sensing Laboratory, School of Environmental Science, University of Haifa, Israel; ³School of Environmental Science, University of Haifa, Israel; ⁴Haifa university, Israel

This study examines how wildfire disturbance changes urban runoff behaviour in a small Mediterranean basin. The research uses high-resolution UAV images together with GIS processing to create detailed surface models and land-cover maps after the fire event. These spatial datasets were then integrated with SWMM and HEC-RAS 2D to simulate different rainfall scenarios and to understand how the basin responds during storms.

The results show that burned areas have faster runoff, higher peak flow, and stronger surface connectivity even under moderate rainfall. The modelling framework was able to map sensitive zones where water tends to accumulate and where the risk of local flooding increases. The study also demonstrates that UAV-based photogrammetry can improve hydrological and hydraulic simulations by providing more accurate information on terrain shape and surface conditions.

Overall, this contribution presents a practical workflow that combines remote sensing and process-based modelling to support flood-risk assessment in fire-affected urban environments. The approach is suitable for other regions with similar challenges and contributes to ISPRS goals of using geospatial technologies for climate-resilient and sustainable water-management planning.

Elevation accuracy assessment of typical areas in Oceania based on ICESat-2ATLAS data

Jing Guo, Haipeng Chen, Hongjing Tu, Bingquan Yao, Chang Li, Haolin Liu

National Quality Inspection and Testing Center for Surveying and Mapping Products, China, People's Republic of

Digital elevation models are the primary data used in remote sensing and geographic information systems (GIS) for terrain analysis and three-dimensional spatial data processing [1]. In surveying and mapping investigations, SRTM1 (Shuttle Radar Topography Mission) DEM, ASTER GDEMV3 (Advanced Spaceborne Thermal Emission and Reflection Radiometer Global Digital Elevation Model), and AW3D30 (ALOS World 3D-30m) DEM have become key data sources. This study compares the elevation accuracy of three open-source DEM datasets to high-precision ICESat-2/ATLAS altimetry data. GIS statistical analysis, error correlation analysis, and mathematical statistical approaches are used to compare elevation accuracy in DEM.

Four error assessment metrics are utilized: Mean Error (ME), Standard Deviation of Error (SDE), Root Mean Square Error (RMSE), and Correlation Coefficient (CORR). Furthermore, error correlation analysis is conducted to visually characterize the spatial distribution patterns and error features of the three open-source DEMs in relation to ICESat-2/ATL08 observations.

The AW3D30 DEM has the highest accuracy in plains, with the SRTM1 DEM coming in second. In mountainous terrain, SRTM1 DEM was the most accurate, followed by AW3D30 DEM. Although ASTER GDEMV3 fared less well in the two study locations listed above than previous studies in plateau regions, its accuracy in mountainous and plateau areas is comparable to that of SRTM1 DEM and AW3D30 DEM.The RMSE for all three DEM datasets is roughly 15 m in wooded mountainous regions, around 5 m in artificial surfaces and barren areas, and exhibits the greatest inaccuracy in forested and grassland regions on plains, with the least error occurring in wetlands.

The advantage of reflectance measurements in radiometric adjustment of aerial imagery

Susanne Scholz, Marc Muick

Vexcel Imaging GmbH, Austria

The radiometric adjustment of aerial imagery is a process of very high importance considering the influences this step can generate not only on the look of the image data (white balance), but even more importantly on derived information like indices (NDVI). In comparison to the Aerial Triangulation where it is relatively straight forward to set up thresholds that need to be met to achieve a high-quality result, the world of radiometric adjustment is dramatically different. There is no single standard or guideline that dictates what a high-quality radiometric result will look like. Apart from these challenges there is also a rather big gap between the rich and longstanding academic work done in the field of radiometry and actual application in real-life projects. The by far biggest discrepancy is the usage of single images, especially when dealing with absolute radiometry approaches versus multiple thousand color balanced images in a single block in an actual production environment. In this paper we present the advantages of utilizing reflectance measurements as a method to stabilize radiometric adjustments, as well as utilizing them as anchor to create indices like the NDVI that actually correspond to the value range given by literature.

Techniques and methods of product quality inspection Urban Spatial Monitoring

Wenjuan Mao, Haitao Zhao, Bingquan YAO, Jin Zhou, Hongjing Tu

National Quality Inspection and Testing Center for Surveying and Mapping Products, China, People's Republic of

Urban Spatial Monitoring (USM) data is a new form of fundamental geographic information product. As an important component of the natural resources survey and monitoring system, USM provides strong support for the construction of the China Spatial Planning Observation Network (CSPON). The quality of USM data results is related to the accuracy of statistical analysis outcomes, the scientific decision-making for national economy, people's livelihood and social development, as well as the reliability of natural resources management applications. Based on the analysis of the characteristics of USM results, this paper proposes an inspection process including overall general inspection, detailed inspection, and out-of-sample general inspection; analyzes inspection parameters and items, and determines the inspection methods for different items; further identifies quality elements that can be automatically inspected by programs, and realizes batch automatic inspection of some items by establishing a rule base; then, studies the implementation methods of other quality inspection items, clarifies common problems, and improves the human-computer interactive inspection for quality items. Finally, the quality inspection results of urban space monitoring data products covering 170 prefecture-level city survey areas spanning approximately 6 million square kilometers demonstrate that the technical route proposed in this paper is feasible and the quality evaluation results are objective.

Assessing the Impact of Sun Glint on Seagrass and Benthic Habitat Classification Accuracy across various Algorithms using PlanetScope Imagery

Margaux Angelica Cruz

University of the Philippines Diliman, Philippines

Seagrasses are ecologically-important yet highly threatened blue carbon ecosystems that play a critical role in environmental protection, biodiversity conservation, and carbon sequestration. However, their spatial heterogeneity and dynamic temporal behavior pose challenges to accurate mapping and long-term monitoring. The availability of publicly accessible satellite images with high spatial and temporal resolution, and advances in machine learning, have gradually expanded seagrass geospatial research and led to more accurate and robust image classifications. This study evaluated the performance of traditional and machine learning methods for seagrass and benthic habitat mapping using clear and sun-glinted 3-meter resolution PlanetScope imagery. Classification accuracy metrics were compared across multiple algorithms and varying image quality, using two different reference datasets. Results indicate that the Maximum Likelihood Classification and Support Vector Machine Classification achieved the highest overall accuracy and kappa statistics for the clearest image used, the 8-band PlanetScope image acquired on February 16. As expected, the application of the sun glint correction procedure improved classification accuracies for lower-quality images, particularly for the Random Forest Classification which showed consistent and pronounced gains after deglinting. These findings demonstrate the potential of PlanetScope images for seagrass and benthic mapping, keeping in mind that careful image selection remains essential due to the imagery’s inherent sensitivity to sun glint and other radiometric inconsistencies affecting classification performance. In the absence of optimal or clear images, scenes with lower image quality may still be effectively utilized with the application of radiometric correction procedures such as sun glint removal.

Current Status of the National Ecological Observatory Network's Airborne Observation Platform

Tristan Goulden, Bridget Hass, Shashi Konduri, John Musinsky

Battelle - NEON, United States of America

The National Ecological Observatory Network (NEON) operates the Airborne Observation Platform (AOP) which collects airborne lidar, imaging spectroscopy and RGB camera information to support the characterization and forecasting of environmental and environmental processes. NEON operates at a continental scale, including observations at sites across the continental Unites States, Alaska, Hawaii and Puerto Rico, and will operate for 30 years. The AOP has been collecting data at sites with NEON for over 10 years, representing a highly valuable resource for conducting ecological change analysis. NEON AOP data highly standardized, and undergoes rigorous quality control and quality assurance processes. Data collected by the AOP is processed into a series of data products that are made freely available for educational and scientific endeavors. This presentation details the current status and future plans for NEON's AOP.

Mapping Peatland Sub-classes and Swamps across Canada using Multi-sensor remote Sensing and hierarchal Classification

Nicholas Pontone¹, Leila Rashid¹, Vincent Ribberink¹, Murray Richardson¹, Dan K. Thompson², Nicolas Pelletier³, Koreen Millard¹

¹Department of Geography and Environmental Studies, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario, K1S5B6, Canada; ²Canadian Forest Service, Great Lakes Forestry Centre, 1219 Queen Street East, Sault Ste. Marie, Ontario, P6A 2E5, Canada; ³Environment and Climate Change Canada, 335 River Road, Ottawa, Ontario K1V 1C7, Canada

Peatlands are a major component of Canada’s boreal region and play critical roles in carbon storage, biodiversity, hydrology, and climate regulation. Different peatland types respond differently to climate-driven changes in temperature, precipitation, and wildfire risk. Accurate maps of these sub-classes are essential for conservation planning, carbon accounting, and wildfire management. Although national-scale wetland maps for Canada have advanced in recent years, many lack detailed peatland sub-class information and often omit swamps.

This research builds on recent efforts by expanding the spatial extent of peatland sub-class mapping across Canada (excluding the Northern Arctic and Arctic Cordillera) and explicitly incorporating swamps as a separate class. A three-stage hierarchical framework was developed using a combination of optical, radar, and terrain-derived variables. Predictor datasets included Landsat spectral mosaics, NDVI harmonic coefficients, canopy height and closure, ALOS-2/PALSAR-2 L-band backscatter, seasonal Sentinel-1 coherence, and hydrological and geomorphometric derivatives from FABDEM. Reference data were compiled from multiple validated wetland inventories.

A Random Forest classifier was trained and validated at each stage: (1) wetlands vs. uplands and water, (2) peatlands vs. mineral wetlands, and (3) peatland sub-classes and swamps. Accuracy exceeded prior national efforts, with 87% accuracy at Stage 1, 94% at Stage 2, and 72% at Stage 3. Shapley Additive Explanations showed that the SAGA Wetness Index was consistently among the most important predictors, highlighting the central role of topography and moisture distribution. These results demonstrate the value of integrating multi-sensor remote sensing with terrain metrics to improve national-scale wetland classification.

Integrating spectral Indices with terrestrial Laser Scanner for Biomass Estimation in Hong Kong Mangroves

Muhammad Uzair Mahmood¹, Majid Nazeer¹, Man Sing Wong^1,2,3,4

¹Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University PolyU, Hong Kong S.A.R. (China); ²Research Institute for Sustainable Urban Development, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ³Research Institute for Land and Space, The Hong Kong Polytechnic University, Hong Kong SAR, China; ⁴Otto Poon Research Institute for Climate-Resilient Infrastructure, The Hong Kong Polytechnic University, Hong Kong, China

Mangrove forests along Hong Kong’s densely populated subtropical coastline fulfil significant blue carbon storage, shoreline protection and habitat functions but are vulnerable to hydroclimatic and anthropogenic pressures. This study integrates long-term satellite spectral analysis with field-based laser scanning to examine mangrove canopy dynamics and above-ground biomass. Seasonal composites of Landsat-8 imagery (2013 to 2025) were processed in Google Earth Engine, where NDVI, EVI, ATVI and GEMI were calculated. GreenValley DGC-50 SLAM-based backpack laser scanning system to collect plot-scale structural data. We registered, denoised, normalized and segment point clouds to retrieve tree height, crown diameter and diameter at breast height for being further used in species specific allometric equations to estimate biomass. The spectral time series indicates a persistent greening pattern with recurrent seasonal cycles and stronger canopy development after 2018. Comparison with field observations showed that laser-derived tree height was more consistent than DBH and crown diameter, indicating variable parameter accuracy in dense mangrove stands. The LiDAR survey provides valuable detailed structural information and supports biomass estimation for inaccessible areas. The LiDAR survey provides detailed structural information and supports the estimation of biomass, where traditional measurements are hard to obtain. The field survey does not validate long-term spectral trends but rather serves a contemporaneous structural reference frame for interpreting seasonal and interannual spectral variability. The combined framework enables enhanced estimation of mangrove biomass, blue carbon stock monitoring and coastal ecosystem management in Hong Kong.

5:30pm - 7:30pm

SEPT-TIF: ISPRS STudent and Early Professional (STEP) - TIF Evening Reception (by invitation only)

Awards Ceremony:

TIF Travel Grants
Professional Travel Award
ESA Awards

Date: Friday, 10-July-2026

8:30am - 10:00am

WG II/2E: Point Cloud Generation and Processing
Location: 713A

8:30am - 8:45am

Appearance-aware Scaling Diffusion Model for 3D Point Cloud Upsampling

Sunghwan Yoo, Gunho Sohn

York University, Canada

This paper introduces the Appearance-guided Scaling Diffusion Model (AGDM), a novel diffusion-based framework designed to densify sparse airborne laser scanning (ALS) point clouds while preserving fine geometric detail. Traditional diffusion models for 3D upsampling, such as LiDiff and PUDM, operate solely on intrinsic 3D information and struggle to reconstruct sharp edges and continuous surfaces when input data are extremely sparse. AGDM addresses these limitations by integrating two complementary conditional priors: multi-view appearance cues and geometry-aware 3D features.

Sparse point clouds are first rendered into ten synthetic viewpoints, and a Vision Transformer extracts high-level visual embeddings that encode surface appearance and boundary structures. In parallel, a Minkowski-based encoder processes the input geometry to capture spatial continuity and local shape characteristics. A cross-attention fusion module aligns and combines these modalities, producing a unified conditioning signal that guides a scaling diffusion network during iterative denoising.

AGDM is trained and evaluated on the YUTO dataset, where dense ground-truth scenes are reconstructed from multi-mission ALS data. Experiments demonstrate that AGDM achieves superior performance across Chamfer Distance, Jensen–Shannon Divergence, F1 score, and multi-scale IoU metrics. Qualitative results further show that the model produces more uniform, edge-preserving, and structurally coherent point clouds than existing diffusion approaches.

By leveraging appearance guidance alongside geometric priors, AGDM significantly improves the fidelity and practicality of LiDAR point-cloud upsampling, offering an effective pathway for scalable and cost-efficient 3D digital-twin generation.

8:45am - 9:00am

Scan Outlier Ratio (ScOR): LiDAR Scanning and Survey-Aware Filtering of Detached Points in Terrestrial and Permanent Laser Scanning Point Clouds

Ronald Tabernig^1,2, Bernhard Höfle^1,2

¹3DGeo Research Group, Institute of Geography, Heidelberg University, Heidelberg, Germany; ²Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Heidelberg, Germany

Accurate 3D surface reconstruction and change analysis relies on point clouds representing persistent solid surfaces and should neglect very small (< laser footprint size) and temporary objects that create outliers. Terrestrial and Permanent Laser Scanning (TLS/PLS) data often contains transient or detached points, which violate assumptions of common cloud-, mesh-, and surface-based 3D change analysis methods. Those points cause wrong correspondences and change values in multi-temporal point cloud comparison. We address this with the Scan Outlier Ratio (ScOR) filter, a LiDAR scanning and survey-aware descriptor designed to identify points unsuitable for most point cloud-based change analysis methods. ScOR compares the measured point spacing with the expected spacing, assuming the surface is locally planar and orthogonal to the incoming laser beam. ScOR works with a single scan or multiple scans acquired from the same position, enabling multi-temporal neighborhoods for filtering. Using data from natural and urban environments, we analyze ScOR across different surfaces, neighborhood sizes, temporal neighborhoods, and compare it with the Statistical Outlier Removal (SOR) algorithm. Results show that ScOR successfully removes non-surface points, while preserving surface information. In our experiments, the true positive rate exceeds 95% in all but one case, while the false positive remains below 10% throughout. With neighborhoods from subsequent and aggregated epochs, the method automatically detects and removes large temporary objects (e.g., a person). Due to its interpretability, efficiency, and range-aware design, ScOR provides an effective pre-processing method for automated and near real-time 3D surface change analysis with TLS/PLS.

9:00am - 9:15am

LiDAR-Enhanced 3D Gaussian Splatting SLAM for Planetary Rover Exploration

Lingxiao Zhang¹, Rong Huang^1,2, Yusheng Xu^1,2, Zhen Ye^1,2, Xiong Xu^1,2, Changjiang Xiao^1,2, Xiaohua Tong^1,2

¹College of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, China; ²Shanghai Key Laboratory for Planetary Mapping and Remote Sensing for Deep Space Exploration, Shanghai 200092, China

Autonomous positioning and scene reconstruction are crucial to the exploration and scientific research tasks of planetary rovers. 3D Gaussian splatting (3DGS) provides a new paradigm for dense reconstruction. However, the reconstruction method that relies only on monocular images will cause scale blur and insufficient geometric consistency. These problems are more prominent in planetary scenes that lack geometric constraints and weak textures. In order to overcome these limitations, we proposed a lidar-enhanced 3DGS-SLAM pipeline. By introducing sparse lidar measurements as prior information to improve depth prediction and ensuring consistent Gaussian initialization on the physical scale. Optimize the camera poses and Gaussian parameters through differentiable rendering to achieve robust localization and photometric-geometric consistency. Experiments on the Erfoud, a planetary similarity dataset, show that our method is superior to the advanced 3DGS-based SLAM system. The ATE has reduced by more than 50%. The PSNR, SSIM, and LPIPS have all improved significantly.

9:15am - 9:30am

Sensor Domain Adaptation for 3D Object Detection via LiDAR Super-Resolution

June Moh Goo, Zichao Zeng, Jan Boehm

University College London, United Kingdom

LiDAR-based perception models’ performance can degrade sharply when applied to data from sensors different to those they were trained on. LiDAR super-resolution aims to enhance sparse point clouds from low-cost sensors. This can help to bridge the sensor domain gap to higher resolution LiDAR. Prior work has primarily focused on reconstruction quality metrics for super-resolution with limited evaluation of downstream perception tasks. We address this gap by conducting a systematic analysis of how super-resolution quality impacts 3D object detection performance. We evaluate detection capability through zero-shot transfer experiments on the KITTI object dataset. Four representative detectors (SECOND, PointPillars, PV-RCNN, PointRCNN) trained on high-resolution data are directly applied to super-resolved low-resolution data without fine-tuning. Results reveal a critical insight: reconstruction improvements yield vastly different detection gains across architectures. PointPillars shows minimal improvement until reaching high reconstruction quality, then performance improves significantly. In contrast, PV-RCNN exhibits steady gains throughout. The highest-quality reconstruction closes up to 86% of the performance gap and enables detection in safety-critical scenarios, including distant vehicles and small pedestrians, where lower-quality methods fail entirely. This work establishes that LiDAR super-resolution effectiveness depends on both reconstruction quality and detector architecture.

9:30am - 9:45am

Ray Queries On Raw Point Clouds

Balthasar Teuscher, Paul Walther, Kwasi Nyarko Poku-Agyemang, Martin Werner

Technical University of Munich, Germany; TUM School of Engineering and Design, Department of Aerospace and Geodesy, Professorship of Big Geospatial Data Management

Retrieving information from point clouds for analysis and visualization has gained ever-increasing interest. A growing niche in this regard is ray queries, commonly used for image synthesis. Ray tracing is widely used in computer graphics, with a multitude of solutions based on bounding volume hierarchies. However, these solutions are rarely straightforward to integrate with raw point cloud data and geospatial analytical workflows. To overcome this, we present a novel approach to ray tracing in raw point clouds that builds upon and extends existing geospatial indices. The solution is exemplified by a fast octree implementation that supports versatile query semantics, such as neighborhood queries with constraints on k and radius for both points and rays, while offering configurable data organization schemes, including layered, fixed, and adaptive depth. The evaluation demonstrates satisfactory speed and capabilities for many scientific use cases, while simultaneously exhibiting low implementation costs, high flexibility, and simplicity in integrating ray tracing into analytical point cloud workflows.

9:45am - 10:00am

Analysis of free large Area covering Elevation Models and improvement by ICESat-2

Karsten Jacobsen

Leibniz University Hannover, Germany

Accuracy analysis of free elevation models TDX-EDEM, AW3D30, SRTM and ASTER GDEM-3. Determination of systematic elevation model errors by Z-shift, model tilt and systematic errors as function of X and Y. Comparison with ICESat-2 data, determination of the systematic elevation model errors by ICESat-2 ATL08 data and correcting the free elevation models. Accuracy analysis of the corrected elevation models by airborne LiDAR data.

The corrections based on the ICESat-2 data significantly improved the free elevation models.

8:30am - 10:00am

WG III/1L: Remote Sensing Data Processing and Understanding
Location: 713B

8:30am - 8:45am

Enhancing digital soil texture mapping accuracy using high-resolution remote sensing data and a hierarchical modelling approach

Mahsa Mozaffari¹, Jean-Daniel Sylvain^2,4, Osvaldo Valeria¹, Mickael Germain³, Maxence Martin¹

¹Université du Québec en Abitibi-Témiscamingue, Canada; ²Ministère des Ressources naturelles et des Forêts (MRNF); ³Université de Sherbrooke, Sherbrooke, QC, Canada; ⁴École de technologie supérieure, Université du Québec, Montréal, QC, Canada

Accurate and spatially detailed soil information is essential for sustainable land management, agriculture, and environmental monitoring, yet existing soil maps often lack the resolution required to represent fine-scale soil texture patterns. This study investigates a hierarchical modelling framework that integrates high-resolution remote sensing data, including Sentinel-2 imagery and LiDAR-derived terrain attributes, with soil texture predictions from the provincial SIIGSOL dataset. The approach is evaluated across three contrasting regions in Quebec, eastern Canada, selected for their diverse landscape conditions and soil variability. Two modelling strategies were compared: a model based solely on Sentinel-2 and LiDAR predictors, and a hierarchical model that incorporates SIIGSOL covariates to examine their added value. The findings show that integrating multi-source information improves the representation of soil texture patterns and enhances model stability. This work highlights the potential of hierarchical, multi-scale approaches for producing more accurate digital soil maps. Future efforts will extend this modelling framework across the broader landscape to support high-resolution soil mapping for land management applications.

8:45am - 9:00am

Operational Crop Type Mapping Using Sentinel-1/2 Data with Intermodal and Temporal Mamba Fusion for the Case Study of Brandenburg, Germany

Minghui Chang^1,2, Shuaifeng Peng², Shihua Li², Ayshah Xuan Chan^3,4, Yi Yuan^1,2, Marco Körner^3,4, Katharina Anders²

¹University of Electronic Science and Technology of China; ²TUM School of Engineering and Design, Technical University of Munich, Germany; ³Remote Sensing Technology, TUM School of Engineering and Design, Technical University of Munich, Germany; ⁴Munich Data Science Institute (MDSI), Technical University of Munich (TUM)

Crop type mapping is essential for agricultural monitoring, food security assessment, and regional management, yet large-scale operational mapping remains challenging. Reliance on a single modality and the absence of explicit spatio-temporal constraints limit existing methods from fully capturing diverse crop-rotation patterns and phenological trajectories over the growing season. To address this limitation, we propose a multi-source, multi-temporal crop mapping framework. Multi-epoch Sentinel-2 and Sentinel-1 observations are preprocessed in Google Earth Engine to produce co-registered optical and SAR time series, including spectral and vegetation indices as well as radar backscatter descriptors. The proposed model couples cross-sensor interaction with seasonal dynamics: an intermodal Mamba fusion mechanism exploits the complementarity between optical vegetation signals and SAR structural information to strengthen parcel boundaries and reduce sensor-specific artefacts, while a temporal Mamba module explicitly models crop development over time, capturing phenological evolution and differences in the diagnostic value of individual observation dates. Decoding the spatiotemporal representation yields the final crop type map. We evaluate our framework for the Federal State of Brandenburg in Germany, where results demonstrate field-aligned, spatially coherent predictions and robust suppression of speckle- and cloud-induced artifacts, validating joint multi-sensor, multi-temporal modeling for operational crop mapping.

9:00am - 9:15am

Assessing the impact of spatial resolution on morphological spatial pattern analysis of urban green infrastructure connectivity: a case study of Miami-Dade County, USA

Oumayma Moufid¹, Sarbeswar Praharaj²

¹Hassania School of Public Works, Casablanca, Morocco; ²Department of Geography and Sustainable Development and School of Architecture, University of Miami, FL, USA

Urban green infrastructure plays a crucial role in supporting ecological connectivity, enhancing climate resilience, and promoting human well-being. As cities densify, maintaining functional green networks increasingly depends on understanding the structural continuity of vegetation within complex urban fabrics. Morphological Spatial Pattern Analysis (MSPA) provides a practical framework for quantifying green infrastructure structure; however, its sensitivity to spatial resolution remains insufficiently examined—particularly at metropolitan scales, where high-resolution data are becoming increasingly available. This study examines the impact of spatial resolution on MSPA outputs for mapping and interpreting urban green connectivity in Miami-Dade County, USA. Two scenarios were compared using 10-m canopy data and 2-m high-resolution canopy data processed across 23 tiles. The workflow integrated vegetation preprocessing, MSPA classification, and quantitative and visual comparisons of structural classes to assess scale effects. Results demonstrate that fine-resolution MSPA (2 m) preserves continuous canopy structures and narrow vegetated corridors that the 10-m analysis tends to fragment or omit. High-resolution outputs provide a more realistic representation of neighborhood-scale connectivity, especially in tree-dense areas such as Coral Gables, while also revealing the computational demands of metropolitan-scale MSPA processing. The findings confirm that MSPA results are inherently scale-dependent and that the choice of resolution critically shapes the interpretation of connectivity. This research provides an operational foundation for incorporating high-resolution morphological analyses into urban resilience planning, nature-based solutions, and socio-ecological equity assessments.

9:15am - 9:30am

Pseudo-labeling strategy and U-Net for high-resolution LULC mapping using CBERS-04A imagery in the Servidão river basin, Brazil

Danilo Marques de Magalhães¹, Murilo Marinho Pardo Lucas¹, Edgar Auler Galvão de França²

¹Dept. of Geography and Environmental Planning, São Paulo State University, Rio Claro, Brazil; ²Institute of Computing, University of Campinas, Campinas, Brazil

Accurate Land Use and Land Cover (LULC) data are vital for effective land planning and management. This study evaluates the U-Net model for LULC mapping using high-spatial-resolution (2 m) imagery from the WPM sensor on the CBERS 04A satellite. The research focuses on the Servidão River Basin in Rio Claro, Brazil, an urban watershed susceptible to flooding. A pseudo-labeling framework is proposed to reduce reliance on manually annotated training data. Training samples were automatically generated by integrating spectral indices (NDVI, NDWI, SOCI, CI, NISI), Principal Component Analysis, and unsupervised Iso-Cluster classification. Several U-Net configurations were evaluated, with a ResNet-34 backbone with class weighting achieving the highest performance. The model was then retrained using a manually refined reference dataset to enhance the representation of spectrally complex classes. Accuracy assessment resulted in an Overall Accuracy of 0.93, average Precision and Recall of 0.92, and a mean Intersection over Union (IoU) of 0.86. These findings indicate that the proposed pseudo-labeling strategy, combined with a U-Net, offers a robust approach for LULC mapping in complex urban environments using freely available CBERS 04A imagery.

9:30am - 9:45am

First-order branch modelling based on bidirectional searching

Hanwen Qi, Xinlian Liang

Wuhan University, China, People's Republic of

A first-order branch modelling method based on bidirectional searching was proposed, the key steps included skeletonization using local separators, trunk extraction based on path straightness and first-order branch extraction using bidirectional searching. The method was tested on ForestSemantic dataset, and results showed that the extraction precision was 80.29%, and RMSE of the pitch angle estimation was 9.74°, indicating that the method can effectively recover the topological structure of branches.

9:45am - 10:00am

Advancing GRACE/GRACE-FO Hydrology: Deep Learning-based Reconstruction and Downscaling

Joseph Awange, Jielong Wang

The Hong Kong Polytechnic University, Hong Kong S.A.R. (China)

Long-term and high-resolution terrestrial water storage (TWS) monitoring is critical for water-resource management, climate adaptation, and understanding hydroclimatic variability. Satellite gravimetry missions such as GRACE and GRACE-FO provide unprecedented observations of TWS but are limited by coarse spatial resolution, short observational records, and temporal gaps. This study presents an integrated deep-learning framework for reconstructing and downscaling GRACE/GRACE-FO data to produce century-scale, high-resolution TWS datasets. We apply RecNet and an enhanced RecNet (ERecNet) to reconstruct historical TWS anomalies in the Sudd Wetland, Lake Victoria Basin, and Nile River Basin, leveraging climate variables and lake-level observations. To overcome spatial limitations, we develop DownGAN, a novel generative adversarial network with a high-to-high downscaling strategy, producing fine-scale TWS patterns while maintaining mass consistency. The fusion of reconstruction and downscaling enables detailed, long-term monitoring of wetland dynamics, droughts, and hydroclimatic variability. Reconstructed datasets reveal multi-decadal wetting/drying phases and strong links between TWS fluctuations and climate teleconnections such as ENSO and the Indian Ocean Dipole. This framework advances the application of GRACE/GRACE-FO for climate resilience, ecosystem monitoring, and water-resource management in data-scarce regions, demonstrating the potential of deep learning to extend satellite-based hydrological observations both spatially and temporally.

8:30am - 10:00am

ThS2: Remote Sensing of Methane: Technological and Methodological Advances
Location: 714A

8:30am - 8:45am

A Self-Supervised Learning Framework for Methane Emission Detection Using Sentinel-2

Mohammad Marjani¹, Masoud Mahdianpari¹, Eric. W Gill¹, Fariba Mohammadimanesh²

¹Memorial University of Newfoundland, Canada; ²Canada Centre for Remote Sensing, Natural Resources Canada, 580 Booth Street, Ottawa, ON, Canada

Methane (CH4) is a major greenhouse gas; however, large-scale monitoring remains challenging due to the high costs and spatial limitations of ground-based and airborne observations. In contrast, Sentinel-2 shortwave infrared (SWIR)–based plume detection is hindered by its coarse spectral resolution, surface artifacts, and limited real-world annotations. This study proposes a self-supervised learning (SSL) framework based on the Simple Framework for Contrastive Learning of Visual Representations (SimCLR) to learn transferable CH4 plume representations from unlabeled Sentinel-2 data. A real-world dataset of 456 Sentinel-2 image tiles was manually annotated using the multi-band–multi-pass (MBMP) approach and utilized to evaluate six encoder backbones. Across five labeled-data portions ranging from 20% to 100%, SimCLR pretraining improved plume segmentation compared to ImageNet-only initialization. In the full-data scenario, MobileNet achieved an F1-score of 0.90 with an Intersection over Union (IoU) of 0.80, while Shifted Window Transformer (SwinT) reached an F1-score of 0.85 with an IoU of 0.75. The benefit of self-supervised pretraining was most evident with limited labeled data, where ImageNet-only models degraded substantially, while SimCLR-pretrained encoders achieved higher accuracy. Moreover, the Integrated Mass Enhancement (IME) method was employed for quantifying the emission flux rate. MobileNet provided the strongest agreement with reference emission estimates, achieving an RMSE of 1690 kg/h. Finally, the results demonstrate that SimCLR-based SSL substantially enhances CH4 plume detection from Sentinel-2 imagery and supports more reliable emission quantification for large-scale CH4 monitoring.

8:45am - 9:00am

Satellite-based detection of methane emissions from permafrost peatland warming

Ray Nassar¹, Joseph Mendonca¹, Sabour Baray¹, Peter Morse², Qindii Shafi³, Ama Gamage³, Oliver Schneising⁴, Michael Buchwitz⁴

¹Environment and Climate Change Canada, Science and Technology Branch, Toronto, Canada; ²Natural Resources Canada, Geological Survey of Canada, Ottawa, Canada; ³University of Waterloo, Waterloo, Canada; ⁴University of Bremen, Institute of Environmental Physics, Bremen, Germany

Column-averaged methane (XCH4) data spanning 2018-2023 from the European Space Agency (ESA) Tropospheric Monitoring Instrument (TROPOMI) on the Copernicus Sentinel-5 Precursor satellite are assessed for evidence of methane (CH4) emissions from permafrost. We generated bi-monthly anomaly maps of XCH4 from TROPOMI and soil temperature (Tsoil) from reanalysis data for all land north of 50°N. Considering the XCH4 anomalies in the contexts of soil carbon content and wind variability led to a focus on Canada’s Hudson Bay Lowlands (HBL), Earth’s second largest peatland complex (~325,000 km2), which is underlain by continuous to isolated permafrost. This sub-Arctic region is vulnerable to rapid climatic warming and exhibits wind conditions favorable for emission detection from space. HBL XCH4 anomalies strongly correlate with soil temperature anomalies (R = 0.626 to 0.866), consistent with wetlands as the primary CH4 emission source; however, the strong increase in CH4 emissions over 2018-2023 may also suggest a contribution from permafrost thaw and expansion of thermokarst fens.

9:00am - 9:15am

Satellite-based Assessment of Wetland Methane Emissions in Urban Regions: a Comparative Analysis with Anthropogenic Sources Across North American Cities

Mohammadali Hemati¹, Masoud Mahdianpari^1,2, Hodjat Shiri³, Fariba Mohammadimanesh⁴

This study leverages TROPOMI satellite observations and atmospheric inversion modelling to quantify methane emissions from urban wetlands across six major North American cities, including Toronto, Montreal, New York, Los Angeles, Houston, and Mexico City. By coupling high-resolution column-averaged methane measurements with the GEOS-Chem chemical transport model via the Integrated Methane Inversion (IMI) platform, the research distinguishes emissions from both natural wetland and anthropogenic urban sectors. Results indicate that prior inventories substantially underestimate urban wetland methane emissions in most cities. Posterior wetland emissions are resolved alongside dominant anthropogenic sources such as landfills, energy systems, and wastewater, revealing spatially distinct patterns and highlighting seasonal wetland flux variability. The findings demonstrate that urban wetlands, although representing a relatively smaller source compared to anthropogenic emissions, display considerable underrepresented contributions to local methane budgets, underscoring the need for robust, integrated monitoring in urban environments. This methodology provides a scalable framework for routine urban wetland methane flux quantification and supports evidence-based climate mitigation and land management strategies.

9:15am - 9:30am

Methane Plume Detection in Sentinel-2 Imagery using a Transformer-based Model and a Comprehensive Benchmark Dataset

Ali Radman¹, Masoud Mahdianpari^1,2, Fariba Mohammadimanesh³

¹Memorial University of Newfoundland, St. John's, Newfoundland, Canada; ²2 C-CORE, St. John’s, Newfoundland, Canada; ³Canada Centre for Remote Sensing, Natural Resources Canada, 580 Booth Street, Ottawa, Ontario K1A 0E4, Canada

Methane plume detection from medium-resolution multispectral satellites such as Sentinel-2 remains challenging due to weak methane signals and strong background variability across land cover, illumination conditions, and atmospheric states. To advance automated detection capabilities, we develop a large-scale benchmark dataset that combines simulated methane plume enhancements with real Sentinel-2 imagery, covering a wide range of emission magnitudes and diverse environmental scenarios. The dataset includes over 64,000 samples and incorporates methane-sensitive inputs derived from the MBMP retrieval workflow, providing a comprehensive foundation for robust model training and evaluation. Building on this dataset, a hybrid transformer–U-Net architecture is proposed, integrating global self-attention with Grouped Attention Gates to enhance feature fusion and improve segmentation of methane structures. The model achieves high accuracy on the benchmark dataset and demonstrates strong generalization to real emission events in complex environments. The combined contributions of the benchmark dataset and hybrid model offer a promising path toward reliable, scalable methane plume monitoring using widely available multispectral satellite observations.

9:30am - 9:45am

Cross Sensor Fusion of Hyperspectral-derived and Sentinel-5P Data for Greenhouse Gas and Air Pollution Mapping

Vasil Yordanov, Sadra Zahed Kachaee, Maria Antonia Brovelli

Politecnico di Milano, Department of Civil and Environmental Engineering (DICA), Italy

Methane (CH₄) is a potent short-lived climate pollutant, making the detection of major point sources (“super-emitters”) crucial for mitigation. The Sentinel-5 Precursor (S5P) mission, with the TROPOMI instrument, captures global methane concentrations at ~7 × 5.5 km resolution with near-daily coverage. While this resolution is too coarse to identify emissions from individual facilities, its revisit frequency allows effective regional monitoring. Conversely, high-resolution (HR) imaging spectrometers like Carbon Mapper’s Tanager (~30 m) and NASA’s EMIT (~60 m) provide detailed plume mapping but have limited spatial and temporal coverage.

Carbon Mapper releases open-access, high-resolution plume products including georeferenced rasters and metadata. In this study, these HR detections serve as reference events to assess their visibility in coarser Sentinel-5P observations. The workflow includes curating HR events, summarizing their emission context, and inspecting nearby Sentinel-5P data for consistent methane enhancements. The method is exploratory and avoids presupposing Sentinel-5P’s success or failure in detecting plumes at this scale.

This analysis bridges the gap between frequent global monitoring and targeted HR observations. It establishes a path for future cross-sensor integration, combining HR spatial precision with Sentinel-5P’s temporal continuity. With additional labeled data, this approach could inform machine-learning tools for methane anomaly detection and plume segmentation, improving operational methane monitoring across scales.

8:30am - 10:00am

WG III/8E: Remote Sensing for Agricultural and Natural Ecosystems
Location: 714B

8:30am - 8:45am

Large-scale individual crown tree segmentation across entire white spruce forests using UAV hyperspectral imagery and deep learning

Sylvain Colomer^1,5, Aravind Harikumar¹, Leen Al-Sameai¹, Vincent Riedel¹, Vincent Seigner², Ian Major², Nathalie Isabel², Ingo Ensminger^1,3,4

¹Department of Biology, University of Toronto, Mississauga, ON L5L 1C8 CA; ²Laurentian Forestry Centre, Natural Resources Canada, Canada; ³Graduate Department of Cell and Systems Biology, University of Toronto, Toronto, ON M5S CA; ⁴Graduate Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON M5S CA; ⁵ETIS Laboratory, UMR8051, CY Cergy Paris Université, ENSEA, CNRS, Cergy, France

The development of high-performance, affordable UAVs has transformed vegetation monitoring, enabling observation of forest canopies at an unprecedented level of detail. UAV-derived datasets now provide high-fidelity structural and physiological information at the individual tree level across entire forest stands, offering novel insights into forest dynamics. In the context of increasing tree mortality, such data are becoming essential for understanding forest resilience and adaptation. However, exploiting this data requires effective individual tree crown segmentation algorithms (ITCS) at the forest scale, capable of tackling large-scale data and variability introduced by the environment. In this paper, we developed a new workflow designed to process UAV hyperspectral imagery at the forest scale, enabling automated ITCS and analysis. Our pipeline integrates hyperspectral-to-RGB conversion, ITCS, and centroid-based mask fusion. To assess the performance of our pipeline, we evaluated the model on two replicated white spruce common gardens in Canada, each comprising approximately 6,000 trees of similar age and structure. The experiments rely on a large multi-temporal dataset of hyperspectral imagery acquired during 60 UAV missions between 2022 and 2024, allowing us to evaluate the robustness of the proposed pipeline across a wide range of seasonal and acquisition conditions. Results show that the proposed pipeline achieves a mean segmentation performance of 0.536 mAP (0.885 mAP50) on the annotated dataset. At the forest scale, the system demonstrates strong detection capability with F1-scores of 0.948 at the Pintendre site and 0.863 at the Pickering site, successfully detecting most trees while maintaining stable performance across varying environmental conditions.

8:45am - 9:00am

Evaluating a modified StarDist Implementation for Individual Tree Detection and Crown Delineation in heterogeneous Landscapes

Nora Tilly¹, Andreas Tilly², Georg Bareth¹

¹University of Cologne, Germany; ²Independent Researcher

Individual tree detection and crown delineation (ITDCD) in dehesa landscapes is complicated by geometric distortions from steep terrain, varying tree densities, and the partly multi-crown 'broccoli-like' structure of holm and cork oaks. This study evaluates the usability of a modified StarDist deep learning model, which has recently shown effectiveness for ITDCD in Canadian forests. Moreover, this study develops a workflow transforming the original StarDist, designed for microscopy images, into an ITDCD solution, taking the georeferencing of geospatial data into account. The tile-wise organized ground truth dataset is created with the pretrained Tree Segmentation model available in the ArcGIS Living Atlas, combined with manual revision. Several augmentation methods are applied, resulting in 960 images, which are split into 85 % for training and 15 % for validation. Following the approach of the Canadian forest study, the StarDist implementation is modified by introducing a constraint to the probability loss function. Rather than computing loss across all pixels, the modified loss function considers only pixels explicitly annotated as objects, while background pixels are excluded. An additional dataset of 1,200 trees serves as ground truth for testing the prediction across the entire study area. Using an Intersection over Union of 0.5, this test demonstrates good performance (Accuracy: 87.50 %; F1-score: 0.85). The accuracy varies with tree density: in areas with sparse tree cover, nearly all tree crowns are detected; in moderately dense areas, a number of tree crowns are missed; whereas in very dense tree layers, the frequency of missed detections increases.

9:00am - 9:15am

Treetop-Guided Multi-task Deep Learning Framework for Individual Tree Crown Detection and Delineation from Airborne LiDAR in Mixed-Wood Forests

Qian Li, Baoxin Hu

York University, Canada

Individual tree crowns detection and delineation from airborne LiDAR data is essential for forest inventory, carbon stock estimation, and ecosystem monitoring. In mixed-wood forests, however, this task remains difficult due to high stand density, multi-layered canopy structure, and the wide variation in crown size and shape across coniferous and deciduous species.

This study addresses two core limitations of existing deep learning methods for individual tree crown delineation. Standard instance segmentation models rely on blind anchor-based proposals that frequently miss small understorey trees in dense canopies, and their pixel-based mask representations struggle to accurately capture crown boundaries for small or irregular crowns.

We propose a multi-task learning framework that jointly trains a structure-aware treetop detection head and a crown segmentation head on a shared backbone network. The treetop detection head generates spatially precise crown seeds guided by canopy height and allometric relationships, replacing blind anchor proposals with data-driven initialisation. Two segmentation strategies are evaluated within this framework: a Mask R-CNN pixel-based approach and a StarDist contour-based approach.

Experiments are conducted on a high-density airborne LiDAR dataset acquired over a mixed-wood forest in Ontario, Canada, comprising 4,417 manually delineated reference crowns. Results demonstrate improved detection completeness for small crowns and more accurate boundary delineation for overlapping larger crowns compared to single-task baselines.

9:15am - 9:30am

Tree species identification in Ontario mixed forests using multi-temporal hyperspectral and LiDAR data with UAV

Zhaoshu Shi¹, Alexander McLaren², Ben DeVries³

¹University of Guelph, Canada; ²University of Guelph, Canada; ³University of Guelph, Canada

This study examines the use of multi-temporal UAV hyperspectral and LiDAR data to identify tree species in a mixed deciduous forest in southern Ontario, Canada. Weekly UAV flights were conducted from summer through spring to capture structural and spectral changes associated with leaf development, senescence, and leaf drop. Field measurements were collected to provide species labels and biometric information for individual trees.

LiDAR data are processed to delineate individual tree crowns and to derive structural metrics such as crown height, width, density, and vertical canopy profile. Hyperspectral imagery, consisting of more than 300 bands, is co-registered with the LiDAR-derived crowns to extract spectral signatures and compute vegetation indices. These data support the development of a spectral library for the main species in the study area.

The multi-temporal dataset allows evaluation of how phenological changes influence separability among species. Early leaf loss in autumn and differences in budburst timing in spring are expected to produce temporary structural and spectral contrasts that aid classification. Machine learning models, including random forest and neural networks, are applied to assess the contribution of structural, spectral, and seasonal features to species discrimination.

9:30am - 9:45am

UAV-Based 3D gaussian splatting for reconstruction and individual segmentation of field-grown soybean seedlings

Hanbing Jiao¹, Junhua Kang¹, Heli Li², Guijun Yang^1,2

¹College of Geological Engineering and Geomatics, Chang'an University, China; ²Key Laboratory of Quantitative Remote Sensing in Agriculture of Ministry of Agriculture and Rural Affairs, Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, China

Accurate 3D reconstruction and instance segmentation of soybean seedlings are crucial for early phenotyping and precision agriculture. This study presents a UAV-based sparse-view 3D reconstruction and plant-level segmentation framework that integrates 3D Gaussian Splatting (3DGS) with Mobile-SAM, enabling efficient and high-fidelity modeling under routine field conditions. Traditional LiDAR and MVS approaches, while detailed, are constrained by cost, acquisition density, and computational complexity. By contrast, 3DGS offers explicit Gaussian primitives for fast rendering and direct geometric access but often fails under sparse-view UAV imagery due to weak multi-view constraints and repetitive canopy structures.

To overcome these limitations, the proposed method introduces a mask–geometry co-optimization mechanism: YOLO-generated bounding-box prompts guide Mobile-SAM to produce accurate single-view plant masks, which serve as semantic priors to associate 2D observations with 3D Gaussian primitives. Iterative refinement aligns rendered and observed masks, ensuring spatial consistency and coherent 3D plant boundaries.

Field experiments on a soybean plot demonstrated the method’s effectiveness, achieving high reconstruction quality and visually precise seedling segmentation. The resulting 3D models capture fine structural details and distinct plant instances even under sparse-view UAV data.

This work highlights the potential of combining explicit geometric modeling and lightweight semantic segmentation to achieve robust, scalable, and field-deployable 3D crop reconstruction, offering a promising pathway for high-throughput plant phenotyping and yield estimation in real-world agricultural applications.

9:45am - 10:00am

Upscaling vegetation cover from UAV to satellite imagery

Elena Belcore¹, Marco Piras¹, Alvaro Moreno Martínez², Ana B. Ruescas²

¹DIATI, Department of Environment, Land and Infrastructure Engineering, Politecnico di Torino, Corso Duca degli Abruzzi, 24, 10129 Torino, Italy; ²Image Processing Laboratory (IPL), Universitat de València, Paterna, Spain

In this study, we propose an upscaling approach based on 8-band PlanetScope SuperDove imagery (Coastal Blue, Blue, Green I, Green, Yellow, Red, Red Edge, NIR) combined with UAV data. We employed an evidential Dirichlet neural network to estimate the fractional cover of 13 herbaceous and shrub species typical of Mediterranean coastal dunes, previously mapped at 3 cm using a traditional Random Forest classifier trained on UAV multispectral samples. The overall goal is to enable large-scale mapping of coastal vegetation using high-resolution satellite imagery.

8:30am - 10:00am

ICWG III/IVa-E: Disaster Management
Location: 715A

8:30am - 8:45am

A Remote Sensing Approach to Identifying Drought Onset and Progression in Central India

Hempushpa Sahu, Pradeep Kumar Garg, Saurabh Vijay

Indian Institute of Technology Roorkee

Climate change is intensifying droughts, creating an urgent need to understand these events and take necessary mitigation actions. This work focuses on the Bundelkhand region of Uttar Pradesh, India, an area that frequently experiences severe water stress and is highly susceptible to drought. We used multi-source remote sensing datasets to monitor drought conditions through established drought indices. The analysis period spans from 2000 to 2021. CHIRPS data were used to calculate SPI and RAI, while PKU GIMMS NDVI data were used to calculate VCI. ERA-5 Land was used for soil moisture data to derive SMCI. To track how drought propagates, we performed a correlation analysis between the indices representing meteorological and agricultural drought. The results show that the datasets and the methods are suitable for identifying droughts in the region. Historical drought episodes were accurately detected, and the analysis of the 2015 drought revealed its onset from June to September, which aligns with the monsoon season in Bundelkhand. The datasets and indices used provide a practical and reliable output for sparse ground-based observations for regional drought monitoring and management.

8:45am - 9:00am

Improved Agro-Climatological Drought Monitoring: The Near-global Combined Drought Monitoring Dataset

Sneha Kulkarni, Yohei Sawada

University of Tokyo, Japan

The Near-global Combined Drought Monitoring (NEC-DROMO) dataset provides a comprehensive depiction of drought conditions by integrating multiple agro-climatological variables across global land areas. Spanning 2002–2021 at a monthly time step and 0.25° spatial resolution, NEC-DROMO combines soil moisture, vegetation water content (VWC), rainfall, and temperature to capture both agricultural and meteorological drought signals. A key feature of the dataset is the use of Principal Component Analysis (PCA) to derive dynamic, month-specific weights for each variable, allowing the Combined Drought Indicator (CDI) to reflect seasonal and regional variability in drought drivers.

The dataset is built primarily on inputs from the ECoHydrological Land Reanalysis (ECHLA), which provides soil moisture, VWC, and temperature derived from passive microwave observations assimilated through a land–vegetation model. Rainfall fields are obtained from the ERA5 reanalysis, ensuring consistency across atmospheric and land-surface conditions. Validation against satellite-based indicators, ground observations, and event-based disaster datasets demonstrates NEC-DROMO’s strong capability to reproduce observed drought patterns globally.

With its multi-variable foundation and long-term coverage, NEC-DROMO serves as a valuable resource for drought monitoring, climate analysis, food-security assessment, and agricultural risk management. It supports detailed historical analyses and offers an integrated perspective for users seeking reliable, spatially consistent drought information.

9:00am - 9:15am

Observed increase in tropical vegetation droughts over the past three decades

Shuai Cheng

Eastern Institute of Technology, Ningbo, China

Tropical terrestrial vegetation is critical to the global carbon cycle but faces escalating drought threats. Traditional assessments using fixed climate thresholds often ignore actual physiological responses and non-moisture disturbances. To address this, we developed a novel framework that isolates the true physiological impacts of atmospheric and soil moisture (SM) deficits to identify growing-season vegetation droughts (1982–2019). Results reveal pantropical increases in drought intensity, with tropical forests experiencing significantly sharper intensifications than other biomes. Regionally, African forests exhibit the most severe expansions in drought intensity and area. Interpretable machine learning attributes this intensifying drought predominantly to declining SM (NDVI: 52.1%; LAI: 53%). Finally, while reliable historical reconstruction is vital for future projections, CMIP6 models fail to reproduce these observed trends. These findings highlight mounting drought pressures on tropical forests and underscore the critical need for improved climate models to inform mitigation strategies.

9:15am - 9:30am

Multi-source data driven forecasting of Extreme Heat Events using an ARIMA–XGBoost hybrid framework

Dandi Liao, Lina Xu, Pengfei Long, Yuxuan Wang, Qiying Dong, Siyu Liu, Xincai Chang

School of Geophysics and Geomatics, China University of Geosciences, Wuhan, China.

Extreme heat events (EHEs) pose growing risks to densely populated subtropical cities such as Hong Kong, yet there remains a need for lightweight, interpretable tools that can provide multi-day forecasts based on readily available observations. This study develops a multi-source data driven framework that integrates aerosol optical depth (AOD), land surface temperature (LST), precipitable water (PW), and precipitation (Precip), together with ARIMA-based anomaly features, to predict EHEs over Hong Kong. Using a seven-day sliding window, independent XGBoost classifiers are trained to forecast daily EHE occurrence probabilities for the next 1–5 days over ten climate years (March 2015–February 2025). A lead-specific threshold optimization on a validation subset is applied to maximize F1-score. Test results show that AUC values for Lead 1–Lead 5 remain between 0.935 and 0.883, with F1-scores between 0.738 and 0.639, indicating robust predictability up to five days in advance. A process-scale duration inference method based on the leading continuous segment of the predicted sequence achieves 67.08% exact-match accuracy, 77.69% accuracy within ±1 day, and a mean absolute error of 0.75 days. The proposed framework is computationally efficient and operationally relevant, offering practical support for urban heat early warning and risk management.

9:30am - 9:45am

Climate Transition Zones As Emerging Hotspots For Natural Hazards: Insights From Land Use- Climate Feedbacks Amplify Disaster Risk In Taiwan

Manikandan Sathianarayanan, Pai-Hui Hsu

National Taiwan University, Chinese Taipei

Anthropogenic climate change and land use transformations are interactively reshaping environmental risks. This study investigates the critical feedback between Land Use/Land Cover (LULC) change and shifts in Köppen-Geiger (KG) climate zones in Taiwan from 2001–2020, and their combined impact on disaster hotspots.

Using MODIS and CHIRPS data alongside a comprehensive disaster inventory, we quantified the spatial co-occurrence of LULC change and climate zone transitions. Our preliminary results reveal a significant climatic shift, with over 10,500 km² transitioning from tropical monsoon (Am) to a drier tropical savanna (Aw) climate, alongside substantial wetland loss and urban expansion. We hypothesize that these dynamic "climate transition zones" are emerging fronts of heightened disaster risk.

Our analysis tests whether areas undergoing active climate reclassification concentrate a disproportionate share of historical landslides and floods. The expected outcome is a novel, dynamic risk assessment framework that moves beyond static models. By identifying these emerging hotspots, this research provides a critical tool for proactive land-use planning and climate-resilient disaster risk reduction, with methodologies applicable to other complex, hazard-prone regions.

9:45am - 10:00am

Performance Evaluation and Limitations Assessment of GeoAI Democratization for Extreme Event Induced Disasters

Andrea Demartis¹, Fabio Giulio Tonolo¹, Andrea Ajmar²

¹Politecnico di Torino, Deaprtment of Architecture and Design (DAD), Viale Mattioli 39, 10125, Torino, Italy; ²Politecnico and Università di Torino, Interuniversity Department of Regional and Urban Studies and Planning (DIST), Viale Mattioli 39, 10125, Torino, Italy

Climate change is amplifying the occurrence and intensity of Extreme Event Induced Disasters (EEID), such as floods and wildfires, which increasingly threaten societies and ecosystems. Fast and accurate monitoring tools are therefore essential for damage assessment and emergency response. Remotely sensed data, particularly from the Copernicus Sentinel-2 mission, provide valuable multispectral information for large-scale environmental monitoring, but their manual analysis remains time-consuming.

Recent advances in Deep Learning (DL) have enhanced classification, segmentation, and change detection of geospatial data. New multimodal Prompt-Based (PB) architectures integrate image and text inputs via Text Encoders (TEs), enabling zero-shot detection of previously unseen objects. These models promise flexible, prompt-driven analysis but often underperform compared to Object-Specific (OS) models optimized for particular tasks.

In Earth Observation (EO), foundation models such as Prithvi-EO and TerraFM mark a major step forward, offering generalized pre-training across vast multi-sensor datasets to support downstream OS tasks with limited data.

While DL traditionally requires coding expertise, commercial GIS platforms now integrate DL tools accessible through Graphical User Interfaces (GUIs), allowing inference and limited fine-tuning of pre-trained models. This democratizes DL access for GIS users but shifts expertise toward model evaluation and interpretability.

This study systematically compares PB and OS models executed through both GUI-based and Python environments using Sentinel-2 flood and wildfire imagery, assessing accuracy, flexibility, and processing efficiency to evaluate the balance between accessibility and performance in the democratization of DL for EEID monitoring.

8:30am - 10:00am

WG II/3G: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

8:30am - 8:45am

ZeD-MAP: Bundle Adjustment Guided Zero-Shot Depth Maps for Real-Time Aerial Imaging

Selim Ahmet Iz¹, Francesco Nex², Norman Kerle², Henry Meissner¹, Ralf Berger¹

¹German Aerospace Center, Germany; ²University of Twente, The Netherlands

Real-time depth reconstruction from ultra-high-resolution UAV imagery is essential for time-critical geospatial tasks such as disaster response, yet remains challenging due to wide-baseline parallax, large image sizes, low-texture or specular surfaces, occlusions, and strict computational constraints. Recent zero-shot diffusion models offer fast per-image dense predictions without task-specific retraining, and require fewer labelled datasets than transformer-based predictors while avoiding the rigid capture geometry requirement of classical multi-view stereo. However, their probabilistic inference prevents reliable metric accuracy and temporal consistency across sequential frames and overlapping tiles. We present ZeD-MAP, a cluster-level framework that converts a test-time diffusion depth model into a metrically consistent, SLAM-like mapping pipeline by integrating incremental cluster-based bundle adjustment (BA). Streamed UAV frames are grouped into overlapping clusters; periodic BA produces metrically consistent poses and sparse 3D tie-points, which are reprojected into selected frames and used as metric guidance for diffusion-based depth estimation. Validation on ground-marker flights captured at approximately 50 m altitude (GSD ≈ 0.85 cm/px, ~2,650 m² ground coverage per frame) with the DLR Modular Aerial Camera System (MACS) shows that our method achieves sub-meter accuracy, with approximately 0.87 m error in the horizontal (XY) plane and 0.12 m in the vertical (Z) direction, while maintaining per-image runtimes between 1.47 and 4.91 seconds. Results are subject to minor noise from manual point-cloud annotation. These findings show that BA-based metric guidance provides consistency comparable to classical photogrammetric methods while significantly accelerating processing, enabling real-time 3D map generation.

8:45am - 9:00am

Bundle-Adjusted Initialization for efficient Earth Observation Gaussian Splatting

Jiyong Kim¹, Shuang Song^1,3, Rongjun Qin^1,2

¹Department of Civil, Environmental and Geodetic Engineering, The Ohio State University, USA; ²Department of Electrical and Computer Engineering, The Ohio State University, USA; ³Translational Data Analytics Institute, The Ohio State University, Columbus, USA

Satellite-based 3D reconstruction has gained prominence with the advancement of Earth Observation techniques. Recent work on Earth Observation Gaussian Splatting (EOGS) demonstrated the potential of adapting 3D Gaussian Splatting to satellite imagery, enabling rapid Digital Surface Model (DSM) generation from multiple images using Rational Polynomial Coefficients (RPCs) as camera models. However, EOGS suffers from critical inefficiencies: it randomly initializes a large number of Gaussians in volumetric space and relies on opacity-based pruning, resulting in unstable memory footprints and premature loss of fine details—particularly problematic for low-resolution satellite data.

This work presents an improved Gaussian Splatting framework for satellite imagery that addresses these limitations through two key contributions. First, we introduce bundle-adjusted initialization, which leverages geometrically precise points from the bundle adjustment process as initialization seeds rather than random placement. This approach ensures Gaussians are anchored to accurate geometric positions from the outset, significantly improving convergence stability. Second, we propose densification-included optimization, which strategically adds Gaussians in regions requiring detailed reconstruction while maintaining computational efficiency. This selective densification preserves fine-scale features without the memory overhead of EOGS's initial over-allocation strategy.

Our method achieves faster processing times and maintains more consistent memory usage while producing higher-quality DSMs, particularly in challenging low-resolution scenarios. By combining geometric priors from bundle adjustment with adaptive densification, we enable more practical and efficient satellite-based 3D reconstruction suitable for large-scale Earth observation applications.

9:00am - 9:15am

Evaluating Classical and Deep Keypoint Detectors For SfM Reconstruction in Arctic UAV Imagery

Nicholas Sansoterra¹, Charles Toth¹, Alper Yilmaz¹, Mendoza Lenzano², John Anderson³, William Shuart³

¹The Ohio State University, United States of America; ²Resp. Lab. Geomatica Andino (LAGEAN); ³USACE ERDC GRL Corbin field Station, USA

This contribution presents a comparative evaluation of classical and deep learning–based keypoint detectors for Structure-from-Motion (SfM) reconstruction in challenging Arctic UAV imagery. Snow-covered environments pose difficulties for standard feature matching due to low texture, repetitive patterns, and specular surfaces. While deep keypoint pipelines have shown strong performance on indoor and urban benchmarks, their effectiveness in winter aerial domains remains largely unexplored.

Using multi-view UAV datasets collected across several Alaskan sites, we benchmark three feature-extraction front-ends within a uniform pycolmap-based SfM pipeline: (i) classical SIFT with nearest-neighbor matching; (ii) SuperPoint, a self-supervised convolutional detector–descriptor; and (iii) DISK, a reinforcement-learning–based feature extractor. A simple hybrid approach combining SuperPoint and DISK matches is also tested. All methods share identical geometric verification and bundle-adjustment settings to ensure consistency.

Results show that SIFT remains highly robust on moderately textured Arctic scenes, registering all images and producing the most complete point clouds. SuperPoint and DISK achieve similar reprojection accuracy but struggle with image registration and keypoint coverage on some sequences. Conversely, on extremely low-texture scenes where SIFT fails almost entirely, both deep methods still enable partial reconstructions. Persistent failure cases for all techniques include dense canopy and homogeneous snow.

The study highlights a domain gap between existing deep keypoint models and Arctic aerial imagery, suggesting that domain-specific training and improved spatial keypoint diversity could substantially enhance deep SfM performance in polar regions.

9:15am - 9:30am

Occlusion-Robust SfM in Construction Sites via Geometry-Guided Foreground Segmentation

Changjiang Yin¹, Shaoming Zhang¹, Qin Ye^1,2, Junqi Luo¹

¹College of Surveying and Geo-Informatics, Tongji University, 200092, Shanghai, China; ²Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources, 518000, Shenzhen, China

Accurate 3D reconstruction is a key enabler for construction progress monitoring and digital-twin maintenance. However, in tower-crane imagery, persistent dynamic occluders such as hooks and slings violate the static-scene assumption of conventional Structure-from-Motion (SfM), leading to feature mismatches and degraded reconstruction consistency. In this paper, we present a geometry-guided occlusion-handling pipeline for crane-mounted construction-site SfM. Our approach leverages geometric cues from reprojection errors and depth inconsistencies to identify outlier observations, clusters them into spatially coherent prompts, and uses these to guide a foundation segmentation model (SAM2). The resulting per-frame masks are integrated into mask-constrained SfM optimization, ensuring that only static background contributes to reconstruction. Experiments on three real-world crane-mounted sequences (30m, 45m, and 120m) show consistent reductions in mean reprojection error relative to the unmasked baseline. In the most challenging case, the error decreases from 0.962 to 0.872 pixels (9.4%). Compared with a fixed rectangular masking strategy, the proposed masks yield similar reprojection errors while better preserving valid observations and sparse-point completeness. These results indicate that the proposed framework provides a practical geometry-guided strategy for improving internal reconstruction consistency in crane-mounted construction environments.

9:30am - 9:45am

Geometry-aided Video Panoptic Segmentation

Tuan Nguyen, Max Mehltretter, Franz Rottensteiner

Institute of Photogrammetry and Geoinformation, Leibniz Hannover University, Germany

Video panoptic segmentation (VPS) unifies panoptic segmentation and object tracking by assigning each pixel a semantic class label, or for thing classes, an instance identifier that is consistent across frames. Addressing this task, we propose a novel online VPS method for processing stereoscopic image sequences, which is based on depth-aware kernel-based panoptic segmentation. Specifically, we introduce a geometrical constraint based on predicted bounding boxes into the segmentation of thing instances to overcome the fundamental limitation of kernel-based panoptic segmentation that only appearance information is considered in this step; this regularly leads to panoptic segmentation results in which distinct instances are erroneously merged into one mask. To link detected instances across frames, we propose to extend the commonly employed appearance-based association with a motion-related constraint based on optical flow; this resolves ambiguities in case of instances of similar appearance and, thus, reduces the number of incorrect associations. We experimentally evaluate our method on the publicly available Cityscapes-VPS dataset and compare our results to those of several related methods from the literature. The results demonstrate that our method improves the panoptic quality for a single frame and enhances the instance association across frames, leading to an overall improvement of 3.5% in Video Panoptic Quality on thing classes compared to the employed baseline.

9:45am - 10:00am

Quatifyng altimetric and volumetric changes of the Belvedere glacier (2009–2023) using Pleiades and Pleiades neo data

Francesco Ioli¹, Luca Cerina², Alberto Cina³, Livio Pinto²

¹IRPI - Italian National Research Council, Turin, Italy; ²DICA - Politecnico di Milano, Italy; ³DIATI - Politecnico di Torino, Italy

This study addresses the morphological evolution of the Belvedere Glacier (Monte Rosa, Macugnaga – Italy) over the period 2009–2023, using a photogrammetric methodology based on Pleiades (2017) and Pleiades Neo (2023) Very-High Resolution (VHR) satellite imagery, integrated with historical aerial data from 2009. The main objective was to quantify altimetric and volumetric variations of the glacier, assess the intensity of ice mass loss, and analyze the geomorphological effects of the flood event that occurred on August 27, 2023, which generated a major debris flow.

Raster differencing between Digital Elevation Models (DEMs) revealed a significant lowering of the glacier surface. Between 2009 and 2017, the glacier lost approximately 19.3 × 10⁶ m³ of ice (about 2.4 × 10⁶ m³/year), while in the following period (2017–2023) the loss reached 16.9 × 10⁶ m³, with an increased average annual rate of 2.8 × 10⁶ m³/year. These values confirm an acceleration in the ablation process, consistent with other studies (De Gaetani 2021; Ioli 2023) and with the general retreat trend observed in Alpine glaciers due to climate warming.

8:30am - 10:00am

IvS7B: Innovative Remote Sensing of Wetlands in Canada and Beyond
Location: 716A

8:30am - 8:45am

Automated multi-temporal wetland mapping using Sentinel-2 in the Great Lakes-St Lawrence basin

Kathleen Reid¹, Ben DeVries¹, Genevieve Ali²

¹University of Guelph, Canada; ²McGill University, Canada

Wetland characteristics such as size, inundation permanence and timing, and surface hydrological connectivity substantially impact wetland processes and functions. The ability to monitor these types of wetland characteristics, and changes through time, is dependent on the spatial and temporal resolution of the imagery data used to map wetland locations. Existing inventories of surface water features have largely been limited to permanently open water features such as lakes and ponds larger than 1km2 at monthly or annual intervals. To address these limitations a random forest model was trained to predict sub-pixel water fraction (SWF) in Sentinel-2 imagery at 10m and 20m spatial resolution. This approach facilitated the detection of small surface water features, including water features interspersed with vegetation such as wetlands, at a sub-monthly temporal scale. Overall, in the 10m SWF data, small and narrow water features were detected that were not apparent at the 20m scale, the shape of feature boundaries was more precise, and the continuity of narrow channels was better maintained compared to the 20m SWF data. Improved detection of small features and narrow channels supports improved wetland inventories, particularly regarding the inclusion of small wetlands which are important biogeochemical hotspots, and automated surface water connectivity classification. The temporal resolution of Sentinel-2 facilitates the detection of ephemeral inundation and wetland surface hydrologic connections, as well as monitoring changes in inundation and connectivity through time.

8:45am - 9:00am

High-Resolution Delineation of Coastal Marsh Boundaries: Evaluating Adaptive Thresholding and Machine Learning Approaches

Doris Obodoefuna, Bing Lu, Ali Jamali

Simon Fraser University, Canada

Salt marshes are ecologically significant ecosystems increasingly threatened by sea level rise, climate change, sediment disruption, and human pressure. Accurate delineation of marsh boundaries is essential for monitoring spatial and temporal change and informing conservation strategies. Remote sensing imagery provides an efficient means to map these boundaries over large areas. This study used high-resolution WorldView-3 imagery (0.3 m after pan-sharpening) to evaluate two methodological categories for mapping marsh extent in the Fraser River Delta, Canada: index-based thresholding (Global Otsu and Adaptive Otsu) and machine learning classification (Random Forest, K Nearest Neighbors, and Support Vector Machine). Each method produced binary marsh maps that were converted to boundary vectors and validated against field-surveyed marsh edges using spatial accuracy metrics, including mean distance error and RMSE. Adaptive Otsu achieved the highest accuracy (mean distance 0.42 m; RMSE 0.53 m) and effectively delineated boundaries across contrasting marsh conditions. Global Otsu performed moderately (mean distance 0.47 m; RMSE 0.62 m). Machine learning models showed lower accuracy overall: Random Forest (0.56 m; 0.73 m), K Nearest Neighbors (0.57 m; 0.76 m), and Support Vector Machine (0.71 m; 0.90 m). These findings demonstrate that locally adaptive thresholding outperforms traditional thresholding and machine learning classifiers for fine-scale marsh boundary extraction in heterogeneous coastal environments, offering a practical approach for remote sensing-based marsh monitoring.

9:00am - 9:15am

Comparative Analysis of 5-band and 10-band Multispectral Drone Imagery for Salt Marsh Vegetation Mapping

Gregory S. Norris¹, Brigitte Leblon², Armand LaRocque¹, Myriam A. Barbeau³, Alan R. Hanson⁴

¹Faculty of Forestry and Environmental Management, University of New Brunswick, Fredericton, NB, Canada, E3B 5A3; ²Faculty of Natural Resource Management, Lakehead University, Thunder Bay, ON, Canada, P7B 5E1; ³Department of Biology, University of New Brunswick, Fredericton, NB, Canada, E3B 5A3; ⁴Canadian Wildlife Service, Environment Canada, P.O. Box 6227, Sackville, NB, Canada, E4L 4N1

Multispectral drone sensors enable fine-scale ecological mapping, but added bands can inflate processing costs. We evaluated the MicaSense RedEdge-MX Red and Blue cameras (5 bands each) versus the Dual Camera System (10 bands) for vegetation mapping in two salt marsh sites in Aulac, New Brunswick, Canada (24 classes at the reference site; 15 at the restoration site). Pixel-based Random Forest (RF) classifications were used to compare validation accuracy, variable importance, and processing time for stitching and classification. Five-band maps achieved up to 95% validation accuracy; the 10-band configuration improved accuracy by ≤2%. Band contributions were site dependent: the near-infrared (NIR) band from the Red camera aided classification at the reference site, whereas additional red-edge bands in the Blue/Dual setups improved performance at the restoration site. However, stitching time rose sharply for the Blue and Dual systems, and RF classification time scaled with data volume and class complexity. Overall, the 5-band Red camera provided a cost-effective balance of accuracy and efficiency, offering practical guidance for sensor selection in drone-based salt marsh monitoring.

9:15am - 9:30am

Wetland classification and mapping in the Richelieu river watershed with Sentinel-1 sar and Sentinel-2 multispectral data

Emma Denomme¹, Brigitte Leblon¹, Florin Pendea¹, Armand LaRocque¹, Élias Benaï²

¹Lakehead University, Canada; ²Connexion Nature, Quebec, Canada

Protection of wetlands in Canada is becoming increasingly important as the ecological services they provide become more well understood and simultaneously, as the advance of human settlement and impacts of climate change imperil them. Rapid and effective identification of wetland areas is crucial for this protection. While there is an estimated 1.2 million km2 of wetland area across the country, only a very small portion of this area is currently mapped and classified in accordance with the 5 major classes and 9 subclasses of the Canadian National Wetland Inventory (CNWI). Additionally, the mapping that has already been completed in some areas is of limited accuracy. To increase accuracy and reduce the cost of wetland mapping we use a combination of Sentinel-1 SAR and Sentinel-2 Multispectral images with topographical data (an SAR-derived DEM). Seasonal variations in water level and vegetation were accounted for through the acquisition of imagery from both satellites in May, July, and September. Using the Montérégie region of southern Quebec as a case study we use a combination of the images and DEM metrics for the entire study area to classify landcover into 21 classes with the Random Forest classifier. The initial Random Forest classification produced an overall classification accuracy of 96.3%. Our study shows that classifying Sentinel-1 and 2 images allows us to determine the location and type of wetlands with a high degree of accuracy. This will allow for more efficient conservation strategies in the mapped areas.

9:30am - 9:45am

Monitoring coastal marsh vegetation features using high-resolution remote sensing

Bing Lu, Ali Jamali, Doris Obodoefuna

Simon Fraser University, Canada

Coastal marshes provide critical ecosystem services, including habitat for diverse plant, fish, and bird communities, shoreline protection, and carbon storage. These low-lying ecosystems are increasingly threatened by sea-level rise and human pressures, necessitating systematic monitoring to inform conservation and restoration efforts. Marsh vegetation characteristics, such as species composition and leaf area index (LAI), are key indicators of ecosystem condition, yet traditional field surveys are often labor-intensive, costly, and spatially limited. High-resolution remote sensing offers a powerful alternative, providing extensive spatial coverage and repeated observations for long-term monitoring. In this study, 30 cm WorldView-3 imagery of the Sturgeon Bank Wildlife Management Area in southern British Columbia, Canada, was combined with machine learning (Random Forest) and deep learning models (2D CNN and Vision Transformer, ViT) to map marsh vegetation species and estimate LAI. Extensive field surveys were conducted at selected sampling points along 24 transects to document species composition and measure LAI, which datasets were used for model training and validation. Results show that the ViT model achieved the highest classification performance (Overall Accuracy 94.05%, Kappa 93.44%), outperforming CNN and RF, and was used to generate a species distribution map. Random Forest, while less effective for classification, accurately estimated LAI (R² ~0.85), producing an LAI map that, combined with the species map, revealed species-specific growth patterns. These results demonstrate the effectiveness of high-resolution remote sensing and advanced analytical models for detailed characterization of complex coastal marsh ecosystems, supporting both ecological understanding and local conservation planning.

8:30am - 10:00am

Forum5A: From Science to Action: Advancing Global Agricultural Monitoring for Food Security and Resilience
Location: 716B

8:30am - 10:00am

ThS1: Advancements in Wildfire Science, Management, and Engagement: Integrating Earth Observation Technologies and Collaborative Development
Location: 717A

8:30am - 8:45am

Advancing Canadian wildfire technology through onboard processing and on the ground collaboration

Tim Heydrich¹, Andrew Macdonald¹, Tanya Harrison¹, Becca Bonham-Carter¹, Luis Chavier¹, Alexis Pascual¹, Shannon Paul¹, Saleh Abdelrahman¹, Scott Brass², Cordy Tymstra³, Rob McAlpine⁴, Eric Choi¹, Yolanda Brown¹

¹Mission Control Space Services Inc., Ottawa, ON Canada; ²Eagle Flight Network, Tsuu T'ina Nation, AB Canada; ³Whitebark & Sage Wildfire Science and Management, Edmonton, AB, Canada; ⁴Western University, London, ON, Canada

The intensity, frequency, and duration of wildland fires are growing in Canada and around the world. Timely fire intelligence products from remote sensing platforms can assist fire managers and lead to fewer impacts. New onboard processing techniques using machine learning allow greater levels of analysis and refinement on edge devices like aircraft and satellites, reducing bandwidth and latencies. Our Fire Band Analysis Network approach brings together wildfire science and management experts and academics, an Indigenous owned business that specializes in satellite communication and community outreach, and a Canadian space company with expertise in deploying machine learning models to spacecraft. We show initial results with onboard segmentation models and present a path to prototype this onboard processing model on a cubesat currently in orbit and on drones equipped with infrared sensors, ultimately bringing the derived data products to user communities on the ground.

8:45am - 9:00am

Science Applications and Mission Updates from Canada’s WildFireSat Mission

Morgan A. Crowley¹, Julia Harvie¹, Mark C. de Jong¹, Dan K. Thompson¹, Alan S. Cantin¹, Matthew Ansell¹, Geoffrey Goetz¹, Jacqueline Oliver¹, Segun Oladipo¹, Maurice Lecours¹, Denis Dufour², Colin B. McFayden¹, Joshua M. Johnston¹

¹Natural Resources Canada - Great Lakes Forestry Centre, Canada; ²Canadian Space Agency, Longueuil, Canada

This presentation will provide an overview and update on the WildFireSat mission and its data product algorithm development. Specifically, we will summarize the 2025 Science and Applications Plan and share updates from the Tier 2 stage of products and algorithms. The Tier 2 products that will be shown include the multi-source fire events, time of arrival outputs, and satellite-derived fire behaviour products (e.g., satellite-observed rate and direction of spread, fireline intensity). Ongoing science-development activities include algorithmic validation, uncertainty characterization, and completion of algorithmic theoretical basis documents.

Built through Canadian and international partnerships, WildFireSat will support fire monitoring and management while enabling major scientific advancements for the global fire monitoring community. The scientific applications of WildFireSat are broad, covering all stages of a fire event’s life cycle. By prioritizing the needs of wildfire managers and a broad range of end-users, the WildFireSat mission is a strong model for future satellite missions to integrate user engagement throughout all phases of the mission timeline.

9:00am - 9:15am

Advancing Wildfire Detection and Characterization Using the Normalized Hotspot Indices (NHI)

Giuseppe Mazzeo¹, Nicola Genzano², Alfredo Falconieri¹, Carolina Filizzola¹, Nicola Pergola¹, Francesco Marchese¹

¹National Research Council, Institute of Methodologies of Environmental Analysis, Tito Scalo (Pz),; ²Politecnico Milano, Dept. of Architecture, Built Environment and Construction Engineering (DABC) Milano, Italy

Normalized Hotspot Indices (NHI)—originally developed for volcanic hotspot detection—has emerged as a powerful, flexible tool for the identification and characterization of high-temperature sources using Sentinel-2 MSI and Landsat-8/9 OLI/OLI-2 observations. By exploiting the combined radiance information from the Near Infrared (NIR) and Short-Wave Infrared (SWIR) spectral bands, the NHI algorithm leverages the multispectral capabilities to identify and characterize hotspots of various origins.

A specific configuration of the NHI algorithm has recently been developed for wildfire mapping. This improved version demonstrated strong performance in complex environments such as California, Hawaii, Canada, Greece, Spain, and Australia, significantly improving the delineation of flame fronts and substantially reducing omission and commission errors.

In this work, we present the results of applying NHI-F to various wildfire events, including the wildfires in Canada in May 2025. Our analysis focuses on two main dimensions essential for modern fire science: (i) the spatial characterization of active flaming fronts and burned-area dynamics at 20–30 m scale and (ii) the quantification of fire intensity through Fire Radiative Power (FRP) and SWIR-based radiance metrics.

9:15am - 9:30am

Rapid georeferencing of sensor-limited helicopter imagery for wildfire response

Seongyu Kim¹, Jeonghyo Oh¹, Jangwoo Cheon², Impyeong Lee¹

¹Dept. of Geoinformatics, University of Seoul, Seoul, Republic of Korea; ²Geospatial Team, InnoPAM, Seoul, Republic of Korea

In the initial response to wildfires, securing rapid and accurate geographic information is essential. However, helicopter imagery acquired on-site often lacks precise sensor metadata, such as camera pose and internal parameters, making the application of georeferencing difficult. In particular, obliquely captured wildfire imagery presents additional registration challenges due to severe viewpoint changes, scale variations, and low-texture environments. This study proposes an automated georeferencing pipeline capable of operating under these constraints. The proposed method consists of five stages: preprocessing, image retrieval, feature extraction and matching, Exterior Orientation Parameters (EOP) estimation, and orthomosaic generation. An initial Area of Interest (AOI) is defined using inaccurate initial position data, and the Region of Interest (ROI) within the reference map is obtained through a ResNet50-based image retrieval approach. Subsequently, virtual Ground Control Points (GCPs) are generated through deep learning-based feature matching. Elevation data is then assigned using a Digital Elevation Model (DEM), and EOP are estimated via Perspective-n-Point (PnP) and RANSAC algorithms. Intermediate frames are initialized via interpolation and refined through bundle adjustment to produce the final orthomosaic. Experimental results demonstrated that utilizing SuperGlue and LightGlue complementarily increased the number of successfully georeferenced intervals from 5 to 9. Furthermore, a minimum RMSE of 28.30 m was achieved in the most accurate interval. This method proves that by automating the feature-based georeferencing process, practical geographic information can be rapidly provided for initial disaster response, even in sensor-limited environments.

9:30am - 9:45am

Characterizing Wildland-Urban Interface Fire Typology and Climate Associations across California, USA

Huiyi Su¹, Pingting Cao², Ximei Wu², Lu Liang³, Qingqing Xu⁴, Yang Ju⁵, Jiaojiao Diao⁶, Wenchao Ma^1,2, Qin Ma^1,2

¹State Key Laboratory of Climate System Prediction and Risk Management, Nanjing, China, 210023.; ²School of Geography, Nanjing Normal University, Nanjing, China, 210023.; ³Department of Landscape Architecture and Environmental Planning, University of California, Berkeley, USA, 94720; ⁴Sierra Nevada Research Institute, University of California, Merced, USA, 95340.; ⁵School of Geography and Ocean Science, Nanjing University, Nanjing, China, 210023.; ⁶Department of Integrative Biology, University of Guelph, Ontario, Canada N1G2W1

California experiences globally intense wildfire activity with accelerating human casualties and economic losses. Existing research quantifies anthropogenic and climatic contributions to wildland-urban interface (WUI) fires at aggregate levels, yet overlooks heterogeneity arising from differences in ignition locations and dominant spread areas. Using multi-source data from California (2002–2023), we classified WUI fires into four behavioral modes based on ignition site and primary spread zone: I-I (WUI ignition, WUI spread), I-W (WUI ignition, wildland spread), W-I (wildland ignition, WUI spread), and W-W (wildland ignition, wildland spread). We systematically analyzed size characteristics, inter-annual trends, fuel composition, and climate sensitivity across modes. Key findings include: (1) WUI fires accounted for 95.6% of total burned area from large fires, with only 12.2% of burned area within the WUI; both total and mean burned area increased significantly over two decades. (2) Lightning-caused WUI fires showed significantly delayed ignition dates, whereas human-caused fires occurred significantly earlier, with elevated fire frequency observed during Independence Day, Labor Day, and Thanksgiving. (3) I-I fires were predominantly driven by anthropogenic factors with the highest proportion of shrub/grass fuel and the smallest mean size; W-W and I-W fires exhibited significant climate sensitivity, with I-W showing a higher rate of increase than W-W over the study period. These findings reveal differentiated driving mechanisms across WUI fire behavioral types, providing scientific evidence for targeted fire management strategies.

10:00am - 10:30am

Morning Coffee Break
Location: Exhibition Hall "E"

10:30am - 12:00pm

Plenary Session 5
Location: Exhibition Hall "G"

Keynote 1: Dr. Minda Suchan
Awards Ceremony:

The Li Deren Award
The Otto von Gruber Award

Keynote 2: Professor Michael Daly

12:00pm - 1:30pm

Lunch
Location: Exhibition Hall "E"

1:30pm - 3:00pm

WG III/1F: Remote Sensing Data Processing and Understanding
Location: 713A

1:30pm - 1:45pm

From Image to Perception: Scene-Graph-Driven Modeling of Human-Scale Urban Experience with Street-view Images

Haipeng Yang, Xian Guo

Beijing University of Civil Engineering and Architecture, China, People's Republic of

This study examines how street-view scenes relate to urban perception using a scene-graph-driven modeling method. Each image is parsed into subject–predicate–object triplets; entity appearance from a CNN backbone and relation semantics from a Transformer detector are fused at node level via a learnable gate. A relation-aware graph neural network performs message passing and attentive readout to predict six perception dimensions (beautiful, boring, depressing, lively, safe, wealthy). Taking Place Pulse 2.0 dataset as benchmark, we convert pairwise votes to binary labels per dimension with standard train/validation/test splits. Experiments compare the graph approach against CNN+SVM and Transformer+SVM baselines under identical protocols. Results show consistently higher accuracy across all six dimensions, with notable gains for beautiful and wealthy. Gradient and integrated-gradient analyses offer node- and edge-level attributions, highlighting elements such as trees, facades, and overhead wires. The method balances accuracy with clarity, and the results point to practical cues that can support human-centered urban design.

1:45pm - 2:00pm

Real-Time Road Condition Detection and Mapping Using YOLOv11 and Built-In Car Dashcam

Harjot Josan¹, Frank Zhang², Baoxin Hu³

¹University of the Fraser Valley (UFV), Canada; ²University of the Fraser Valley (UFV), Canada; ³Dept. of Earth and Space Science and Engineering, York University, Toronto, Canada

Road surface conditions decline due to heavy traffic volumes, severe weather, and recurring utility works, yet still, many road agencies still rely on manual windshield surveys and semi-automated inspections. Not only are these methods time-consuming, but also difficult to scale and labour-intensive. With the help of recent advances in deep learning and the widespread availability of built-in vehicle dashcams, they offer new opportunities for low-cost, automated pavement assessments. This contribution presents a mobile, dashcam-based framework for detecting road-surface defects using the latest YOLOv11, which is combined with geolocation tagging for spatial visualization.

To test out our YOLOv11 training model, we conducted the initial dataset at the University of the Fraser Valley campus and manually annotated it to identify crack fillings, crosswalk markings, speed bumps, lane markings, and other surface conditions. This was just a prototype, which would later be trained to detect all road conditions, such as gravel, potholes, and uneven roads, as well. To address variations in lighting and motion, augmentation techniques were applied. YOLOv11 acquired a mean average precision above 90% across all tested categories.

This prototype demonstrates a practical, low-cost approach for real-time pavement monitoring. Future work includes expanding data collection, developing an operational dashboard for road authorities, having exact GPS coordinates pinned on maps with damaged road images, and evaluating model performance across different data sources, including models trained through Google Images. By producing actionable geospatial information, this system supports more efficient maintenance workflows and offers a scalable pathway for municipalities seeking to modernize road-condition assessment.

2:00pm - 2:15pm

Towards Global Interpretability: Evaluating XAI Metrics in Building Footprint Extraction

Elif Ozlem Yilmaz, Taskin Kavzoglu

Gebze Technical University, Turkiye

Global population is projected to increase by about 70% by 2050, with a growing proportion of people living in urban areas. This trend highlights the importance of accurately assessing urban expansion. Automatic building detection from remotely sensed imagery using deep learning (DL) has demonstrated considerable potential for applications, including sustainable urban planning and infrastructure monitoring. However, the inherent black-box nature of DL models limits their transparency and reduces trust in model-driven decisions. Although various Explainable Artificial Intelligence (XAI) approaches have been proposed to highlight image regions influencing model predictions, qualitative visual inspection alone is insufficient for reliably evaluating the credibility of these explanations. This study evaluates several XAI techniques for building footprint extraction using a U-Net model trained on a refined Massachusetts Buildings Dataset. The segmentation model achieved precision, recall, F1-score, IoU, and overall accuracy values of 89.68%, 85.69%, 87.53%, 79.03%, and 94.35%, respectively. To investigate the model’s decision-making process, three explanation methods, namely Saliency, GradientSHAP, and GuidedGradCAM, were applied. The quality of the generated explanations was then quantitatively assessed using 16 evaluation metrics. Beyond single-image analysis, a dataset-level evaluation was conducted using 547 image patches containing building coverage greater than 20%. The results indicate that GuidedGradCAM produces more consistent and reliable explanations. Furthermore, dataset-level analysis using dense-building samples provides a statistically more robust representation of overall model behaviour compared to evaluations based on individual images. These findings highlight the importance of quantitative assessment in validating the interpretability of DL models for building footprint extraction.

2:15pm - 2:30pm

MaskRoof: A deep Learning Framework and Benchmark Dataset for fine-grained urban Rooftop Utilization and potential Analysis

Jinfeng Xie¹, Haojie Yang², Lingshuang Dong³, Anthony Yeh¹, Yi Zhang²

¹The University of Hong Kong, Hong Kong S.A.R. (China); ²Institute of Future Human Habitats, Tsinghua Shenzhen International Graduate School; ³Huawei Technologies Co., Ltd., Dongguan, Guangdong Province, China

Urban rooftops represent a critical vertical resource for sustainable development, yet comprehensive assessment of their utilization patterns and available capacity remains constrained by inadequate datasets and limited algorithmic capabilities. This study introduces the Urban Rooftop Utilization Dataset (URUD), the first multi-city, pixel-level semantic segmentation dataset encompassing 1,560 high-resolution satellite images from four Chinese cities. URUD establishes eight semantic categories including a novel "available area" class to address ambiguous regions that existing classification schemes fail to capture. The study further proposes MaskRoof, a transformer-based deep learning framework specifically designed for fine-grained rooftop analysis. The model integrates two task-specific modules, Hierarchical Zoom-in Attention (HZA) and Prior-Guided Cross-Attention (PGCA), to address challenges of small-scale target detection and class imbalance. Experimental results demonstrate that MaskRoof achieves superior performance with 94.46% accuracy and 47.29% mIoU, outperforming existing segmentation architectures. Application to Shanghai's outer ring area reveals that 60.74% of rooftop space remains available for utilization, with significant spatial heterogeneity across building types. Industrial and warehouse structures retain substantially greater unutilized areas compared to office and residential buildings. These findings provide quantitative evidence for differentiated urban planning strategies and demonstrate the framework's capability for large-scale rooftop potential assessment in complex urban environments.

2:30pm - 2:45pm

A comparison of CNN, Transformer, and open-vocabulary architectures for real-time photovoltaic defect detection using UAV thermal imagery.

Aissam Salah¹, Mouad Jabrane², Imane Sebari¹

¹Department of Photogrammetry and Cartography, School of Geomatics and Surveying Engineering, IAV Hassan II, Rabat, Morocco; ²Research Unit of Geospatial Technologies for a Smart Decision, IAV Hassan II, Rabat 10101, Morocco

Real-time defect detection in solar farms is critical for profitability and safety. This paper compares state-of-the-art (SOTA) object detection architectures for deployment on edge computing platforms for the purpose of thermal PV defect detection using UAV imagery. We systematically evaluated Closed-Set (YOLOv10, YOLOv12, RT-DETR, RF-DETR) and Open-Vocabulary (YOLO-World, OWL-ViT) models on a public thermal dataset. Our results highlight two leading candidates. The transformer-based RF-DETR sets a new accuracy record at 82.6% mAP@0.50, driven by its self-supervised backbone, but its inference speed is low (12.6 FPS). Conversely, the CNN-based YOLO-World integrates language semantics to reach a competitive 78.1% mAP@0.50 while operating at a real-time speed of 31.3 FPS. We conclude that both RF-DETR and YOLO-World are promising for embedded thermal fault detection. The final selection will depend on on-platform inference performance.

1:30pm - 3:00pm

WG III/3C: Active Microwave Remote Sensing
Location: 713B

1:30pm - 1:45pm

On the Suitability of Distributed Scatterers for Bridge Monitoring in very high Resolution SAR Data

Stefan Scheiblauer, Francescopaolo Sica, Michael Schmitt

University of the Bundeswehr Munich, Germany

This study investigates the suitability of Distributed Scatterers (DS) for satellite-based bridge monitoring in very high-resolution (VHR) Synthetic Aperture Radar (SAR) data. While Persistent Scatterer Interferometry (PSI) relies on isolated, temporally stable reflectors, the DS concept extends the analysis to statistically homogeneous areas. In bridge monitoring, however, elevated and narrow structures challenge the assumption of spatial homogeneity due to signal contributions from both the bridge deck and the underlying terrain in side-looking SAR geometry.

Using 23 TerraSAR-X Staring Spotlight acquisitions (September 2022 - September 2023) over two highway bridges near Regensburg, Germany, the study analyses the effects of layover and partial pixel mixing on height correction and deformation estimation. The DS identification is based on statistical homogeneity testing and covariance estimation, with coherence thresholds applied to ensure phase stability.

Results demonstrate that bridge decks exhibit variable coherence depending on surface roughness and illumination geometry. In some cases, overlayed signals from bridge and ground surfaces produce erroneous elevation and deformation values. The analysis highlights the need for careful interpretation of DS results in VHR data and provides insights into the limitations and potential of DS-based InSAR for linear infrastructure monitoring.

1:45pm - 2:00pm

Modeling tunnel excavation in Taipei, Taiwan, using a Gaussian trough and single-look Sentinel-1 InSAR time series

Erik Rivas¹, Mahmud Haghighi¹, Mahdi Motagh^1,2

¹Leibniz Hannover University, Germany; ²Helmholtz Centre Potsdam–GFZ German Research Centre for Geosciences, Potsdam, Germany

Taipei has experienced an important urban development in the recent years with the expansion of its Taipei Mass Rapid system (MRT). This expansion is currently taking place in the Tamsui-Xinyi Line (Red Line) with one new metro station, the Guangci Fengtian Temple Station. This station connects the east part of the Xinyi district as the continuation of the Xiangshan Station. This project extension has been claimed to be one of the most difficult ones in the metro line development due to its complex geological setting going from very soft sediments to hard rock in a few meters. We have employed Sentinel-1 SAR images to measure the tunnel excavation settlement utilizing ascending and descending tracks and estimating vertical and horizontal time series deformations.

2:00pm - 2:15pm

Stereo SAR for Building Imaging

Yun Lin, Yanping Wang, Junqi Tian, Jiankun Dong, Wenjie Shen, Wen Jiang

North China University of Technology, China

Structural health monitoring is essential for building safety. While SAR provides all-weather, non-contact imaging, it is often affected by geometric distortions like layover and foreshortening, making it difficult to extract accurate 3D structural information from complex targets like buildings. Inspired by stereo vision, we propose a stereo SAR mode that acquires two images via a single rotation. By transforming Cartesian to polar coordinates, the disparity is constrained to the angular direction, significantly simplifying the matching process. We derive the nonlinear relationship between height and disparity and apply Newton’s iterative method for accurate 3D reconstruction. Real data collected by a millimetre-wave radar system validate the effectiveness of the proposed approach.

2:15pm - 2:30pm

Towards Country-Wide LoD1 City Model Reconstruction of from TanDEM-X Intensity Images

Michael Schmitt, Michael Recla, Christopher Ummerle, Islam Mansour

University of the Bundeswehr Munich, Germany

3D city models have become an important piece of geoinformation. They are available in different Levels of Detail (LoD), which determine the amount of complexity provided in the model. LoD1 city models represent simple prismatic building volumes and are typically produced by means of remote sensing. In this article, we investigate the possibility for country-wide reconstruction of LoD1 city models from TanDEM-X intensity images by utilizing deep learning-based single-image height and building footprint reconstruction. As study area, we use the land surface of the country of Denmark. Our results show the general potential of this AI-based approach of country-wide city model reconstruction, which can serve as a data-efficient pipeline that is particularly well-suited in time-critical scenarios or for the exploitation of archive imagery of satellite missions with global data coverage.

2:30pm - 2:45pm

Deformation Monitoring and Analysis of Railway Bridges Integrating Time-Series InSAR and Finite-Element Modeling

Xiaoqiong Qin^1,2, Qingbo Ma^1,2, Chisheng Wang³, Linfu Xie³, Xiangsheng Chen^1,2

¹State Key Laboratory of Intelligent Geotechnics and Tunnelling, Shenzhen University, Shenzhen, 518060, China; ²School of Civil and Traffic Engineering & Underground Polis Academy, Shenzhen University, Shenzhen, 518060, China; ³Smart City Research Institute & School of Architecture and Urban Planning, Shenzhen University, 518060, China

Interferometric Synthetic Aperture Radar (InSAR) is widely used to measure millimetre-level deformation of bridges and other struc-tures. However, retrieving multi-dimensional displacements of a bridge and integrating these measurements with structural stress for coupled analysis remains a major challenge. To tackle this issue, we propose an integrated framework and demonstrate its application on the Hutiaohe extra-large bridge in Guizhou Province. First, a two-dimensional E-PS-InSAR time-series processing chain is de-veloped to derive the bridge’s bi-directional deformation. Next, structural temperatures are obtained through the ANUSPLIN interpo-lation scheme, allowing the accurate isolation of the thermal response. Finally, the finite-element model (FEM) of the bridge is con-structed to interpret the observed deformation and thermal signatures within the structural context. The results show that, compared to conventional InSAR approaches, the proposed framework yields a richer set of insights by conducting a joint analysis mul-ti-dimensional deformation, structural behavior and thermal effects.

2:45pm - 3:00pm

A New SAR Interferometry Approach to Linear Infrastructure Monitoring using Spatial Displacement Gradients

Andreas Piter¹, Mahmud Haghshenas Haghighi¹, Mahdi Motagh^2,1

¹Institute of Photogrammetry and GeoInformation, Germany; ²GFZ Helmholtz Centre for Geosciences, Germany

Monitoring linear infrastructures such as railways and highways with Multitemporal Interferometric Synthetic Aperture Radar (MTInSAR) requires to identify spatial displacement gradients to assess and mitigate the related hazard. During conventional MTInSAR, the majority of the processed pixels are not directly relevant to the linear infrastructure. However, these pixels are required to aid the phase unwrapping and to remove the atmospheric phase contribution. To overcome this limitation, we propose a new method that directly estimates the spatial gradient from the Synthetic Aperture Radar (SAR) images solely along the linear infrastructure avoiding costly phase unwrapping, error propagation from pixels outside the linear infrastructure and atmospheric filtering. Our experiments based on high and medium resolution images from TerraSAR-X and Sentinel-1, respectively, demonstrate that the estimated spatial gradients agree well with the MTInSAR results with a maximum Root Mean Square Error (RMSE) of 3.5 mm/year. Applying our method on Sentinel-1 images enables computationally efficient monitoring of linear infrastructures exploiting the wide area coverage and availability of the SAR images.

1:30pm - 3:00pm

WG III/7B: Remote Sensing of the Hydrosphere and Cryosphere
Location: 714A

1:30pm - 1:45pm

Deep learning–based enhancement of feature tracking for sea ice drift estimation

Ki-Yeong Mun, Junhwa Chi

Division of Data Information Sciences, Pukyong National University, Busan, Republic of Korea

This study proposes a deep learning–based enhancement of feature tracking to improve Sea Ice Drift (SID) estimation using Sentinel-1 Synthetic Aperture Radar (SAR) imagery. Traditional computer vision methods, such as Oriented FAST and Rotated BRIEF (ORB), are commonly used for generating initial drift vectors within the Nansen Environmental and Remote Sensing Center (NERSC) workflow; however, their performance declines under rotational variations, low-texture surfaces, and the fluid-like, short-term dynamics of sea ice. To address these limitations, this study evaluates two deep learning–based methods—SuperGlue and the Local Feature Transformer (LoFTR)—to enhance the robustness and accuracy of feature matching between consecutive SAR scenes. Furthermore, to effectively utilize multi-polarization information, a multi-polarization strategy was applied across both the feature tracking and pattern matching stages. Performance was evaluated using in-situ drift observations from Ice-Tethered Profiler (ITP) buoys, with feature matching assessed by the number of matched keypoints and estimated SID vectors, and drift accuracy evaluated using RMSE and the coefficient of determination (R²). Experimental results demonstrate that polarization integration significantly improves performance, reducing RMSE and increasing R². Among the methods, LoFTR achieved the best performance, followed by SuperGlue and ORB, with notable reductions in speed and directional errors. Overall, the findings demonstrate that deep learning–based methods substantially improve the stability and accuracy of SAR-derived SID estimation. These methods enable more stable and reliable performance in the Arctic environment, which is characterized by sea ice reduction, strong seasonal variability, and highly dynamic drift patterns.

1:45pm - 2:00pm

Implementation and validation of a new weather filter for reducing weather effect in the ASMR2 sea ice concentration data

Kohei Cho¹, Josefino Comiso², Misako Kachi Kachi³, Rigen Shimada³

¹Tokai University, Japan; ²NASA; ³JAXA

global sea ice distributions on a daily basis. Ice concentration (IC) is one of the most important sea ice parameters derived from brightness temperatures measured by the microwave radiometers. However, even at microwave frequencies, the brightness

temperature data over open ocean areas are affected by the presence of adverse weather conditions, including elevated atmospheric water vapor, cloud liquid water, and abnormal surface roughness conditions. The net result is the retrieval of moderate sea ice concentration values in the open ocean where sea ice is not expected. The current sea ice algorithms make use of what is called a “weather filter” to correct such false retrieval of sea ice, but significant areas in the ice-free water that have the false ice cover remain in some areas. In this study, an improved weather filter, namely the Advanced Weather Filter (AWF), that minimizes, if not eliminates, this problem, developed by Cho et al. (2023), was implemented to produce JAXA/AMSR2 sea ice concentration products

of the Arctic for verification. The AWF was validated and shown to be very effective in selected study regions in the Arctic during the summer time from 30 June to 3 July 2014 and the winter time from 15 December to 18 December 2014, thereby supporting the

integration of the AWF into the standard AMSR2 sea ice concentration product. The AWF should be broadly applicable and can be implemented in other satellite passive microwave ice concentration datasets.

2:00pm - 2:15pm

Capturing the Soil Zero-Curtain Effect from Multi-Frequency Passive Microwave Retrievals

Hesam Salmabadi^1,2, Aaron Berg³, Alexandre Roy^1,2

¹Dep. of Environmental Sciences, University of Quebec in Trois-Rivieres, QC, Canada; ²Centre d'Études Nordiques, Université Laval, QC, Canada; ³Dep. of Geography, Environment & Geomatics, University of Guelph, ON, Canada

Seasonal soil freeze-thaw (FT) transitions govern critical hydrological and biogeochemical processes across northern landscapes. The physical state of freezing soil exists on a thermodynamic continuum influenced by the zero-curtain effect, a period where latent heat exchange stabilizes temperatures near 0°C. Despite this, operational passive microwave algorithms, such as FT-SMAP and FT-ESDR, enforce discrete binary classifications that mask this biogeochemically active partially frozen period. To address this limitation, this study establishes a probabilistic, non-binary FT detection framework using a parsimonious L1-regularized logistic regression model driven by multi-frequency passive microwave observations. To isolate dynamic phase changes from static landscape noise, the model integrates two locally standardized indices: the Normalized Polarization Ratio (NPR) from SMAP L-band to track soil liquid water permittivity, and the Normalized Difference V-Pol (NDV) from AMSR2 Ka/Ku-bands to capture volume scattering within canopies and snowpack. The model was trained using topsoil temperatures from North American networks, employing a probabilistic Soil Freezing Characteristic Curve to isolate high-confidence training end-members and a density-based spatial clustering approach to prevent spatial data leakage. The logistic framework demonstrated robust geographic generalizability, achieving an F1-score of 0.957 in Tundra environments. Crucially, it significantly mitigated false alarms in complex forested canopies, suppressing false positive rates in Mixed Forests to 12.6%, compared to 44.3% for FT-ESDR and 33.5% for FT-SMAP. By mathematically isolating the zero-curtain transition, this scalable approach provides the continuous baseline data necessary for advancing seasonal carbon respiration modeling in rapidly warming northern environments.

2:15pm - 2:30pm

Passive L-Band Surface State Retrievals in the Arctic Winter: L-Band Radiometer Development and Calibration

Kristofer Mäkinen¹, Alexandre Langlois^1,2, Alexandre Roy^2,3

¹Université de Sherbrooke, Canada; ²Centre d’études nordiques; ³Université du Québec à Trois-Rivières

This work presents instrument development and calibration of a terrestrial L-band radiometer designed to support satellite retrieval validation and radiation transfer model parameter refinement in the Arctic. As satellite-based retrievals of key geophysical variables such as snow density and ground temperature continue to improve, their accuracy remains limited by scarce ground-truth data. Our refined radiometer addresses this gap by providing targeted, high-resolution terrestrial measurements capable of characterizing surface heterogeneity across Arctic land and water environments.

The instrument was redesigned from an existing model, and was improved based on lessons from earlier field campaigns, focusing on robustness, simplified operation, and enhanced radio-frequency isolation. Calibration procedure focused on measuring the night sky over several nights in cold temperatures to accurately characterize the operation in very cold conditions.

Initial calibration experiments show stable performance and improved consistency compared to earlier instrument versions. While some challenges remain, the system is expected to be field ready and able to capture brightness temperatures accurately over long time periods and varying conditions.

Future campaigns will extend these measurements to lake and sea ice, supported by ground-penetrating radar enabled surface roughness characterization. These efforts will ultimately contribute to improved radiative transfer modeling and more accurate satellite retrievals of key Arctic geophysical variables.

2:30pm - 2:45pm

Self-Modulation Aggregation within Dense Skip Connections for Mapping of Retrogressive Thaw Slumps

Yi Yuan^1,2, Guiyun Zhou¹, Zhishuai Zheng¹, Minghui Chang^1,2, Andrea Maldonado², Binbin He¹, Martin Werner²

¹School of Resources and Environment, University of Electronic Science and Technology of China, China; ²Big Geospatial Data Management, Technical University of Munich, Germany

Accurate mapping of retrogressive thaw slumps (RTSs) in permafrost regions remains challenging due to their irregular morphology, blurred boundaries, and strong spatial correlation. This paper proposes a lightweight multi-level self-modulation (MLSM) module embedded into the UNet++ backbone to enhance non-local feature modeling for high-resolution image segmentation. The overall framework is built upon a UNet++ backbone with dense skip connections, where the proposed MLSM module adaptively fuses multi-scale contextual information to enhance feature coherence across spatially correlated regions. By incorporating low-rank regularization through a soft nuclear norm, MLSM dynamically modulates feature responses according to structural variations, allowing attention to adapt to spatially complex RTS regions. The integration of depth-wise convolution and channel recalibration further refines feature aggregation efficiency. Experimental evaluations on Maxar dataset demonstrate that the proposed method achieves superior segmentation accuracy and smoother boundary delineation compared with existing models. The proposed framework provides a robust and computationally efficient approach for RTS mapping, contributing to improved understanding of local geomorphic patterns.

2:45pm - 3:00pm

Snow Persistence Dynamics in the NWH Himalaya (2000–2024): MODIS-Based Trend Analysis

Sakshi Tripathi^1,2, Praveen.K Thakur¹, Vaibhav Garg¹, Ashutosh Sharma², R.P Singh¹

¹Indian Institute of Remote Sensing . IIRS-ISRO, Dehradun; ²Indian Institute of Technology Roorkee, India

This study investigates long-term snow persistence dynamics across the North-Western Himalaya (NWH) spanning 2000–2024 using MODIS Terra and Aqua daily snow products. Snow persistence—defined as the number of days a location remains snow-covered—is a crucial indicator of climatic variability and hydrological behaviour in high-mountain environments. Annual snow persistence was derived from daily CGF_NDSI_Snow_Cover layers after mosaicking, clipping to the study region, reclassifying snow pixels, and summing snow days at 500 m resolution. Pixel-wise trend analysis was conducted using the Mann–Kendall test, supported by Kendall’s Tau, p-values, and variability metrics.

The results show clear spatial contrasts: high-elevation zones (>4000 m) maintain persistent snow cover (>300 days/year), while mid-altitude regions (1500–3000 m) exhibit moderate persistence but significant negative trends. Low-elevation areas display minimal snow longevity and rapid decline over the 25-year period. The region recorded maximum snow-covered area in 2019 and a notably reduced extent in 2016. Approximately 29% of the NWH shows statistically significant trends, predominantly negative, with an overall mean decline of −3.2 snow days per year. Variability is highest in mid-elevation transition zones, which appear particularly sensitive to warming.These findings highlight ongoing reductions in seasonal snow cover in the NWH and their implications for glacier mass balance, water resource availability, and hydrological timing. The study underscores the value of long-term satellite-based monitoring to understand cryospheric response under changing climate conditions.

1:30pm - 3:00pm

ThS3: Spatial Intelligence in the Wild
Location: 714B

1:30pm - 1:45pm

Proactive cognitive map for embodied spatial reasoning

Wenbin Wang, Zhifeng Gu, Guangchi Fang, Bing Wang

The Hong Kong Polytechnic University

This work addresses the emerging challenge of achieving proactive spatial cognition for embodied and spatial AI systems operating in dynamic real-world environments. Conventional mapping and reasoning approaches are largely passive and task-dependent, limiting their ability to build persistent understanding beyond immediate goals. We introduce the Proactive Cognitive Map (PCM), a unified framework that enables agents to autonomously construct, verify, and refine their spatial knowledge through continual perception, self-questioning, and mental simulation. PCM integrates a grid-based perceptual map with a semantic, object-centric memory, forming an explicit and interpretable representation of the environment. A self-questioning module identifies uncertain or ambiguous regions and generates targeted queries, while a simulation module emulates human imagination to perform counterfactual reasoning and lightweight geometric self-verification across time and viewpoints.

We evaluate PCM across episodic-memory embodied QA tasks and the long-horizon, multi-task benchmarks, GOAT-Bench, covering episodic reasoning, continual understanding, and cross-task generalization. Results show that PCM’s self-driven graph construction and proactive refinement outperform goal-specific exploration methods.

By transforming mapping from static perception into a continual cognitive process of questioning, imagining, and verifying, this study provides a step toward lifelong, interpretable, and self-improving spatial intelligence.

1:45pm - 2:00pm

Automatic Update and 3D Gaussian Reconstruction of Building Facade using Multi-Sensor Unmanned Aerial and Ground Vehicles: An Air-Ground Fusion Approach

Jing Li^1,2, Shicheng Xu¹, Banghui Yang^1,3, Shuhao Sun⁴

¹Aerospace Information Research Institute，Chinese Academy of Sciences, Macau S.A.R. (China); ²International Research Center of Big Data for Sustainable Development Goals, China; ³University of Chinese Academy of Sciences, Beijing 101408, China; ⁴Tianjin Chengjian University, Tianjin, China

As a spatial digital foundation for digital twins and smart cities, the timeliness and accuracy of realistic 3D models are of critical importance. Intelligent and automated data acquisition and update workflows form the core infrastructure that sustains this digital foundation. Current modeling techniques relying on a single data source face inherent limitations: UAV(Unmanned aerial vehicle)-based oblique photogrammetry struggles to capture lower facade details, often leading to geometric distortions and blurred textures, while conventional terrestrial surveying methods suffer from low efficiency and limited automation as well as intelligence. Moreover, the substantial viewpoint differences between aerial and ground data hinder effective fusion. However, recent technological advances in 3D Gaussian Splatting (3DGS), large vision model, multi-sensor SLAM and robotic systems, open up new opportunities to significantly improve the fidelity, efficiency, completeness and automation of 3D reconstruction through the cooperation of UGVs and UAVs.To address the current challenges from 3D reconstruction, this study proposes a novel framework which seamlessly integrates autonomous unmanned systems, state-of-the-art large visual models, multi-sensor SLAM (simultaneous localization and mapping) and cutting-edge 3D Gaussian rendering technology. The framework realizes an integrated workflow for automatic updating building facade and high-fidelity 3D GS rendering using air to ground fusion algorithms with autonomous systems. The primary focus is to advance the automation and intelligence of building 3D reconstruction, thereby enabling efficient updates of urban 3D models.

2:00pm - 2:15pm

Monocular 3D Reconstruction for Martian Terrain Based on Diffusion Model

Jiarui Cao¹, Rong Huang^1,2, Yusheng Xu^1,2, Zhen Ye^1,2, Xiaohua Tong^1,2

¹College of Surveying and Geoinformatics, Tongji University, Shanghai, China; ²The Shanghai Key Laboratory of Space Mapping and Remote Sensing for Planetary Exploration, Shanghai, China

High-precision digital terrain models (DTMs) are important for Mars explorations and research. However, traditional terrain reconstruction methods suffer from limitations in coverage and resolution. To enhance the model's ability to recover fine-grained topography, we present a diffusion-based monocular terrain reconstruction method, which progressively recovers Martian terrains from single-view high-resolution optical images. We employed a multi-scale U-Net denoising network with attention mechanisms and introduced an additional end-to-end depth constraint. To improve terrain reconstruction efficiency, we implemented a diffusion model in the latent space and adopted a skipping sampling mechanism. We employed the proposed method to reconstruct terrain in different regions. Experimental results demonstrate that the reconstructed terrain achieves an accuracy of 2 m. Furthermore, compared to photogrammetric terrain, the shaded relief generated by our method exhibits greater similarity to the input imagery.

2:15pm - 2:30pm

GESM: GMM-based Efficient Sonar Mapping

Kaicheng Zhang, Zhifeng Gu, Gaungchi Fang, Bing Wang

The Hong Kong Polytechnic University, Hong Kong S.A.R. (China)

GESM is a Gaussian-mixture sonar mapping pipeline that converts 2D imaging sonar into a continuous 3D probabilistic map for navigation. We estimate posterior occupancy with Gamma-CFAR, cluster occupied and free space along beams, encode them with weighted EM/MPPCA and moment-matched Gaussians, and incrementally merge local mixtures into a globally consistent map. Loop closure is handled by in-place edits of mixture parameters. On simulation and pool/harbour data, GESM yields dense, navigation-ready structure and free water while reducing map memory by ~99% compared with a comparable voxel grid.

2:30pm - 2:45pm

An Analysis of the Impact of Geospatial Data Sources on Mesh-Based Localisation Performance

Francesco Vultaggio^1,2, Phillipp Fanta-Jende¹, Alexander Kern², Markus Gerke²

¹Austrian Institute of Technology, Austria; ²Technical University of Braunschweig, Germany

This paper investigates how the provenance and resolution of geospatial data used to construct mesh maps affect the accuracy and robustness of mesh-based visual localisation. Mesh-based approaches offer significant advantages over traditional pipelines reliant on Structure from Motion (SfM) models, including the ability to scale to city-sized scenes---by leveraging large-scale data sources such as national mapping databases--- and on-demand generation of arbitrary synthetic views. While prior work has focused on algorithmic improvements to mesh-based localisation, none has systematically analysed how different input data affect localisation outcomes. In this work, we evaluate three meshes---derived from aerial oblique imagery, combined aerial and ground mobile mapping data, and close-range ground imagery---across the egenioussBench Extended and House of Science query sets and four image matchers. We show that mesh quality is the dominant factor governing localisation performance. In the House of Science experiments, aerial meshes lack the resolution required to resolve façade detail, causing near-total localisation failure regardless of matcher. In the egenioussBench Extended experiments, augmenting an aerial mesh with ground data yields consistent but less dramatic improvements. We further introduce the Perceptual Detail Score (PDS), a viewing-condition-aware metric that proves to be a strong predictor of downstream pose accuracy across all experimental configurations.

2:45pm - 3:00pm

JCFI: a Composite Index for RMLS-based Shield Tunnel Segment Joint Recognition

Liying Wang¹, Ze You¹, Yiwei Yu¹, Yong Feng², Chunxi Xie³

¹School of Geomatics, Liaoning Technical University, Fuxin, China; ²Division of Geoinformation Management, Department of Natural Resources of Liaoning Province, Shenyang, China; ³Institute of Surveying, Mapping and Geographic Information, China Railway Design Group Co., LTD., Tianjin, China

The accurate recognition of segment joints serves as a critical step for capturing joint anomaly information, evaluating segment assembly quality, diagnosing structural health status, and determining the loosening of connecting bolts. It holds significant importance for the operation and maintenance of shield tunnels. However, existing studies on joint recognition based on Rail-borne Mobile Laser Scanning (RMLS) suffers from insufficient comprehensiveness in feature representation, leading to notably poor accuracy and robustness under complex scenarios such as noise interference, data loss due to object occlusion, and uneven point cloud density. To address this issue, this study proposes a shield tunnel segment joint recognition method based on the Joint Composite

Feature Index (JCFI). The proposed method first employs a cross-sectional ellipse fitting approach to filter out obvious non-lining points. Subsequently, a composite index JCFI, which integrates curvature, left-right density ratio, and relative depth, is designed to

quantitatively characterize the feature differences of segment joints. Finally, based on the constructed JCFI indicator, the recognition of circumferential and longitudinal joints is sequentially achieved. Validation tests using RMLS point cloud data from the Guangzhou Metro Line 8 tunnel demonstrate that the proposed method, by constructing the JCFI that comprehensively characterizes joint features, effectively handles complex scenarios including noise interference, joint missing, and uneven point cloud density. The

joint recognition achieves a recall rate of 90.14%, a precision rate of 99.04%, and an IoU of 89.36%, providing a reliable technical solution for the accurate identification of shield tunnel segment joints.

1:30pm - 3:00pm

ThS28: Learning Across Temporal and Spatial Scales
Location: 715A

Comparative analysis of dual-form networks for live land monitoring using multi-modal satellite image time series

Iris Dumeur¹, Jérémy Anger^1,2, Gabriele Facciolo²

¹Kayrros, France; ²Université Paris-Saclay, ENS Paris-Saclay, CNRS, Centre Borelli, France

Multi-modal Satellite Image Time Series (SITS) analysis faces significant computational challenges for live land monitoring applications. While Transformer architectures excel at capturing temporal dependencies and fusing multi-modal data, their quadratic computational complexity and the need to reprocess entire sequences for each new acquisition limit their deployment for regular, large-area monitoring. This paper studies various dual-form attention mechanisms for efficient multi-modal SITS analysis, that enable parallel training while supporting recurrent inference for incremental processing.

We compare linear attention and retention mechanisms within a multi-modal spectro-temporal encoder. To address SITS-specific challenges of temporal irregularity and unalignment, we develop temporal adaptations of dual-form mechanisms that compute token distances based on actual acquisition dates rather than sequence indices.

Our approach is evaluated on two tasks using Sentinel-1 and Sentinel-2 data: multi-modal SITS forecasting as a proxy task, and real-world solar panel construction monitoring. Experimental results demonstrate that dual-form mechanisms achieve performance comparable to standard Transformers while enabling efficient recurrent inference. The multi-modal framework consistently outperforms mono-modal approaches across both tasks, demonstrating the effectiveness of dual mechanisms for sensor fusion. The results presented in this work open new opportunities for operational land monitoring systems requiring regular updates over large geographic areas.

Seasonality and Aerosol Optical Thickness affect Landsat 7 and 8 Harmonization Performance

Galen Richardson¹, Anders Knudby¹, Elisha Richardson², Wenjun Chen³

¹University of Ottawa, Ottawa, ON, Canada; ²Carleton University, Ottawa, ON, Canada; ³Canadian Centre for Mapping and Earth Observation, Ottawa, ON, Canada

Sensor harmonization is required to produce consistent Landsat imagery for long-term change detection. This study investigated the effect of seasonality and aerosol optical thickness (AOT) on linear harmonization functions, which are frequently used to create consistent Landsat 7 ETM+ and Landsat 8 OLI time series data. We found that training harmonization functions with pixels that have low or average AOT can greatly reduce the difference between near-coincidental Landsat 7 and Landsat 8 observations, and that seasonally trained harmonization models outperform models trained on year-round data. We assessed the effect of ETM+/OLI sensor harmonization on forest type classification with a Random Forest model, and found that seasonally harmonized imagery provided more consistent classification maps than the alternatives. This study illustrates important details related to the creation of harmonized datasets and is a significant step toward creating more consistent Landsat 7 and Landsat 8 data for long-term change detection analysis.

Dynamics of Urban Expansion in the Inter-Andean Valleys: Projecting Scenarios for Sustainable Territorial Planning

Mónica Enríquez¹, Jaqueline Morocho¹, Divar Castro-Rodas^2,3, Paulo Escandón-Panchana^4,5, Lucas Lima Raiol⁶, Andrés Velastegui-Montoya^1,3

¹Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ²Faculty of Mechanical Engineering and Production Sciences, ESPOL Polytechnic University; ³Laboratory of Geoinformation and Remote Sensing, Faculty of Engineering in Earth Sciences, ESPOL Polytechnic University; ⁴Departamento de Ingeniería Cartográfica y Topografía, Universidad Politécnica de Madrid (UPM); ⁵Escuela de Ciencias Ambientales, Universidad Espíritu Santo; ⁶Programa de Pós-Graduação em Ciências Ambientais (PPGCA), Institute of Geosciences (IG), Federal University of Pará (UFPA)

Urban growth in Ecuador's inter-Andean valleys has accelerated the territory's transformation, driven by the expansion of road infrastructure and the occupation of environmentally fragile areas. In this context, the Ruta Viva highway has reconfigured urbanisation patterns in the parishes of Cumbaya and Tumbaco, advancing the urban frontier into agricultural areas and moderate slopes. The objective of this study is to evaluate the dynamics of urban expansion in the parishes of Cumbaya and Tumbaco during the period 2002-2032, using a multitemporal and predictive approach to project future urbanisation scenarios and generate inputs for sustainable territorial planning and land management. The methodology integrated multitemporal analysis of land use and land cover data from MapBiomas (2002-2022), predictive modelling using CA-Markov-MOLUSCE, and urban expansion analysis. The results show a 3% increase in urban coverage during the 2002-2022 period and a projected 12% growth by 2032, concentrated south of the Ruta Viva corridor and within the agricultural mosaic. Simulations show that slopes below 25° are more susceptible to urbanisation, while vegetation cover loss reaches 30% on the slopes of Ilalo Hill. This study provides a robust, replicable tool for anticipating urbanisation scenarios in Andean environments, guiding land management and environmental conservation strategies in regions of high urban pressure.

Understanding the effect of spatiotemporal mismatches between airborne and ground surveys for ALS models of forest biomass: a case study in the Canadian boreal forest

Maxim Okhrimenko^1,2, Guillermo Castilla², Chris Hopkinson¹

¹University of Lethbridge, Canada; ²Canadian Forest Service (NRCan), Canada

The Area-Based Approach (ABA) for modelling forest biomass with ALS data assumes perfect co-registration, but operational inventories often have spatiotemporal misalignments. This study isolates and quantifies the independent error contributions from temporal gaps and spatial co-location errors. The analysis uses a unique dataset from the Taiga Plains, Canada, featuring 163 re-measured field plots paired with repeated ALS acquisitions from the same sensor.

To assess temporal effects, we constructed scenarios with varying time-gap distributions. Symmetrical time gaps (SD 1.1 vs 2.5 years) increased RMSE by ~1 percentage point but did not add bias. In contrast, skewed distributions introduced significant systematic biases of 8.0 % (6.8 Mg ha⁻¹).

To assess spatial effects, we linked co-location uncertainty directly to plot-level neighbourhood heterogeneity. This was done by shifting the 20x20m ALS footprint over a 1m lattice and recalculating predictors. The resulting predictor variability (RMS(CV) 12.7%) was propagated through the model, implying a positional sigma of 10-15%. Monte Carlo simulations confirmed this spatial component is the dominant error source, contributing 2–4 percentage points to the ~22% baseline %RMSE.

Our findings show that while balanced temporal gaps are manageable, spatial co-location affected by the local heterogeneity is the most critical factor for robust ABA models.

1:30pm - 3:00pm

SpS4B: Remote Sensing of Atmospheric Components for Climate Change and Air Quality: Bridging ISPRS and AERSS
Location: 715B

1:30pm - 1:45pm

PhysNorm-Net: A physics-guided adapted normalization network for reconstructing gapless, hourly tropospheric NO2 VCDs over Asia (2019–2024)

Hongrui Gao, Kai Qin

School of Environment and Spatial Informatics, China University of Mining and Technology, Xuzhou 221116, China

Tropospheric nitrogen dioxide (NO2) is a crucial trace gas for air quality assessment, yet satellite observations often suffer from spatial gaps (e.g., cloud cover) and temporal limitations. While the geostationary satellite GEMS provides hourly data over Asia, its short historical record and missing data restrict long-term studies. Therefore, a physics-guided adapted normalization network (PhysNorm-Net) is designed to reconstruct a gapless, hourly, and high-resolution (0.05°) tropospheric NO2 dataset over Asia from 2019 to 2024.

The model features an asymmetric U-Net architecture. It handles irregular data gaps using Partial Convolution with a dynamic mask and extracts spatiotemporal representations from meteorological and chemical priors. A novel Physics-Aware Normalization (PhysNorm) module bridges the modality gap by dynamically modulating satellite feature maps using physical backgrounds, ensuring adherence to atmospheric diffusion laws.

Extensive evaluations show that PhysNorm-Net achieves high prediction accuracy (R2 = 0.886). It robustly recovers spatial morphologies and pollution plumes even under extreme missing data scenarios. The generated 2019-2024 dataset accurately captures complex diurnal variations and localized hotspots, providing valuable insights into human activities and pollution policies in Asia.

1:45pm - 2:00pm

Physics-Informed Neural Networks for Efficient Spatiotemporal Inversion of NOx Emissions from TROPOMI

Qin He¹, Kai Qin¹, Hongrui Gao¹, Man Sing Wong², Jason Blake Cohen¹

¹China University of Mining and Technology, Xuzhou, 221116, China; ²The Hong Kong Polytechnic University, Kowloon, 999077, Hong Kong

Accurate estimation of nitrogen oxide (NOx) emissions is essential for understanding their role in atmospheric chemistry and managing air pollution. This study presents a novel approach using Physics-Informed Neural Networks (PINNs) to invert NOx emissions from TROPOspheric Monitoring Instrument (TROPOMI) satellite data. By coupling the physical laws of atmospheric processes, effectively bridging traditional data assimilation techniques with the computational efficiency of deep learning. Unlike purely data-driven models, it directly integrates physical constraints from atmospheric mass continuity equation into the model training process, eliminating the need for inputs or outputs from computationally intensive chemical transport models. Application to the Yangtze River Delta region of China (2018–2023) revealed detailed spatiotemporal NOx emission trends, including the impacts of the COVID-19 pandemic and subsequent recovery. Uncertainty quantification through Monte Carlo dropout provides robust error estimates. This physics-informed approach demonstrates strong potential for efficient NOx emission inversion and offers a versatile foundation for broader quantitative remote sensing applications.

2:00pm - 2:15pm

Fast Cloud Property Retrieval from TROPOMI O₂-A Band Observations Using a DISAMAR-Based Neural Network Framework

Tao Xie¹, Xiaoyun Zhang^2,3, Wenmei Li¹, Yixiang Chen¹, Ping Wang², Piet Stammes², Gijsbert Tilstra², Olaf Tuinder², Maarten Sneep², Feng Lu⁴

¹School of Internet of Things, Nanjing University of Posts and Telecommunications, China; ²R&D Satellite Observations (RDSW), Royal Netherlands Meteorological Institute (KNMI), NL; ³Nanjing University of Information Science and Technology (NUIST), China; ⁴Key Laboratory of Radiometric Calibration and Validation for Environmental Satellites, National Satellite Meteorological Center (National Center for Space Weather), Innovation Center for Feng Yun Meteorological Satellite (FYSIC), China Meteorological Administrations, Beijing 100049, China

With improvements in the spatial resolution of satellite spectrometers such as TROPOMI, Sentinel-4 and Sentinel-5, more homogeneous cloudy scenes can be resolved at the pixel scale. Therefore, it is worthwhile to use a scattering cloud model in cloud retrieval algorithms. DISAMAR (Determining Instrument Specifications and Analysing Methods for Atmospheric Retrieval) is a computer model developed to simulate the retrieval of atmospheric trace gases, aerosols, clouds, and land-surface properties from passive remote-sensing observations in the 270–2400 nm wavelength range. As a line-by-line radiative transfer model, DISAMAR provides accurate simulations but is computationally expensive. Machine learning techniques can improve the speed of cloud retrieval, because a neural network trained with detailed radiative transfer calculations for scattering clouds can replace the most time-consuming part of the retrieval algorithm.

In this study, we plan to build a cloud retrieval algorithm based on DISAMAR and accelerate it using neural network methods. The algorithm uses TROPOMI observations in the O₂-A band and supports the joint retrieval of cloud optical thickness (COT) and cloud-top pressure (CTP). The neural network models are trained offline using a large, high-resolution spectral data set in the O₂-A band generated by the DISAMAR forward model. All neural networks share the same set of input features but predict different targets, including reflectance and the derivatives of reflectance with respect to cloud pressure and cloud optical thickness. These predictions are then used within an optimal estimation framework to retrieve the cloud parameters.

2:15pm - 2:30pm

Generation of Nighttime Visible Bands for the Advanced Himawari Imager based on Deep Learning technologies

Jiaqi Jin^1,2, Jing Li^1,2, Man Sing Wong^1,2,3,4, PW Chan⁵

¹State Key Laboratory of Climate Resilience for Coastal Cities, The Hong Kong Polytechnic University, Hong Kong, China; ²Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ³Research Institute for Sustainable Urban Development, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ⁴Research Institute of Land and Space, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China; ⁵The Hong Kong Observatory, Hong Kong, China

This study involves remote sensing and artificial intelligence technologies. The study proposed a deep learning-based algorithm to generate the nighttime visible bands for Advanced Himawari Imager geostationary satellite.

2:30pm - 2:45pm

A radiative transfer model-guided deep learning framework for aerosoloptical thicknessretrieval fromsatellite observations

Man Sing Wong^1,2,3,4, Jing Li¹, Kai Qin⁵

¹Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China; ²Research Institute for Sustainable Urban Development, The Hong Kong Polytechnic University, Hong Kong SAR, China; ³Research Institute of Land and Space, The Hong Kong Polytechnic University, Hong Kong SAR, China; ⁴Otto Poon Research Institute for Climate-Resilient Infrastructure, The Hong Kong Polytechnic University, Hong Kong SAR, China; ⁵School of Environment and Spatial Informatics, China University of Mining and Technology, China

Atmospheric aerosols play a vital role in regulating air quality, ecosystems, and climate. Owing to their short atmospheric lifetime, aerosols exhibit strong spatial and temporal variability. Accurate global and regional monitoring of aerosol properties is essential for ecological processes, and radiative forcing. Satellite remote sensing has become a key tool for monitoring aerosol optical thickness (AOT) because of its broad spatial coverage. Traditional physical approaches rely on radiative transfer models (RTMs) to simulate top-of-atmosphere radiances. However, RTMs simplify the real atmosphere, and their accuracy depends strongly on assumed aerosol optical properties and surface reflectance, leading to major uncertainties and inter-algorithm discrepancies. In recent years, data-driven methods have rapidly advanced, driven by developments in machine learning and the increasing availability of collocated satellite and ground-based AOT datasets. The data-driven methods exclusively rely on the data pairs of satellite observations and ground-measured aerosol properties. It learns empirical relationships between satellite observations and measured aerosol properties, and it is more flexible to incorporate more diverse information. However, the AERONET ground stations, commonly used for training, are unevenly distributed and concentrated in urban regions, leaving other surface types such as forests and barren lands underrepresented. Besides, extreme pollution events (e.g., dust storms) are often misclassified as clouds and masked out in AERONET records, introducing bias into training datasets. To mitigate these limitations, this study proposes integrating simulated RTM data into the inversion framework to enhance the robustness and generalization of data-driven AOT retrieval models.

2:45pm - 3:00pm

Evaluating the generalization and uncertainty of data-driven air quality remote sensing models using an idealized testbed

Xinqi Wei¹, Qin He², Kai Qin²

¹Nanjing University of Posts and Telecommunications; ²China University of Mining and Technology

Short annotation如下

Reliable satellite-based estimation of near-surface air pollutants increasingly relies on data-driven models, yet their credibility is hindered by biased generalization assessment and unverified uncertainty estimates. Spatially sparse and unevenly distributed monitoring networks together with strong spatial autocorrelation cause conventional cross-validation approaches to substantially overestimate predictive skill, especially in regions lacking in situ observations. At the same time, although many models produce pixel-level uncertainty estimates, the degree to which these uncertainties reflect true prediction error remains largely unexplored.

This study introduces a controlled, model-agnostic evaluation framework to rigorously examine both spatial generalization and uncertainty reliability in air-quality remote sensing models. A chemical transport model provides a continuous, full-coverage nitrogen dioxide field that serves as an idealized truth. Sampling this field at actual monitoring locations reproduces real observational sparsity while preserving an unbiased reference for domain-wide evaluation. Multiple machine learning models are assessed using sample-based, site-based, and spatially optimized cross-validation to quantify evaluation bias and its dependence on spatial structure.

A dual-path uncertainty strategy is implemented to separately characterize aleatoric and epistemic components, complemented by diagnostic metrics assessing calibration, interval coverage, and sharpness. The framework provides a rigorous pathway for diagnosing reliability in data-driven atmospheric estimation models and supports the development of robust, trustworthy applications in quantitative remote sensing.

1:30pm - 3:00pm

IvS11: Remote Sensing and Geospatial Technologies for Vegetation Fire Management and Recovery Resilience
Location: 716A

1:30pm - 1:45pm

Application of remote sensing data in ice modelling for a regulated river

Karl-Erich Lindenschmidt¹, Denise Sudom², Saber Ansari², Mojtaba Jandaghian², Sergio Gomez¹, Amir Ali Khan², Thomas Puestow²

¹University of Saskatchewan, Canada; ²National Research Council, Canada

The formation and stability of river ice covers in regulated waterways are critical for uninterrupted hydro-electric operations. This study investigates the use of remote sensing, including satellite imagery, aerial surveys, and near-surface observations, to monitor ice cover development in the Beauharnois Canal along the St. Lawrence River. Ice booms are deployed in this canal to promote the rapid formation of a stable ice cover during freezing events, minimizing disruptions to dam operations. Remote sensing data were used to assess the spatial extent and temporal evolution of an ice cover and to calibrate the river ice model RIVICE. The model was applied to simulate ice formation for the 2019-2020 ice season, first for the canal with a series of three ice booms and then rerun under a scenario without booms . Comparative analysis reveals that the presence of ice booms facilitates the development of a relatively thinner and more uniform ice cover. In contrast, the absence of booms leads to thicker ice accumulations and increased risk of ice jamming, which could impact water management and hydroelectric generation operations. These findings demonstrate the value of remote sensing in river ice modelling and potential applications to support operational decision-making for regulated river systems.

1:45pm - 2:00pm

Investigating the Sensitivity of multi-frequency SAR Coherence to flooded Arctic Landfast Ice

Constanza Salvó^1,2, Saeid Homayouni^1,2

¹Institut national de la recherche scientifique, Canada; ²Centre d'études nordiques

When heavy snow or thinning ice allows seawater to intrude into the snow–ice interface, a saline slush layer forms, softening the surface and reducing traction. Because flooding is often invisible, travelers risk becoming stuck in remote areas, creating hazardous conditions. Saline slush also alters the snowpack’s physical and electromagnetic properties. Increased liquid water and salinity affect microwave signal interactions, complicating the estimation of ice properties using remote sensing. Depending on snow depth, temperature, and salinity, slush may refreeze or remain unfrozen, influencing ice thickness and heat transfer. Synthetic Aperture Radar (SAR) is widely used to monitor sea ice under all weather and light conditions. Its signal penetrates the dry snowpack and respond to changes at the snow base, making SAR suitable for detecting seawater flooding. However, SAR observations are sensitive to the target dielectric properties, surface roughness, frequency, incidence angle, and environmental variability. L-band coherence has shown sensitivity to flooding, but its behaviour on snow-covered ice remains poorly understood. This study examines the relationship between seawater flooding and SAR coherence using X- and L-band data collected alongside 2024–2025 field measurements in Qikiqtarjuaq, Nunavut. This research will show how SAR coherence can reveal flooded ice, supporting safer travel in northern communities.

2:00pm - 2:15pm

Segmentation of SAR imagery of river ice in the St. Lawrence River using deep learning: Preliminary steps to best practice

Katharine Scott¹, Nathan Wu², Liubov Poliakova³, Denise Sudom⁴

¹University of Waterloo, Canada; ²University of Waterloo, Canada; ³University of Waterloo, Canada; ⁴Ocean,Coastal and River Engineering,National Research Council of Canada

River ice is a key variable in northern regions, with impacts on transportation, infrastructure and flood events. There is increasing emphasis on using remote sensing data to assist operational monitoring. This study investigates the use of synthetic aperture radar (SAR) data for this purpose. The main goal is to provide an open, accessible and scalable approach for accurate semantic segmentation of SAR data into ice and water classes.

2:15pm - 2:30pm

Retrieving Snow Water Equivalent (SWE) from satellite gravimetry using a spectral combination approach

Farzam Fatolazadeh¹, Shusen Wang², Mehdi Eshagh^1,3, Kalifa Goïta¹

¹Centre d’applications et de recherches en t´el´ed´etection (CARTEL), D´epartement de G´eomatique appliqu´ee, Universit´e de Sherbrooke, Sherbrooke, Qu´ebec J1K 2R1, Canada; ²Canada Centre for Remote Sensing, Natural Resources Canada, Ottawa K1A 0E4, Canada; ³Division of Meteorology-forecast and Observation, Swedish Meteorological and Hydrological Institute, Sweden

Snow Water Equivalent (SWE) refers to the quantity of water contained within the snowpack, which is a critical component of the seasonal water cycle in cold regions, notably Canada. The Gravity Recovery and Climate Experiment (GRACE) mission primarily focuses on quantifying Terrestrial Water Storage Anomalies (TWSA),

which is the sum of anomalies in groundwater, soil moisture, surface water, and snow/ice. Separating the individual components with high precision is a challenging task due to the complex interactions of these parameters and their uncertainties involved. This study proposes an enhanced estimator which is modified based on the

spectral combination theory, to extract the SWE component from GRACE/GRACE-FO (Follow-On) TWS measurements. This estimator uses a hydrological model and its uncertainty to optimally extract the SWE component from the GRACE monthly models in spectral domain. The approach was applied in eight selected basins across

Canada, covering a diverse range of climatic and geographical conditions. Different winter seasons of each basin were considered, including the peak accumulation and ablation phases of the snowpack, from January 2003 to the end of 2022.

2:30pm - 2:45pm

Forecasting Ice Thickness on the Churchill River and Lake Melville, Labrador Using Machine Learning, 2023-2025

Ian Turnbull, Michael Lynch, Maria Yulmetova, Pradeep Bobby

C-CORE, Canada

During the winters of 2023-2024 and 2024-2025, machine learning (ML) based models were implemented to predict ice thickness at eight sites on the Churchill River and Lake Melville, Labrador for one- and three-day horizons. The forecast ice thicknesses were fed into the Churchill River Flood Forecasting System (CRFFS) operated by the Newfoundland and Labrador (NL) provincial government’s Water Resources Management Division (WRMD). The models were trained on measured ice thickness data from 2017-2023, with the 2024-2025 models additionally trained with data from the 2023-2024 ice season. The 2023-2024 models were deep learning models that used Long Short-Term Memory (LSTM) Recurrent Neural Networks (RNNs), and the 2024-2025 models were ML models that used a simpler gradient boosting regression (GBR) algorithm. The LSTM (2023-2024) models used a running time-series of local meteorological observations as predictor variables to directly forecast ice thickness, and the GBR (2024-2025) models mainly used forecast surface energy balance variables to predict changes in ice thickness. The average performance of the models across the eight sites was comparable between the two ice seasons; however, the 2024-2025 season models improved performance at key sites on the Churchill River that are critical to ice jam flood forecasting. This paper describes the development of the models and their operation and comparative performance over the 2023-2025 ice seasons.

2:45pm - 3:00pm

From Concept to Application: Machine Learning for Near-Real-Time River Ice Breakup Prediction Using SAR and Meteorological Data

Maria Yulmetova, Pradeep Bobby, Michael Lynch, Ian Turnbull

C-CORE, Canada

Accurate, reliable, and early-warning forecasts of river ice breakup are essential for flood risk mitigation and public safety, particularly in relation to river transportation and ice road operations. Synthetic Aperture Radar (SAR) satellite imagery has been widely utilized for monitoring river ice conditions due to its sensitivity to surface roughness and dielectric properties. This study advances traditional SAR applications and, to our knowledge, presents the first model that directly incorporates SAR data as input within a machine learning (ML) framework for river ice breakup prediction. The method leverages the correlation between SAR backscatter dynamics and the onset of surface melt. The model was evaluated using leave-one-out cross-validation, achieving an overall accuracy of 92%, an F1-score of 0.91, a Kappa coefficient of 0.84, and a mean absolute error (MAE) of less than 6 days for both the two- and three-week forecasts. The algorithm was also implemented in near-real-time operational settings, demonstrating strong performance with MAE values ranging from zero to four days across different river segments. The approach was further tested on an independent site, where it maintained robust predictive skill. The newly developed method shows strong potential for two- and three-week forecasting of river ice breakup, offering a scalable, cost-effective, and operationally viable tool for management and early warning applications.

1:30pm - 3:00pm

Forum5B: From Science to Action: Advancing Global Agricultural Monitoring for Food Security and Resilience
Location: 716B

1:30pm - 3:00pm

Forum10: Photogrammetry and Remote Sensing Enabled Geospatial Science for Equitable, Liveable Cities
Location: 717A

1:30pm - 5:00pm

General Assembly 3
Location: 701A

3:00pm - 3:30pm

Afternoon Coffee Break
Location: Exhibition Hall "E"

3:30pm - 5:15pm

ThS18: Advances in Reality Capture, AI, and Digital Twin Technologies for Construction Engineering
Location: 713A

3:30pm - 3:45pm

Image sequence based prediction of the temporal evolution of fresh concrete properties under realistic conditions

Max Meyer¹, Amadeus Langer¹, Max Mehltretter¹, Dries Beyer², Max Coenen³, Bastian Strybny³, Tobias Schack⁴, Michael Haist⁴, Christian Heipke¹

¹Institute of Photogrammetry and GeoInformation, Leibniz University Hannover, Germany; ²Feist Construct GmbH, Bad Pyrmont, Germany; ³Institute of Building Materials Science, Leibniz University Hannover, Germany; ⁴Institute of Construction Materials, University of Stuttgart, Germany

Advancing the level of digitalization and automation in concrete manufacturing can substantially contribute to lowering CO2 emissions associated with the concrete production. This work introduces a new methodology for predicting the time-dependent properties of fresh concrete during mixing. For the prediction, a deep learning network is created which uses stereoscopic image sequences of the flowing material together with tabular data as input. Besides mix design parameters and process state data, like energy consumption, moisture and fresh concrete temperature, temporal information is included in the tabular data. The temporal information represents the time interval between image acquisition and the time for which the properties should be predicted. During training, this interval corresponds to the difference between the image acquisition and the time at which reference measurements are taken, allowing the network to implicitly learn the temporal evolution of the material properties, namely the slump flow diameter, yield stress, and plastic viscosity. Incorporating time-dependent prediction enables the forecasting of property changes throughout the mixing process, offering a valuable tool for real-time process control. This capability allows timely adjustments whenever deviations from the desired material behavior are detected. The experimental investigations presented in this paper demonstrate the feasibility of this method under realistic conditions.

3:45pm - 4:00pm

Single-image to model registration for semantic enrichment of indoor BIM

Dorota Włodarczyk, Małgorzata Jarząbek-Rychard

Institute of Geodesy and Geoinformatics, Wrocław University of Environmental and Life Sciences, Poland

Effective integration of geometric and semantic data within Building Information Models (BIM) is essential for the efficient life cycle management of modern facilities. However, maintaining accurate as-is BIM models for existing buildings remains a significant challenge, as manual updates are labour-intensive and full 3D reconstruction is often impractical for incremental changes. In such cases, image-based approaches offer a fast and flexible alternative, but require reliable alignment of 2D imagery with existing BIM geometry. To address this challenge, this study introduces a streamlined pipeline for semantic enrichment that uses a single-image visual localisation approach to directly align 2D imagery with existing BIM geometry. The proposed method integrates transformer-based panoptic segmentation (Mask2Former) with a closed-form Perspective-n-Line solver to estimate 6-degrees-of-freedom (6-DoF) camera poses. The novelty of the proposed approach lies in the explicit use of semantic information as a geometric constraint to guide the selection of 2D–3D correspondences for pose estimation. Semantic labels are utilised to filter line correspondences, ensuring that only stable architectural boundaries (e.g., walls, floors, and ceilings) are used in the registration process. Such semantic filtering stabilises correspondence selection, effectively mitigating pose ambiguity in repetitive indoor layouts or scenes where structural elements are partially obscured by furniture and clutter. Experimental results confirm high accuracy, achieving a median position error of 9.84 cm and an orientation error of 1.05° in complex indoor environments. This robust registration framework provides a reliable foundation for the downstream semantic enrichment and digital twin updates.

4:00pm - 4:15pm

LSTNet: Local Shape Transformer Network for Road Marking Extraction

Jiafeng Wu^1,2,3, Chaorui Liu^1,2,3, Jiajun Shi^1,2,3, Jonathan Li^1,2,3,4, Lingfei Ma^1,2,3,4

Road markings are vital for HD maps and autonomous driving, yet LiDAR-based extraction is difficult due to missing RGB information, severe class imbalance, and thin, elongated geometry under sparse and noisy returns (Ma et al., 2020). We propose LSTNet, which performs local-shape tokenization by grouping points on tangent planes and encoding tokens from relative coordinates, normals, curvature, and intensity contrast. A geometry-aware transformer aggregates these tokens across multiple scales with attention biased by relative position and normal similarity, capturing long and thin structures while preserving edges. Our contributions can be summarized as follows: (1) We present LSTNet, which directly segments road marking from 3D point clouds, avoiding image conversion and preserving geometric fidelity. (2) We introduce a dedicated point-cloud dataset for road marking extraction to enable training and fair evaluation. (3) We design a task-specific and boundary-aware training objective that improves thin road marking recall and robustness under class imbalance.

4:15pm - 4:30pm

Automatic 3D Building Model Generation for Energy Digital Twins

Oscar Roman^1,2, Giorgio Agugiaro³, Ken Arroyo Ohori³, Maarten Bassier⁴, Elisa Mariarosaria Farella¹, Fabio Remondino¹

¹3D Optical Metrology, Bruno Kessler Foundation, via Sommarive 18, Trento, Italy; ²University of Trento, EICS and DII Department, Trento, Italy; ³3D Geoinformation group, Department of Urbanism, Faculty of Architecture and Built Environment, Delft University of Technology, Delft, The Netherlands; ⁴Department of Civil Engineering, TC Construction - Geomatics, KU Leuven - Faculty of Engineering Technology, Ghent, Belgium

The concept of Digital Twins (DTs) in Architecture, Engineering and Construction (AEC) domain encompasses a wide range of applications and scales, from single buildings to entire cities, spanning monitoring, simulation, energy management and operational control. Regardless of the specific application, a valid Digital Twin (DT) is a dynamic, data-driven model that stays continuously synchronized with its physical counterpart in both time and state via sensors and the Internet of Things (IoT). It must receive real-world input and provide feedback for analysis or control, ultimately progressing toward a self-operational DT. In the energy domain, an Energy Digital Twin (EDT) must be designed to (i) include sufficient geometric information (ii) support continuous monitoring, (iii) assist scenario-based simulation and (iv) enable operational maintenance and decision support. To achieve these objectives, the EDT’s geometry should be managed through two complementary representations: (i) a watertight solid volumetric model for physics-based simulation and (ii) a boundary representation (B-Rep) model for precise topology, semantics and data exchange. A mapping layer keeps the two representations consistent, preserving identity and topology across states and linking to the graph. Consequently, the EDT should adopt a multi-level architecture defining both geometric and data structures. This work introduces a robust Scan-to-Energy Digital Twins (Scan-to-EDTs) framework that generates multi-level building EDTs by integrating geometric, semantic and simulation layers to enable interoperable energy analyses.

4:30pm - 4:45pm

From propagation to prediction: point-level uncertainty evaluation of MLS point clouds under limited ground truth

Ziyang Xu¹, Olaf Wysocki³, Christoph Holst^1,2

¹Chair of Engineering Geodesy, TUM School of Engineering and Design, Technical University of Munich; ²TUM Leonhard Obermeyer Center, Technical University of Munich; ³CV4DT, University of Cambridge

Evaluating uncertainty is critical for reliable use of Mobile Laser Scanning (MLS) point clouds in many high-precision applications such as Scan-to-BIM, deformation analysis, and 3D modeling. However, obtaining the ground truth (GT) for evaluation is often costly and infeasible in many real-world applications. To reduce this long-standing reliance on GT in uncertainty evaluation research, this study presents a learning-based framework for MLS point clouds that integrates optimal neighborhood estimation with geometric feature extraction. Experiments on a real-world dataset show that the proposed framework is feasible and the XGBoost model delivers fully comparable accuracy to Random Forest while achieving substantially higher efficiency (about 3 times faster), providing initial evidence that geometric features can be used to predict point-level uncertainty quantified by the C2C distance. In summary, this study shows that MLS point clouds' uncertainty is learnable, offering a novel learning-based viewpoint towards uncertainty evaluation research.

4:45pm - 5:00pm

Automatic Scan-to-BIM: The Impact of Semantic Segmentation Accuracy on Opening Detection

Jidnyasa Patil, Arcot Sowmya, Mohsen Kalantari

University of New South Wales, Sydney, Australia

The automation of Scan-to-BIM remains a major challenge within the Architecture, Engineering, and Construction industry, particularly in the detection and geometric characterisation of architectural openings such as doors and windows. Although recent advances in 3D semantic segmentation have improved the classification of architectural elements, the effect of segmentation accuracy on downstream geometric detection and reconstruction is still under study. This work compares five state-of-the-art deep learning models, PointNeXt, PointMetaBase, Point Transformer V1, Point Transformer V3, and Swin3D, on opening detection in Scan-to-BIM. A unified evaluation framework integrating DBSCAN clustering with axis-aligned bounding box fitting is introduced to generate per-instance geometric representations. The models are assessed using semantic metrics and geometric reliability indicators, including centroid error, dimensional deviation and 3D IoU. Experiments on the S3DIS Area 5 dataset, reveal notable performance differences across models. Swin3D achieved the highest door detection rate of 96.9%, followed by PointMetaBase at 92.9%, PointNeXt at 87.4%, PTV3 at 85.0%, and PTV1 at 81.9%. Window detection proved more challenging, with Swin3D and PTV3 both achieving 75.0%, PTV1 at 71.2%, and PointNeXt and PointMetaBase at 67.3%. Notably, PointMetaBase produced strong geometric accuracy for doors despite lower semantic scores. These results suggest that high segmentation accuracy does not always lead to precise geometric reconstruction. To assess generalisation, the trained models were applied to 11 Matterport3D rooms, confirming that the observed patterns extend across different scanning environments. This study concludes that in Scan-to-BIM workflows, greater emphasis should be placed on geometric reconstruction algorithms than segmentation performance alone.

5:00pm - 5:15pm

Fast and accurate point surveying using the PIX4Dcatch mobile app

Giulia Rovelli¹, Marta Coelho Lopes², Gaia Amaranta Taberna¹, Adrian Fernandez¹, Paloma Pomares¹, Jean-Baptiste Magnin¹, Andrei Mitache¹, Davide Antonio Cucci¹, Christoph Strecha¹, Pierangelo Rothenbühler¹

¹PIX4D SA, Switzerland; ²École Polytechnique Fédérale de Lausanne (EPFL), Switzerland

The digitalization of the architecture, construction and subsurface utility engineering sectors demands efficient, accurate and flexible 3D point surveying methods. Established ones based on Global Navigation Satellite System (GNSS) rovers or total stations suffer from significant limitations, such as requiring open-sky visibility, high costs and complex setups. This paper introduces a novel method for georeferencing 3D points using the PIX4Dcatch mobile application coupled with an external Real-Time Kinematic (RTK) GNSS receiver. The method enables to survey a point of interest by just aiming the smartphone and tapping on the screen during a capture. A lightweight, modified Bundle Adjustment algorithm runs on the device, delivering accurate 3D coordinates in seconds without any post-processing. We evaluated the method by surveying several known cadaster points for hundreds of times across diverse field conditions, achieving a mean planimetry error norm of approximately 3 cm and 97% of errors below 10 cm. Similar statistics are achieved with single-point measurements using an RTK rover. Although not intended to replace millimeter-precision instruments, the accuracy profile of our method is perfectly suited for many applications, such as subsurface utility mapping, which often have decimeter-level regulatory requirements. Given its high efficiency, low cost and ease of use, we believe that our method has the potential to transform as-built documentation workflows in diverse engineering sectors.

3:30pm - 5:15pm

ThS9: EuroSDR Thematic Session: Emerging Challenges and Opportunities for National Mapping and Cadastral Agencies
Location: 713B

3:30pm - 3:45pm

Airborne Laser Scanning in GNSS-denied Areas

George Vosselman¹, Florian Pöppl², Peter Rieger², Gottfried Mandlburger³, Camillo Ressl³

¹University of Twente, Netherlands, The; ²Riegl, Austria; ³TU Wien, Austria

Jamming and spoofing of GNSS signals have become common practice in war zones and areas of political tension. The unavailability of reliable GNSS signals has a major impact on mapping services. Airborne laser scanning is one type of aerial survey that depends on GNSS. In this presentation, we propose a concept for airborne laser scanning surveys without using GNSS. We also present the results of an initial feasibility study.

3:45pm - 4:00pm

Visible Cadastral Boundary Delineation in Data-Scarce Countries using Data from Neighboring Data-Rich Countries

Jeroen Grift^1,2, Claudio Persello¹, Mila Koeva¹

¹University of Twente; ²Kadaster

Accurate cadastral maps are essential for effective land administration, supporting tenure security, land management, and socio-

economic planning. Automating cadastral boundary extraction can accelerate mapping in regions with incomplete or absent cadas-

tral information, but deploying pretrained models in data-scarce areas is challenging due to limited reference data and heterogeneous

landscapes. In this study, we investigate cross-region transfer learning for delineating cadastral boundaries using high-resolution

aerial imagery. We employ CadNet, a U-shaped deep learning model with a Swin Transformer backbone pretrained on the Dutch

CadastreVision dataset, and fine-tune it using Polish cadastral reference data selected for landscape similarity to a data-scarce region

in northern Moldova. Evaluation on Moldovan test tiles demonstrates substantial quantitative improvements: recall for visually dis-

cernible boundaries increases from 0.310 to 0.624, total vector-based discrepancy via Normalized Discrepant Area decreases from

7.898 to 7.051. Qualitatively, fine-tuning produces more continuous and coherent boundaries, recovers interior parcel divisions, and

better aligns predicted parcel structures with ground truth, compared to the pretrained model, which generates fragmented and in-

complete boundaries. These results highlight the importance of landscape similarity and reference data quality for transfer learning

and demonstrate a scalable framework for automated cadastral mapping in regions with similar landscape characteristics.

4:00pm - 4:15pm

Aerial image quality control - spatial resolution

Ivar Oveland¹, Andreas Prebensen Korsnes¹, Eija Honkavaara², Lauri Markelin², Andrew C. Flatman³, Henry Meißner^4,5

¹The Norwegian Mapping Authority, Kristiansand, Norway; ²NLS, Helsinki, Finland; ³KDS, Copenhagen, Denmark; ⁴German Aerospace Center, Berlin, Germany; ⁵Geoinformatics and Land Management, OTH Amberg-Weiden, Amberg , Germany

This study presents Siemens star studies in Norway, Finland, and Denmark during 2023-2025. The preliminary results demonstrate a significant and expected difference between GSD and GRD, highlighting that the GRD is a critical parameter when planning and procuring aerial imagery services. GRD relates to the smallest objects that can be reliably mapped. Incorporating GRD into planning ensures that expectations better match the final outcome. The study provides valuable insight into the practical use of Siemens star considering size, frequency, design, material selection, including comparisons between Bayer pattern and pan-sharpened sensors. The Nordic countries have different strategies for evaluating GSD considering prequalification, national calibration fields and field installations on individual projects. This study provides an overall assessment of the different approaches. The project aims to establish common requirements and methodologies for aerial image quality assessment, ultimately contributing to a European-wide GRD based resolution standard

4:15pm - 4:30pm

New Digital Models for the Italian Terrain Morphology and Gravity Field

Silvano Pecora¹, Maria Paola Bonofiglio¹, Giuseppe Poccia², Carlo Doglioni^4,5, Fernando Sansò^4,6, Riccardo Barzaghi⁶, Renè Forsberg⁷, Vittorio Casella⁸, Marco Arciero⁹, Alessandra Maria De Pace^10,12, Filippo Bocchino¹⁰, Roberta Ravanelli¹¹, Mattia Crespi^3,4,10,12

¹Ministero dell’Ambiente e della Sicurezza Energetica, Rome, Italy; ²Istituto Geografico Militare, Florence, Italy; ³Istituto Nazionale di Geofisica e Vulcanologia, Rome, Italy; ⁴Accademia Nazionale dei Lincei, Rome, Italy; ⁵Dept. of Earth Sciences, Sapienza University of Rome, Rome, Italy; ⁶Dept. of Civil and Environmental Engineering, Politecnico di Milano, Milan, Italy; ⁷National Space Institute, Technical University of Denmark, Lyngby, Denmark; ⁸Dept. of Civil Engineering and Architecture, University of Pavia, Pavia, Italy; ⁹eGeos S.p.A., Rome, Italy; ¹⁰Geodesy and Geomatics Division, Dept. of Civil, Building and Environmental Engineering, Sapienza University of Rome, Rome, Italy; ¹¹Geomatics Unit, Department of Geography, University of Liège, Liège, Belgium; ¹²Sapienza School for Advanced Studies, Sapienza University of Rome, Rome, Italy

Benefiting of EU funds coming from National Plan for Recovery and Resilience after the covid-19 pandemic, Italian Ministry for the Environment and Energy Security, in coordination with Istituto Geografico Militare and Istituto Nazionale di Geofisica e Vulcanologia, is currently implementing a national project for the acquisition and processing of airborne LiDAR and gravimetric data covering the entire Italian territory. The goal is to overcome the heterogeneity of existing digital terrain and surface models and gravimetric dataset, which suffer from inconsistencies in spatial coverage, temporal epoch, accuracy, and metadata completeness. The project will produce homogeneous, high-resolution Digital Terrain and Surface Models (DTM and DSM) and a new airborne gravimetric database, enabling to estimate a refined gravimetric geoid and significantly improving the Italian geospatial reference infrastructure. All the collected data and realized products will be publicly available. The main features of the project, and a selection of the already available results are hereafter presented.

4:30pm - 4:45pm

Colour Adjustment of Aerial Images from 2000–3000 m Altitude: Empirical Normalisation using Large Ground Colour Targets

Ivar Oveland¹, Steven Le Moan²

¹The Norwegian Mapping Authority, Kristiansand, Norway; ²Colourlab, Norwegian University of Science and Technology, Gjøvik, Norway

High-altitude aerial image national mosaics often exhibit visible colour and tone differences caused by atmospheric variability, illumination changes, sensor differences and post-processing workflows. These radiometric inconsistencies negatively influence both visual quality and the comparability of image data across sensors, time and campaigns. This work presents an empirical two-step colour adjustment and radiometric normalisation method for imagery acquired from 2000–3000~m altitude using a large multi-colour ground target designed to provide stable, spatially robust reference statistics. Field reflectance values are measured with a handheld spectrometer and converted to CIELAB coordinates. A global 3D similarity (Helmert) transform aligns measured image colours to ground-truth CIELAB values, followed by local residual chromatic correction using inverse distance weighting. Experiments on aerial datasets demonstrate that the method significantly reduces colour discrepancies at the calibration site.

4:45pm - 5:00pm

Enabling regular map updates and identification of impervious surfaces through satellite data fusion, machine learning and cloud platforms

Melchior Shukuru¹, Stephen McCarron¹, Guilllermo Castro Camba², Conor Cahalane¹

¹Department of Geography, Maynooth University, Co . Kildare, Ireland; ²Dept Surveying, Remote Sensing, Geodesy & Boundaries, Tailte Éireann, Phoenix Park, Dublin 8, Ireland

Frequent cloud cover is a common impediment deterring many countries from employing optical earth observation data for the purposes of national map updates. A decision-level data fusion approach allows the use of satellite imagery in such locations and therefore has potential to assist in this task. In this study we test the use of cloud penetrating Sentinel-1 to enhance the delineation of impervious surfaces from other land cover types, impervious surfaces being a key component of hydro-climatological models in urban and semi-urbanised areas. Using machine learning techniques and leveraging the full Copernicus archive in the Google Earth Engine (GEE) platform, a post-classification change detection approach was developed to assess impervious surface expansion between 2017 and 2023 across the urban centre of Dublin, Ireland. Image classification, conducted using a random forest classifier, achieved overall accuracies of 93% and 91% and kappa coefficients of 0.91 and 0.89 for 2017 and 2023 data, respectively. The addition of multispectral and RADAR indices such as NDVI, NDBI and PRISI was tested and proved generally effective, but showed limitations in areas adjacent to the coast and inland water bodies, with indications of confusion between land cover types. The inclusion of NDWI in data fusion was shown to help differentiate waterbodies from impervious surfaces, particularly highlighting the importance of integrating a water-specific index. NDVI also outperformed other indices in feature importance, though PRISI was shown to helpfully cluster impervious surfaces

5:00pm - 5:15pm

Conceptualising Value in Public Sector Geographic Information for Digital Twins

Jack Metcalfe¹, Claire Ellul¹, Stefano Cavazzi², Jantien Stoter³, Jeremy Morley¹

¹Department of Civil, Environmental and Geomatic Engineering, University College London, London WC1E 6BT, UK; ²Ordnance Survey, Southampton SO16 0AS, UK; ³3D Geoinformation Research Group, Delft University of Technology, 2628 CD Delft, The Netherlands

Digital twins (DTs) are digital representations of physical entities with data connections synchronising the physical and digital states. While DTs originated in manufacturing and aerospace, they are increasingly applied at geographic scales addressing urban issues. As a result, DTs must utilise geographic information (GI) to represent the built environment, though this is often an implicit aspect. Public sector geographic information (PSGI), typically produced by National Mapping and Cadastral Agencies (NMCA) is a particular type of GI that serves as an authoritative, foundational component to geospatial applications. However, the value of this PSGI as foundation component of DTs is not well understood. Existing GI valuation methodologies do not account for the unique characteristics of foundational PSGI, or its role within DTs , leaving NMCAs unable to justify investment, and adapt their contributions, to emerging DTs. To address this gap, this study applies Jabareen's (2009) conceptual framework analysis methodology to define what value means in the context of PSGI in DTs. The analysis identifies seven value enablers and five value dimensions that characterise PSGI value in DTs and provide the basis for future quantitative valuation methodologies. These concepts are integrated through an urban infrastructure DT example and synthesised through boundary case analysis. The resulting conceptual understanding provides a foundation for NMCAs to systematically articulate and evidence their contributions to DTs.

5:15pm - 5:30pm

Consolidating Feedbacks and Expertise of Digital Twins of Territories' Engineers in Nation-Wide Frameworks

Théo Szanto, Mathieu Brédif, Bénédicte Bucher

Univ Gustave Eiffel, ENSG, IGN, LASTIG

Digital Twins of Territories (DTTs) are increasingly adopted by municipalities to support ecological transition, crisis resilience, and participatory decision-making. Designing a DTT that fits local needs requires engineers to combine multiple areas of expertise (data discovery, integration, modeling, visualization, and stakeholder interaction) while working with heterogeneous geospatial datasets of varying quality. Nation-wide DTT frameworks aim to assist these efforts, yet they currently lack mechanisms to consolidate the expertise produced during local DTT developments.

This paper introduces dttrecipe, a model designed to capture, structure, and share DTT engineers' feedback and decision-making processes. Building on the prov, wfdesc and wfprov ontologies, and inspired by the OGC Geospatial User Feedback standard, dttrecipe formalizes the description of territorial stakes, data workflows, encountered problems, and the rationale behind design choices. It supports both complete and partial workflow descriptions, encouraging collaboration, reproducibility, and cross-territorial knowledge reuse.

The model is qualitatively evaluated via a case study focused on bicycle-mobility planning and citizen engagement in a rural city. The resulting recipe highlights recurrent categories of DTT engineering challenges, including data discoverability and usability issues, multi-source misalignment, documentation accessibility, and limited local expertise. Explicit documentation of these challenges shows how engineers' often implicit expertise can be converted into reusable knowledge for other territories facing similar constraints.

The work shows that structured documentation of DTT engineering practices can strengthen national DTT frameworks by improving interoperability and enabling efficient knowledge transfer. Future work will address querying mechanisms and evaluate the reuse of shared recipes at scale.

3:30pm - 5:15pm

WG III/4D: Landuse and Landcover Change Detection
Location: 714A

3:30pm - 3:45pm

Study on Multi-scale Assessment Methodology for SDGs Localization

Mingyi Du, Shisong Cao, Jingyi Chen

Beijing University of Civil Engineering and Architecture, Beijing, China

This study takes into account the heterogeneity of regional development stages and the local development context, systematically explores the relationship between localization and sustainable development, and constructs a quantifiable localized SDGs assessment model for China. An empirical analysis of the multi-level SDGs evaluation system was conducted. To address the challenges posed by heterogeneous multi-source data in the assessment process, a composite Key Performance Indicator (KPI) screening model based on Random Forest and Hyperlink-Induced Topic Search (HITS) was proposed, enhancing the scientific rigor and efficiency of localized SDGs monitoring and evaluation.

3:45pm - 4:00pm

Discussion on the ‘Integration of Four Databases’ for Natural Resources Survey and Monitoring in Beijing Based on the ‘Jiaxing Experience’

Yingchun Huang^1,2, Feng Ni^1,2, Jun Wu^1,2, Juan Gu^1,2, Yifei Cao^1,2, Xudong Yang^1,2

¹Beijing Institute of Surveying and Mapping, China, People's Republic of; ²Beijing Key Laboratory of Urban Spatial Information Engineering, Beijing,China, People's Republic of

This paper is mainly based on the experience of Jiaxing City, which has done a good job in the investigation and monitoring work in China, to inspire the investigation and monitoring work in Beijing, and to provide technical support for supporting the investigation and monitoring work in Beijing to achieve the goal of "one inspection, multi-purpose, integration and sharing". Through research, the four databases of new basic surveying and mapping, land change survey, urban land space monitoring, and land space planning are integrated, and the integration of content indicators, survey methods, collection and storage, management and sharing is realized.

4:00pm - 4:15pm

Pernambuco Water Dataset (PWD): a high-resolution multi-source dataset for deep learning-based waterbody segmentation in tropical and semi-arid regions

Tiago Prudencio Silvano¹, Reginaldo Pereira da Silva Filho², Joel Passos¹, Simone Sayuri Sato¹, Cleber Zanchettin¹

¹Federal University of Pernambuco (UFPE); ²Brazilian Army Geographic Service (DSG)

Accurate extraction of water bodies from remote sensing imagery is essential for environmental monitoring, water resource management, and hydrological applications. However, the performance of deep learning models for water segmentation depends on the availability of representative datasets that capture diverse environmental and spectral conditions, particularly in tropical and semi-arid regions that remain underrepresented in existing datasets. This study presents the Pernambuco Waterbody Dataset (PWD), a multi-source dataset comprising aerial and satellite remote sensing imagery for water-body segmentation. The dataset covers the state of Pernambuco, Brazil, including tropical and semi-arid environments associated with the Atlantic Forest and Caatinga biomes. The dataset includes high-resolution aerial imagery (0.5 m) from the Pernambuco Tridimensional Program (PE3D) and Sentinel-2A imagery (10 m), with manually annotated water bodies generated by cartographic specialists. The dataset was constructed through data acquisition, preprocessing, manual annotation, mask generation, patch extraction (512 × 512 pixels), and division into training, validation, and test subsets. The first version includes 51,743 aerial patches and 15,321 Sentinel-2A patches. To validate the dataset, U-Net, U-Net++, and DeepLabV3+ architectures with ResNet and EfficientNet backbones were evaluated using Recall, Precision, F1-score, and IoU metrics. The best performance was achieved by U-Net++ (ResNet34) for aerial imagery (IoU 0.946) and U-Net (ResNet34) for Sentinel-2A imagery (IoU 0.871). Overall, the proposed dataset provides a robust benchmark for advancing deep learning-based water body extraction using multi-source remote sensing data.

4:15pm - 4:30pm

Bitemporal Spatial Autocorrelation Matrix for Change Detection in Multispectral Imagery: A Case Study on the Drying of a Lake in Southern Italy

Giovanni Piccolo^1,2, Cinzia Sada¹

¹Università degli Studi di Padova - Physics & Astronomy Department “G. Galilei”; ²Engineering Ingegneria Informatica S.p.A

Multispectral satellite imagery provides an essential source of information for monitoring environmental transformations, yet robust unsupervised change detection remains challenging due to radiometric variability and seasonal dynamics. At the same time, supervised approaches based on Deep Learning are often constrained by the need for computationally expensive accelerated hardware and the limited availability of high-quality annotated datasets. This work introduces a framework based on the Bitemporal Spatial Autocorrelation (BSAC) matrix, that rather than relying on pixel-wise spectral differencing or data-intensive Deep Learning models, it is designed to quantify structural changes by evaluating the symmetry properties of the spatial autocorrelation across multiple spatial lags. Three complementary metrics are derived from the BSAC representation: a binary change/no-change trigger that identifies structural discontinuities, an asymmetry magnitude that measures the intensity of change, and a normalized Symmetry Index obtained via singular value decomposition to characterize the geometric coherence of the correlation structure.

The methodology is applied to Sentinel-2 imagery of Lake Fanaco (Sicily, Italy), which experienced severe desiccation during the 2024 drought. Experiments conducted using NDWI and NDVI confirm the index-agnostic nature of the framework, capturing both

hydrological contraction and vegetation stress. Comparison with an unsupervised K-means segmentation baseline shows strong spatial agreement in identifying the affected areas. Thanks to its unsupervised formulation and near-linear computational complexity, the BSAC framework represents a scalable and interpretable approach for operational change monitoring in Earth observation.

4:30pm - 4:45pm

A Novel Label-Free Approach for Post-Fire Environmental Assessment Based on Zero-Shot Segment Anything Model (SAM)

Melih Altay¹, Fatih Fehmi Simsek², Saygın Abdikan³

¹Department of Geomatics Engineering, Hacettepe University, Ankara 06800, Türkiye; ²TUBITAK Space Technologies Research Institute, Ankara 06800, Türkiye; ³Department of Geomatics Engineering, Hacettepe University, Ankara 06800, Türkiye

Accurate and rapid burned-area mapping is essential for assessing the ecological impacts of forest fires and supporting post-fire recovery efforts. Traditional pixel-based methods often suffer from limited accuracy due to spectral confusion, topographic effects, and reliance on empirical thresholds. Although deep learning models such as U-Net, DeepLab, and SegFormer improve spatial precision, their operational scalability is constrained by the need for extensive labeled data and regional retraining.

This study introduces a zero-shot burned area mapping approach using the Segment Anything Model (SAM) with Sentinel-2 imagery. SAM, trained on over a billion masks, enables prompt-based segmentation without task-specific training. Composite inputs derived from NBR, NBR2, and NDVI indices were generated and fed into SAM, followed by testing multiple pre-processing, post-processing, and hyperparameter configurations.

Results show that multi-scale settings (crop_n_layers = 2) significantly enhance boundary continuity and geometric accuracy. The method achieved IoU values of 0.89 (Bursa) and 0.87 (Çanakkale), with corresponding F1 scores of 0.94 and 0.92 performances comparable to, and in some cases exceeding, supervised models. Integrating spectral index composites further reduced boundary fragmentation and improved discrimination between burned and unburned surfaces.

Overall, the proposed framework eliminates dependence on manual labeling, offering a fast, scalable, and cost-effective solution adaptable to diverse ecosystems and sensor conditions. The study demonstrates one of the first systematic applications of SAM for burned-area detection, highlighting its strong potential for zero-shot environmental monitoring and rapid post-fire assessment.

4:45pm - 5:00pm

Application of machine learning methods and Sentinel-2 data for multitemporal land-cover classification in conflict-affected areas

Kinga Karwowska¹, Aleksandra Sekrecka², Jakub Ślesiński³, Magdalena Lewińska⁴

¹Military University of Technology, Poland; ²Military University of Technology, Poland; ³Military University of Technology, Poland; ⁴Military University of Technology, Poland

In many regions of the world, especially those affected by armed conflicts, urbanization, or intensive environmental transformations, a high dynamic of land cover and land use changes is observed. Reliable monitoring of these processes requires the application of classification methods that ensure both high thematic accuracy and temporal consistency. This paper presents a multitemporal classification methodology based on Sentinel-2 optical data and machine learning models.

The research was conducted for the city of Sievierodonetsk (Luhansk Oblast, Ukraine) – an area that suffered significant destruction in 2022 as a result of military operations. The aim of the analysis was to identify land use changes in the years 2021-2025 using three classifiers: k-Nearest Neighbors (kNN), Random Forest (RF), and Gradient Boosting Classifier (GBC), combined into an ensemble system based on dynamic confidence weighting.

Quality assessment using the recall metric showed that the fusion method outperformed individual classifiers, achieving average values of 0.87-0.96, while classical models obtained 0.81-0.89. The largest changes (39%) occurred in the years 2022-2023, coinciding with the period of greatest military activity. The proposed method achieved the highest classification quality indices (F1 = 0.93, Acc = 0.98 for 2021), surpassing global products and models based on AlphaEarth. In subsequent years, high stability was maintained (F1 ≥ 0.88), confirming the effectiveness and robustness of the approach under various environmental conditions

5:00pm - 5:15pm

Monitoring landscape dynamics via multitemporal classification at Comandante Ferraz Station neighborhood, Keller Peninsula, Antarctica

Eduardo Soares Nascimento¹, Renato César dos Santos², Guilherme Cardim³, Samara de Azevedo⁴, Pedro Pina⁵, Erivaldo Antonio da Silva²

¹Graduate Program in Cartographic Sciences (PPGCC), Department of Cartography, School of Technology and Sciences, São Paulo State University (FCT-UNESP), São Paulo, Presidente Prudente, 19060-900, Brazil; ²Department of Cartography, School of Technology and Sciences, São Paulo State University (FCT-UNESP), São Paulo, Presidente Prudente, 19060-900, Brazil; ³Engineering Department, School of Engineering and Sciences, São Paulo State University (FEC-UNESP), Rosana, SP, Brazil; ⁴Institute of Natural Resources, Federal University of Itajubá (UNIFEI), Itajubá, MG, 37500-903, Brazil; ⁵Department of Earth Sciences, Faculty of Sciences and Technology, University of Coimbra (UC), 3030-790, Coimbra, Portugal

This study examines the landscape dynamics in the region surrounding Comandante Ferraz Antarctic Station, Keller Peninsula, King George Island, focusing on the quantification of land cover changes over 23 years. Emphasis is placed on the integration of a multitemporal Landsat time series (2001–2024) within a standardized spatio-temporal data cube framework, coupled with a Random Forest (RF) classification approach. This methodology enables consistent pixel-wise trajectory analysis across seven distinct epochs. The RF models achieved robust performance, with F1-scores for dominant classes like water and soil typically exceeding 0.90, although seasonal snow and ice showed greater spectral ambiguity in transitional months. Quantitative results from the transition matrices reveal a significant landscape reconfiguration: while ice (85.3%) and soil (81.2%) showed high persistence, a prominent trend of deglaciation was identified, characterized by the transition of ice and snow into exposed soil and the emergence of pioneer vegetation communities detected from 2014 onwards. The study demonstrates that the integration of machine learning and data cubes provides a powerful tool for monitoring environmental shifts in high-latitude maritime Antarctica, supporting long-term ecological assessments and climate impact modeling.

3:30pm - 5:15pm

WG III/6B: Remote Sensing of the Atmosphere
Location: 714B

3:30pm - 3:45pm

Improving Severe Convective Rainfall Forecasting Using Machine Learning with Multi-band Radar Observations

Leiming Ma

Shanghai Typhoon Institute, China, People's Republic of

Severe convective rainfall, triggered by multi-scale atmospheric interactions, poses a critical forecasting challenge in coastal cities like Shanghai, where monsoon, topography, and sea-land breeze amplify extremes. Conventional methods, constrained by scale separation and model biases, struggle to predict convection. This study develops the Synergistic Framework for Convective Rainfall Forecasting (SSF-CRF) by integrating three modules: (1) Adaptive S/X-band radar remote sensing, dynamically capturing mesoscale convective structures; (2) Gated Vertical Information Propagation (GVIP) network, machine learning on vertical energy propagation to capture convection; (3) Precipitation Ordinal Distribution Autoencoder (PODA), correcting numerical weather prediction (NWP) biases with ordinal precipitation classification. Verification against Radar data and European Centre for Medium-Range Weather Forecasts (ECMWF) model indicates that SSF-CRF improves heavy rainfall (≥50 mm/h) Critical Success Index (CSI) by 33% versus operational forecasts. It offers a potential solution for convective forecasting in climate-vulnerable coastal regions, advancing remote sensing-driven atmospheric applications.

3:45pm - 4:00pm

Assessing Real-Time PPP Performance for PWV Estimation Using Low-Cost GNSS Stations and Multi-Source Correction Products

Milad Bagheri, Paolo Dabove

Polytechnic University of Turin, Italy

Monitoring atmospheric water vapour is essential for weather forecasting and climate studies. GNSS networks can retrieve Precipitable Water Vapour (PWV) continuously at each station location, but the accuracy depends on the quality of the satellite orbit and clock corrections used in the processing. This study evaluates PWV retrieval from 478 stations of the French Centipede low-cost GNSS network using four levels of correction products with decreasing latency: GFZ Final ($\sim$2 weeks), Rapid ($\sim$1 day), Ultra-rapid (3--9 hours), and broadcast ephemerides (real-time). Validation against ERA5 reanalysis shows that the Final and Rapid products achieve similar performance (RMSE $\approx$ 2~mm, $r^2$ = 0.84), confirming that near-real-time processing introduces no significant accuracy loss. Ultra-rapid products remain usable (RMSE = 3.4~mm), while broadcast ephemerides show larger errors (RMSE = 5.8~mm) but still capture the spatial moisture pattern. In addition, a real-time experiment using the freely available Galileo High Accuracy Service (HAS) demonstrates that stable tropospheric estimates (ZTD $\pm$ 1.4~mm, PWV $\pm$ 0.2~mm) can be obtained in real time, even before the positioning solution has fully converged. These results suggest that combining the spatial density of low-cost networks with real-time HAS corrections could enable high-resolution PWV monitoring that is not achievable with existing systems.

4:00pm - 4:15pm

Use of FY-3G Airborne Rain Radar for Typhoon Precipitation Analysis

Hong Li, Rui He, Jingyao Luo

Shanghai Typhoon Institute of CMA, China, People's Republic of

Fengyun-3G, launched in 2023, carries Ku- Ka dual-frequency precipitation measurement radar (PMR) providing new opportunities for monitoring the fine three-dimensional structure of typhoon precipitation over the ocean. This study first validate the FY-3GPMR data by using the ground-based data, then utilizes PMR to analyze the precipitation during the rapid intensification phase of Super Typhoon Yagi in the year of 2024. The analysis reveals the horizontal and vertical distribution characteristics of precipitation during Yagi's RI phase based on the FY-3G PMR data, and discusses the associated dynamical-microphysical coupling mechanism. Overall, FY-3G PMR offers critical insights for understanding cloud and precipitation process involved in the RI.

4:15pm - 4:30pm

Spatiotemporal Characteristics and Environmental Drivers of Atmospheric Water Vapor in Mainland China: Insights from Fengyun-4A Satellite Data

Zhiguo Pang^1,2, Minghan Sun¹, Jingxuan Lu^1,2, Wei Jiang^1,2, June Fu^1,2, Xiangdong Qin¹

¹State Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower Research, China; ²Research Center of Flood and Drought Disaster Prevention and Reduction of the Ministry of Water Resources, China

Atmospheric water vapor plays a fundamental role in regional climate regulation and precipitation formation, yet its vertical structure and spatiotemporal evolution over mainland China remain insufficiently understood due to complex terrain and diverse climatic conditions. Using Fengyun-4A layered precipitable water (LPW) products from 2020 to 2023, this study provides a comprehensive assessment of the vertical distribution, spatiotemporal variability, and key environmental drivers of water vapor across China. Results show pronounced spatial gradients and seasonal contrasts: total precipitable water (TPW) exhibits a slight overall decline, primarily driven by reductions in low layer; spatially, TPW is highest in the southeast and lowest in the northwest; seasonally, water vapor peaks in summer and reaches its minimum in winter, with spring and autumn representing monsoon-transition phases. Vertically, approximately 75% of atmospheric water vapor is concentrated within the lowest 4 km, with the middle layer contributing most to regional differences, while high layer remains relatively uniform and minimally influenced by terrain. Environmental correlations indicate that TPW is positively associated with 2m temperature, relative humidity, surface pressure, total cloud cover, and precipitation, but negatively associated with DEM and evaporation. Layer-dependent responses indicate that the lower layer is strongly influenced by surface processes, the middle layer by both surface moisture transport and large-scale circulation, and the high layer primarily by thermodynamic structure and synoptic background. These findings, derived from high-resolution satellite observations, enhance understanding of atmospheric water vapor stratification and its controlling mechanisms, providing essential support for water vapor transport diagnosis, precipitation evolution, and operational forecasting improvement.

4:30pm - 4:45pm

Drought Identification and Prediction from GNSS Time Series Using SSA and Hybrid CNN-Transformer

Motahareh Esfandyari Kaloukan¹, Shirin Malihi², Siavash Iran-Pour³, Danesh Shokri⁴, Saeid Homayouni⁵

¹University of Isfahan; ²University of Cambridge, United Kingdom; ³University of Isfahan; ⁴Universit´e Laval; ⁵Institut National de la Recherche Scientifique

In recent decades, global climate change has triggered a rise in extreme environmental phenomena, including prolonged droughts, intensified precipitation events, and shifts in tidal patterns. This study focuses on the application of the observations from Global Navigation Satellite System (GNSS) signals for monitoring and classifying climatic conditions, with particular emphasis on drought.

Using daily vertical displacement data from a GNSS station in California (2005–2023), we developed a robust analysis framework. It includes data cleaning (removing outliers, filling gaps, detecting offsets, and modeling noise), trend and seasonal pattern extraction through Singular Spectrum Analysis (SSA), feature generation (like amplitude, energy, and dominant frequency), labeling

based on the Standardized Precipitation-Evapotranspiration Index (SPEI), and classification using a hybrid CNN-Transformer model. The results demonstrate the model’s capability to accurately detect drought periods (SPEI > -1) characterized by diminished amplitudes in seasonal components and heightened noisy fluctuations, as well as wet periods (SPEI < 1) marked by elevated energy in semi-annual signals. The model was evaluated with an overall accuracy of 83.3 percent, an F1-score of 0.90 for the drought class, and successful application to future data (2024–2029). This approach, independent of traditional meteorological data, underscores the potential of GNSS as a geodetic tool for environmental monitoring, albeit with limitations such as reliance on single stations and the need for supplementary datasets. The methodology holds promise for enhancing early warning systems and climate models.

4:45pm - 5:00pm

Integrating Satellite Observations to Assess Seasonal Wetland Methane (CH₄) and Carbon Dioxide (CO₂) Dynamics in the Greater Bay Area

Muhammad Umar, Majid Nazeer, Man Sing Wong

Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China

Carbon dioxide (CO₂) and methane emissions (CH₄) are primary greenhouse gases whose rising atmospheric levels intensify global climate change. Wetlands, despite covering only 5–8% of Earth’s land area, contribute nearly 30% of global methane emission while storing up to 30% of global soil organic carbon. This makes wetlands both sinks and sources of greenhouse gases, though their seasonal CO₂ and CH₄ dynamics in the Guangdong–Hong Kong–Macao Greater Bay Area (GBA) remain poorly understood.

Ground-based instruments offer high accuracy but limited spatial coverage, whereas satellite missions, such as Sentinel-5P/TROPOMI for XCH₄ and OCO-2 for XCO₂, enable wide-area monitoring. This study investigates the seasonal dynamics of CH₄ and CO₂ across different wetland ecosystems in the GBA using satellite observations and ERA5-Land climate variables. Seasonal means were computed in Google Earth Engine for Winter, Spring, Summer, and Autumn from 2019 to 2025.

Results show a consistent rise in atmospheric CH₄ from 1856 ppb (2019) to 1939 ppb (2025), with the highest levels in Autumn and Winter. CO₂ increased from 404 ppm to 424 ppm, peaking in Winter and Spring. Non-wetland regions and mangroves emerged as the primary contributors to greenhouse gas accumulation, while salt marshes and other wetlands showed lower values.

Pearson correlation analysis indicated strong influence of temperature, dew point, and precipitation on CO₂, while CH₄ showed variable sensitivity to rainfall and wind. Findings emphasize the impact of land-cover type and climate in shaping seasonal greenhouse gas dynamics, supporting SDG 13 and SDG 15, and necessitating hyperspectral data integration for climate policies.

5:00pm - 5:15pm

Remote Sensing Data Fusion for Urban Air Quality: Investigating the Relationship Between Land Surface Temperature, NDVI, and NO₂ Concentration

Aryan Hosein Aghaei, Hamid Ebadi, Abbas Kiani

Khajeh Nasir Toosi University of Technology, Iran, Islamic Republic of

Urban air quality remains a critical concern, as NO₂ emissions from transport and industrial activities frequently exceed healthy limits in major cities. Urban vegetation can help reduce pollution by enhancing natural filtration and cooling, while higher land surface temperatures (LST) tend to intensify pollutant accumulation. Using satellite-based remote sensing, this study investigates how vegetation health (NDVI) and surface temperature influence NO₂ levels in two distinct urban environments: Blackburn/Arlington Road in England and District No. 3 in Tehran, Iran, across pre-, during-, and post-COVID-19 lockdown periods. Both cities experienced notable environmental improvements in 2020: NDVI increased from approximately 0.45–0.48 to around 0.54–0.61, while NO₂ dropped significantly from about 0.46–0.50 to roughly 0.13–0.35. LST also declined from pre-lockdown values near 0.46–0.48 to as low as 0.12–0.38. During the lockdown, vegetation levels showed a clear negative relationship with NO₂ concentrations, and pollution trends displayed a strong positive association with higher temperatures, highlighting the linked benefits of greener and cooler environments. However, as human activities resumed after 2021, these relationships became inconsistent or weakened, with occasional shifts in direction depending on seasonal conditions and external drivers such as traffic recovery and industrial intensity. Overall, the results reinforce that increasing vegetation coverage and mitigating urban heating can meaningfully reduce NO₂ levels. By revealing how urban form, vegetation dynamics, and thermal conditions collectively shape pollution patterns, this research provides insights for city planners, environmental managers, and public health authorities working to design more sustainable and healthier urban environments.

3:30pm - 5:15pm

WG III/2B: Spectral and Thermal Data Processing and Analytics
Location: 715A

3:30pm - 3:45pm

BathyUNet++: A center-focused receptive-field network for high-resolution bathymetry mapping from SuperDove imagery

Wendian Lai^1,2, Xiaolong Yu¹, Anders Jensen Knudby², Lakmal Deshapriya Nawarathnage², Zhongping Lee¹

¹State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen 361102, China; ²Department of Geography, Environment and Geomatics, University of Ottawa, 60 University Private, Ottawa, ON, Canada, K1N 6N5

Bathymetry information around islands, reefs, and shallow-water regions is critical for both navigation safety and environmental management. However, these areas often feature diverse substrate types and strong spatial heterogeneity, which makes it challenging to accurately retrieve fine-scale bathymetry from traditional medium-resolution satellite imagery. High-spatial-resolution (HSR) sensors, such as PlanetScope SuperDove (~ 3.7 m spatial resolution), offer the potential to capture more detailed spatial features, yet their relatively low signal-to-noise ratio (SNR) can lead to noisy retrievals, particularly over low-reflectance waters. To mitigate this issue, incorporating the spatial context of neighboring pixels while jointly utilizing the spectral information offered by low- and high-resolution sensors can enhance the stability and accuracy of HSR-based bathymetry retrievals. In this study, a UNet++ neural network with the spatial and channel squeeze & excitation (scSE) attention mechanisms (BathyUNet++) was employed to retrieve bathymetry from SuperDove imagery. To satisfy the patch-based input requirement of UNet++, the model was fully trained using two sources of data: clear-sky SuperDove image patches paired with Landsat-8-derived bathymetry and a limited set of ALB data. Validation results demonstrated that the model accurately retrieved bathymetry in regions independent of the training set.The proposed model and framework can be readily adapted to other HSR sensors, offering a promising approach for global HSR shallow-water bathymetry retrieval using multi-source satellite observations.

3:45pm - 4:00pm

MQTT-Enabled Federated Self-Learning Minimal Learning Machine Classifier for Real-Time Hyperspectral Processing

Anna-Maria Raita¹, Hadrien Louichon², Ilkka Pölönen¹

¹University of Jyväskylä, Finland; ²IMT Atlantique

Despite its potential in forestry, agriculture, environmental monitoring, safety surveillance, and defence, real-time hyperspectral imaging (HSI) remains challenging in practice because of the high dimensionality of the data and limited onboard computational resources.

This work introduces a distributed HSI classification framework that integrates federated learning, a Self-learning Minimal Learning Machine classifier (SL-MLM), adaptive Kalman filter-based model fusion, and lightweight MQTT-based communication on Raspberry Pi edge devices and a laptop serving as the base station. Acting as local nodes, Raspberry Pis process HSI data row by row, update their models recursively, and only exchange compact model parameters and classification results with the base station. HSI data in its raw form remains local.

The findings suggest that the proposed local learning workflow can be implemented on Raspberry Pi devices, and Kalman-based fusion improves stability and consistency in comparison to individual local models. The method is feasible in scenarios where the number of labelled data points is restricted, as the SL-MLM classifier can be initialized with a mere handful of class-specific reference points. The research demonstrates a feasible, low-cost approach to distributed

embedded HSI classification and sensing.

4:00pm - 4:15pm

Estimating inland water quality parameters using Wyvern Dragonette-001 hyperspectral imagery, a case study from the St. Lawrence River, Canada

Mohsen Ansari, Anders Knudby

Department of Geography, Environment and Geomatics, University of Ottawa, 75 Laurier Avenue East, Ottawa, ON, K1N 6N5, Canada

Monitoring inland Water Quality Parameters (WQPs) is essential for managing freshwater ecosystems and assessing anthropogenic impacts (Mishra et al., 2017). Satellite remote sensing provides a cost-effective and large-scale approach for monitoring inland WQPs. However, most existing satellite sensors have limited spectral resolution, restricting their ability to capture subtle optical variations expressed by inland WQPs, and/or insufficient spatial resolution to yield valid water-only pixels in narrow rivers or nearshore zones (Ansari et al., 2025).

Recent advances in hyperspectral satellite technology have created new opportunities for inland WQP monitoring. The Wyvern Dragonette-001, launched in April 2023, provides hyperspectral imagery with a spatial resolution of 5.3 m and 23 spectral bands within the visible to near-infrared range (500–800 nm) (Ansari et al., 2025; Wyvern Dragonette, 2023). Given its novelty, the potential of such imagery for assessing WQPs in inland water remains largely unexplored. A recent review (Ansari et al., 2025) evaluating the sensor’s spectral resolution and signal-to-noise ratio for retrieving inland WQPs indicated that Dragonette-001 is suitable for estimating non-algal particles (NAP) and shows potential for chlorophyll-a mapping, although it is likely unsuitable for retrieving Colored Dissolved Organic Matter (CDOM).

This study reports on a practical test that assessed the feasibility of using Wyvern Dragonette-001 imagery to retrieve turbidity, Suspended Sediments (SS), and Dissolved Organic Carbon (DOC) in a portion of the St. Lawrence River, Québec, Canada.

4:15pm - 4:30pm

Effect of Hyperspectral Data Compression on Data Pre-processing: Analysis of Reconstruction Error Propagation

Jannick Kuester^1,3, Wolfgang Gross¹, Simon Schreiner¹, Andreas Michel¹, Jannik Sheikh¹, Joshua Dare-Cullen², Michael Heizmann³

¹Fraunhofer IOSB; ²University of Exeter; ³Karlsruhe Institute of Technology KIT

Hyperspectral imaging produces vast data volumes that often exceed storage and transmission capacities on airborne and satellite platforms. This study systematically investigates the effects of lossy hyperspectral data compression on the scientific usability of the resulting data products. Using UAV-based HySpex acquisitions from the HyperThun’22 campaign, several state-of-the-art learning-based compression models were evaluated, including spectral, spatial, and spatio-spectral architectures. The analysis quantifies how compression-induced reconstruction errors propagate through the full pre-processing workflow, from raw digital numbers through radiometric calibration, geometric correction, and atmospheric correction to the final surface reflectance domain.

Results show that spectral models such as the Adaptive 1D Convolutional Autoencoder (A1D-CAE) achieve the highest fidelity, maintaining sub-degree spectral deviations and near-perfect structural similarity. In contrast, purely spatial or 3D convolutional models exhibit severe distortions that persist across all pre-processing levels. The findings demonstrate that lossy compression can be applied at the raw stage without compromising the integrity of reflectance products, provided that spectral correlations are explicitly modeled.

This work highlights the importance of selecting compression architectures consistent with sensor characteristics and pre-processing workflows and provides a quantitative foundation for future operational implementations of onboard hyperspectral compression in Earth observation missions.

4:30pm - 4:45pm

VNIR–SWIR hyperspectral spectroscopy and deep learning for nitrogen prediction in potato crops

Ehsan Chatraei Azizabadi, Masoomeh Gomroki, Nasem Badreldin

University of Manitoba, Canada

Efficient nitrogen (N) management remains a major challenge for sustainable potato production, particularly on coarse-textured soils prone to nutrient leaching. This study investigates the use of Visible–Near Infrared to Short-Wave Infrared (VNIR–SWIR, 350–2500 nm) hyperspectral spectroscopy for non-destructive, in-season estimation of petiole nitrate nitrogen (PNN) under both field and laboratory conditions. Spectral data were collected using an ASD FieldSpec Pro spectroradiometer and processed through Savitzky–Golay smoothing, Standard Normal Variate normalization, and first-derivative transformation. Variable Importance in Projection (VIP) analysis was employed to identify N-sensitive wavelengths, and three predictive approaches—One-Dimensional Convolutional Neural Network (1D-CNN), Support Vector Regression (SVR), and Partial Least Squares Regression (PLSR)—were compared for their predictive accuracy. Calibration transfer using Piecewise Direct Standardization (PDS) was applied to harmonize field spectra with laboratory measurements. Results showed that the 1D-CNN achieved the highest predictive performance (R² = 0.90, RMSE = 0.22%), outperforming SVR and PLSR. PDS improved field-based predictions by reducing spectral discrepancies caused by illumination and canopy variability. The findings highlight the potential of hyperspectral spectroscopy combined with deep learning and calibration transfer techniques to provide accurate and scalable diagnostics of plant nitrogen status. This research supports the integration of proximal sensing and data-driven models for precision nutrient management in potato systems and broader agricultural applications.

4:45pm - 5:00pm

A multi-scale strip-wise convnet for infrared image stripe removal

Wei You¹, Zhiqiang Bian², Xinbo Zhou², Yi Xu¹, Kaimin Sun¹

¹Wuhan University, China, People's Republic of; ²Shanghai Institute of Satellite Engineering, China, People's Republic of

This contribution presents a novel method for infrared image stripe remover that addresses the limitations of current approaches in difficulty of extracting structural information of stripes. The proposed framework integrates strip convlution layers with multi-size kernels in a dense connection to enhance stripe structural information expression in challenging conditions and provides more reliable results for practical applications. Experimental evaluations on multiple datasets demonstrate significant improvements compared to state-of-the-art methods. The method is designed to be computationally efficient and suitable for real-world deployment in fields.

5:00pm - 5:15pm

Unsupervised tree species classification with UAV ultra-high resolution multispectral imaging

Larissa Ane De Sousa Lima, Krzysztof Bakuła

Warsaw University of Technology

This paper aims to evaluate the performance of ISODATA clustering for tree species classification using ultra-high-resolution multispectral data collected with Unmanned Aerial Vehicle. The study focuses on two sites in Żednia forest district near the city of Bialystok, northeastern Poland. The input data consist of 10-band multispectral orthomosaics with a resolution of 10 cm, acquired from an UAV platform equipped with a MicaSense RedEdge-MX dual camera and image-based Canopy Height Model. The classifications were conducted at two levels of forest detail: forest types, including two classes (broadleaf and conifer), and tree species, comprising four classes in Study Area 1 and ten species in Study Area 2. Multiple classifications were generated, testing different input parameters such as the number of clusters and various combinations of input data. For the first level of classification (forest type), overall accuracies range from 84,09% to 97,57% in Study Area 1 and from 82,31% to 92,74% in Study Area 2. At the second level of classification (tree species), overall accuracies vary from 70.73% to 91.77% in Study Area 1 and from 36,51% to 72,33% in Study Area 2. Overall, ISODATA demonstrates robust performance in classifying forest types in both study areas. However, performance in classifying tree species varies across different classes, with relatively high accuracies observed for certain species such as spruce, pine, oak, larch, and birch. The results underscore the potential of multispectral UAV data and unsupervised classification methods for accurately classifying tree species.

3:30pm - 5:15pm

WG II/1B: Image Orientation and Fusion
Location: 715B

3:30pm - 3:45pm

ATOM-ANT3D in Action: 3D Surveying from Confined Spaces to Urban Environments

Ahmad El-Alailyi^1,2, Luca Perfetti³, Fabio Remondino², Francesco Fassi¹

¹3D Survey Group, ABC Department, Politecnico di Milano, Milano, Italy; ²3D Optical Metrology (3DOM) Unit, Bruno Kessler Foundation (FBK), Trento, Italy; ³Department of Civil, Architectural, Environmental Engineering and Mathematics (DICATAM), Università degli Studi di Brescia, Brescia, Italy

This work presents a multi-camera mobile mapping solution designed to deliver accurate and efficient 3D reconstructions across a wide variety of challenging environments, ranging from confined indoor spaces to complex urban outdoor settings. Traditional photogrammetric and terrestrial laser scanning approaches, while capable of high accuracy, often suffer from limitations related to acquisition speed, logistical complexity, and significant post-processing effort—especially in large, occluded, or hard-to-access sites. Mobile Mapping Systems (MMS) based on Visual SLAM (V-SLAM) offer a compelling alternative, thanks to their ability to acquire high-frequency imagery in continuous motion and estimate sensor trajectories in real-time. However, MMS outputs frequently face issues such as reduced geometric accuracy, scale drift in monocular sequences, and the need for extensive optimisation to reach survey-grade results.

To address these limitations, the study extends an existing multi-camera V-SLAM pipeline by tightly integrating monocular estimates with multi-stereo trajectories within the ATOM-ANT3D fisheye multi-camera system. A novel monocular scale-recovery strategy is introduced, based on path-length ratios derived from concurrently recorded stereo tracks. This metrized monocular trajectory is then fused with stereo estimates through a robust pose graph optimisation, followed by a multi-view, feature-based refinement leveraging pre-calibrated camera geometry.

The proposed method is evaluated across four real-world scenarios—spiral tower staircases, dark underground caves, narrow urban corridors, and constrained industrial pipelines. Accuracy is assessed against reference 3D point clouds, while efficiency is compared to a standard multi-view stereo photogrammetric pipeline. Results demonstrate that the integrated approach significantly improves reconstruction consistency, robustness, and end-to-end throughput.

3:45pm - 4:00pm

Shape2Match: A Shape-to-Matching Framework for Infrared and Visible Image Matching

Maoyu Wang, Xulei Shi, Zhuolu Hou, Xinbo Zhao, Xin Huang, Yifan Liao, Yansong Duan, Pengjie Tao

School of Remote Sensing and Information Engineering, Wuhan University, China, People's Republic of

Traditional image matching methods rely heavily on gradient or intensity information. However, the severe nonlinear radiometric distortion (NRD) between infrared and visible images hinders the extraction of repeatable feature points, leading to poor matching performance. To address this, we propose Shape2Match, a novel framework that replaces point features with more consistent, modality-invariant shape features. Specifically, the method utilizes EfficientSAM to extract shape contours and employs elliptic fourier descriptors (EFD) to parameterize and normalize them, creating shape descriptor that is invariant to translation, rotation, and scale. Shape2Match adopts a coarse-to-fine hierarchical strategy: it first performs robust global shape matching using a weighted EFD distance, followed by precise keypoint matching—using Shape Context—within the coarsely aligned shape pairs. We validated Shape2Match on 153 image pairs from 6 datasets, comparing it against methods like SIFT, RIFT, and MS-HLMO. Experimental results demonstrate that Shape2Match achieves a 100\% success rate (SR) across all datasets and significantly outperforms other methods in the number of correct matches (NCM), proving its effectiveness and robustness against NRD, rotation, and scale variations.

4:00pm - 4:15pm

Historical images for surface topography reconstruction intercomparison experiment (Historix)

Amaury Dehecq¹, Friedrich Knuth^2,3, Joaquin M.C. Belart⁴, Livia Piermattei⁵, Camillo Ressl⁶, Robert McNabb⁷, Adina Racoviteanu¹, David Shean⁸, Luc Godin¹

¹University Grenoble Alpes, IRD, CNRS, Grenoble INP, IGE, Grenoble, France; ²Laboratory of Hydraulics, Hydrology and Glaciology (VAW), ETH Zurich, Zurich, Switzerland; ³Swiss Federal Institute for Forest, Snow and Landscape Research (WSL), Birmensdorf, Switzerland; ⁴Natural Science Institute of Iceland, Akranes, Iceland; ⁵Department of Geography, University of Zurich, 8057 Zurich, Switzerland; ⁶TU Wien, Department of Geodesy and Geoinformation, Vienna, 1040, Austria; ⁷School of Geography and Environmental Sciences, Ulster University, BT52 1SA Coleraine, UK; ⁸Department of Civil and Environmental Engineering, University of Washington, Seattle, WA, USA

Historical film-based images, acquired by aerial sensors since the 1930s and by satellite platforms since the 1960s, provide a unique opportunity to document changes in the Earth surface over the 20th century. Yet, they present significant and specific challenges, including complex distortion in the scanned image pixel grid and poorly known camera exterior and interior orientation. In recent years, semi- or fully-automated approaches, based on photogrammetric and computer vision methods, have emerged, but the performance and limitations of these methods have yet to be directly compared.

The objectives of the Historical Images for Surface Topography Reconstruction Intercomparison eXperiment (Historix) project are to compare existing methods for processing stereoscopic historical images and harmonize processing tools. Here we present the study site and dataset selected for this comparison, the design of the intercomparison and evaluation metrics, as well as preliminary results. Full evaluation will be presented at the conference.

4:15pm - 4:30pm

Geolocation enhancement of space borne cameras: the SAR-Optic approach

Philippe Nonin¹, Alexis Barot¹, Pascal Favé², Laurent Gabet¹, Mathilde Jaussaud², Wolfgang Koppe³, Alain Orsoni²

¹Airbus, France; ²Ign, France; ³Airbus, Germany

The location accuracy of an image acquired with a space borne camera relies on the knowledge of the orbit of the spacecraft and the orientation of the camera. The a posteriori estimation of a satellite orbit has been a well mastered technique for a long time. Sub-meter accuracy is achievable with a reasonable effort. The geolocation, with a similar accuracy, of the line of sight of an optical instrument flying at 500km or above is a much more challenging task..

On the other hand, the geolocation of a synthetic aperture radar (SAR) image depends only on the orbit of the spacecraft. It is, therefore, easy to acquire space borne SAR images with a sub-metric native geolocation. The Airbus SAR constellation (TerraSAR-X, TanDEM-X and PAZ) provides, on a commercial basis, images with a (better than) 0.2m geolocation accuracy.

The ability to find, through image matching, homologous points in SAR and optical images would transfer the native accuracy of SAR to optical observations, using classical photogrammetric bundle adjustment.

This paper describes an operational way to perform this SAR/Optic images matching and a validation of the absolute location accuracy achieved.

4:30pm - 4:45pm

Comparative analysis of mainstream image matching methods for georeferencing Tianwen‑1 HIRIC imagery without ground control points

Zhuolu Hou, Aomei Zhang, Yunfei Hu, Xulei Shi, Pei Mi, Penjie Tao, Tao Ke

School of Remote Sensing and Information Engineering, Wuhan University, China, People's Republic of

High-precision mapping of planetary surfaces, such as Mars, relies on matched control points derived from existing georeferenced data, as ground control points (GCPs) cannot be obtained through field measurement. However, the handcrafted image matchers like SIFT limit the robustness of this approach, particularly on texture-scarce and self-similar Martian terrain. While deep learning-based matchers offer a new paradigm, their performance gain for bundle adjustment remains inadequately quantified. This paper systematically evaluates four matchers (hand-crafted SIFT and deep learning-based DOG+HardNet+LightGlue, DISK+LightGlue, and LoFTR), assessing their impact on georeferencing tasks using Tianwen-1 high-resolution imagery. Deep learning methods, such as LoFTR, generate more correspondence points with a more uniform spatial distribution, halving the outlier rate and improving bundle adjustment accuracy by 10%. Our study provides a benchmark for planetary mapping and shows that powerful, learning-based image matchers are pivotal for next-generation automated mapping systems.

4:45pm - 5:00pm

Transforming National Air Photo Archives into Analysis-Ready Geospatial Products

Mozhdeh Shahbazi, Evangelos Bousias Alexakis, Mikhail Sokolov, Ella Mahoro, Pierre Gravel

Canada Centre for Mapping and Earth Observation, Natural Resources Canada, Canada

This work investigates the solutions developed at Natural Resources Canada to produce analysis-ready mapping products from Canada's national air photo library including two main workflows: 1) The photogrammetric processing of historical photos with an emphasis on the more challenging automated georeferencing component; 2) Enhancing interpretability through generative artificial intelligence models for super-resolution and deep colorization.

5:00pm - 5:15pm

The Project evalAT for Investigating the Accuracy of Aerotriangulations in Map Projections

Camillo Ressl¹, Norbert Pfeifer¹, Christine Ressl², Andreas Bayr²

¹TU Wien, Austria; ²BEV – Bundesamt für Eich und Vermessungswesen, Abteilung G2 – Fernerkundung, Wien, Austria

The accuracy of the aerial triangulation (AT) performed in the map projection for a GNSS-INS-supported image block consisting of 4342 vertical images, GSD 20 cm, with 22 main strips and 5 cross strips is investigated. Using 169 check points the obtained results are compared with the accuracy achieved by running the AT in an undistorted tangential system. It turns out, that in both systems the same accuracies can be achieved, with RMSE in (X, Y, Z) of (7, 10, 11) cm, if Earth curvature and scale distortion are correctly modelled in the map projection. If the scale distortion is not considered, then the RMSE in Z increases by 100% to 300% (depending on the height distribution of the GCPs). In AT software packages, that do not consider the scale distortion, a partial compensation is possible by either adapting the height of the projection centres or the principal distance leading to RMSE of around (10, 11, 15) cm.

3:30pm - 5:15pm

IvS10: Innovation in River Ice and Surveillance and Modeling: Best Practices and Emerging Technologies
Location: 716A

3:30pm - 3:45pm

Mapping the Structural Complexity of Vancouver Island’s Forests with Deep Learning and LiDAR–Sentinel Data Fusion

Luizmar De Assis Barros, Michelle Venter, Oscar Venter

University of Northern British Columbia, Canada

Forest structural complexity (FSC) reflects the three-dimensional arrangement and distribution of forest elements and serves as a key ecological indicator of biodiversity and forest productivity. Decades of overharvesting have transformed many temperate rainforests into young, homogeneous stands. Given the central role of FSC in ecosystem functioning, silvicultural strategies increasingly aim to retain or enhance structural complexity and mitigate the ecological impacts of timber harvesting. Monitoring structural development across silvicultural treatments, environmental gradients, and disturbance regimes is therefore essential. However, large-scale assessments of FSC remain limited. In this study, we evaluate the scalability of canopy entropy (CE), a LiDAR-derived FSC index, using deep learning applied to multisensor radar and optical imagery. We trained a U-Net convolutional neural network using airborne LiDAR-derived CE as the reference variable and Sentinel-1 and Sentinel-2 data as wall-to-wall predictors. The model demonstrated strong overall predictive performance (R² = 0.80, MAE = 0.09, bias = 0.02, nRMSE = 12.2%). However, the horizontal complexity component of CE (CExy) exhibited substantially lower accuracy. Although aspects of horizontal complexity may be indirectly inferred from vertical structure or canopy cover, CE should be interpreted with caution. Future work should focus on improving the representation of horizontal complexity. Despite these limitations, the resulting CE map provides a foundation for evaluating silvicultural practices and identifying structurally complex forests with high conservation value.

3:45pm - 4:00pm

Scaling LiDAR-derived forest biomass to optical and RADAR satellite imagery in peatlands: a systematic review and meta-analysis of modelling approaches and sensor performance

Jack Goldman¹, Sophie Wilkinson², Koreen Millard¹

¹Department of Geography and Environmental Studies, Carleton University, Ottawa, Ontario, Canada; ²School of Resource and Environmental Management, Simon Fraser University, Burnaby, British Columbia

Wildfire severity, often correlated with biomass loss, has increased since the 1980s, driving greater biomass depletion across landscapes. Canada's 2023 wildfire season burned over 15 million hectares and released 647 TgC of carbon, surpassing most nations' annual emissions. This trend underscores the need for scalable aboveground biomass (AGB) monitoring for greenhouse gas estimation. While LiDAR has improved AGB estimation, airborne systems remain costly with limited spatial coverage. Researchers have addressed this by scaling LiDAR-derived estimates to satellite imagery for broader monitoring. However, current scaling paradigms are developed predominantly for closed-canopy forests, with limited evaluation in open-canopy ecosystems like peatlands, despite their high fire severity and disproportionate carbon contributions when burned. Peatlands pose unique challenges: low and spatially heterogeneous AGB, open canopies that allow soil and water to obfuscate satellite signals, and non-linear structural-biomass relationships in sparse vegetation.

This systematic review and meta-analysis examines the accuracy of scaling LiDAR-derived AGB estimates to optical and radar satellite imagery across peatlands and structurally analogous ecosystems, including tropical savannas, floodplain forests, mangroves, and arctic shrublands. We searched Scopus and Google Scholar using a four-block query, yielding 271 peer-reviewed studies. Using a random-effects model, R² values were transformed to Fisher's Z scores, and heterogeneity was quantified using the I² statistic.

Preliminary analysis revealed no significant difference between modelling approaches and target ecosystem. Heterogeneity was minimal, indicating model type and ecosystem type exert limited influence on accuracy outcomes. Full dataset analysis is ongoing.

4:00pm - 4:15pm

Habitat suitability mapping using satellite imagery and continuous landscape inventory CLI: a case study for new Brunswick, Canada

Susham Biswas³, Aman Anand², Ankit Sharma¹, Rakesh Mishra⁴, Yun Zhang⁵

¹Rajiv Gandhi Institute of Petroleum Technology, India; ²Rajiv Gandhi Institute of Petroleum Technology, India; ³Rajiv Gandhi Institute of Petroleum Technology, India; ⁴University of New Brunswick, Canada; ⁵University of New Brunswick, Canada

Habitat suitability models are central to conservation

planning, species management, and landscape-level decision

support. Continuous Landscape Inventory (CLI)

datasets provide stand-level forest attributes (species mix,

height, basal area, crown closure, age, disturbance history)

that are rarely used at scale together with satellitederived

biophysical indicators for operational habitat mapping.

This work proposes a replicable workflow that fuses

provincial CLI with multisensor satellite data (Sentinel-

2 MSI, Landsat series, and SAR-derived structure proxies)

and environmental layers (elevation, distance-to-water,

road density) to produce fine-scale habitat suitability surfaces

across New Brunswick, Canada.

4:15pm - 4:30pm

Quantifying Wildfire Impacts on Carbon Stock from Remote Sensing based Forest Disturbance and Recovery Monitoring

Qin Ma¹, Roger Bales², Mike Goulden³, David M. Bell⁴

¹School of Geography, Nanjing Normal University, Nanjing 210023, China; ²School of Engineering and Environmental Systems Graduate Group, University of California, Merced, CA 95343, USA; ³Department of Earth System Science, University of California, Irvine, CA 92697, USA; ⁴Pacific Northwest Research Station, USDA Forest Service, 3200 SW, Jefferson Way, Corvallis, OR 97331, USA

Wildfires significantly impact forest ecosystems by disrupting carbon cycles, with effects varying based on fire intensity and forest bio-physical characteristics such as vegetation types, structures, topography, and climate. These factors collectively influence fire spread, biomass reduction, and post-fire vegetation regrowth, making it crucial to accurately quantify wildfire impacts on forest carbon dynamics for understanding terrestrial-atmosphere interactions and global climate implications.

This study uses wildfires in California's mountainous forests as a case study, employing two aboveground biomass (AGB) datasets—one derived from remote sensing data and the other from process-based ecological models—to assess wildfire impacts on forest carbon stocks. Remote sensing-based indices, while effective in detecting spectral changes, often fall short in quantifying biophysical alterations, particularly carbon dynamics. Conversely, process-based models adhere to ecological principles but may not fully capture fire-induced carbon changes. Our analysis reveals significant variations in post-fire disturbance and recovery patterns based on fire severity, elevation, and forest type. The remote sensing dataset showed faster initial recovery, likely due to herbaceous vegetation greening, while the ecological model dataset indicated slower, more stable recovery, reflecting delayed tree regeneration. These findings underscore the necessity of integrating multi-source datasets to accurately capture post-fire carbon dynamics.

4:30pm - 4:45pm

Wild Fire Early warning system: Global and Canadian Perspectives

Ankit Sharma¹, Aman Anand², Susham Biswas³, Rakesh Mishra⁴, Yun Zhang⁵

¹Rajiv Gandhi Institute of Petroleum Technology, UP, INDIA; ²Rajiv Gandhi Institute of Petroleum Technology, UP, INDIA; ³Rajiv Gandhi Institute of Petroleum Technology, UP, INDIA; ⁴University of New Brunswick, Canada; ⁵University of New Brunswick, Canada

Wildfire Early Warning Systems (EWS) are increasingly essential as climate-driven extreme fire events grow in frequency

and severity. Yet their maturity and operational robustness vary widely across countries due to differences in

resources, data infrastructure, and institutional capacity. This study conducts a systematic global assessment of wildfire

EWS across high-, middle-, and low-income nations, evaluating how multisensor Earth Observation (EO) data and predictive

intelligence are integrated into functional early warning and decision-support systems. A transparent benchmarking

framework is introduced with two core pillars: (i) multisensor geospatial monitoring—assessing temporal resolution,

spectral sensitivity, spatial detail, and GEO–LEO fusion; and (ii) hotspot intelligence and predictive modeling—evaluating

model class, forecast range, validation practices, and real-time operational performance. These pillars are complemented

by an impact-readiness layer aligned with the Sendai Framework, linking hazard detection to exposure, vulnerability, and

alert dissemination.

Results show strong stratification by income. High-income countries achieve near–real-time hotspot detection, GEO–LEO

data fusion, and validated multi-day behaviour forecasts. Middle-income nations display transitional but uneven progress,

while low-income countries rely almost exclusively on global detection platforms, highlighting institutional, not technological,

bottlenecks. Canada’s EWS landscape is evaluated, highlighting gaps in accessibility, standardization, and

timeliness of EO-derived intelligence. Opportunities for strengthening Canada’s system include adoption of emerging EO

technologies, improved fuel characterization, next-generation hybrid physics–ML/QML behaviour modeling, integrated

national decision-support platforms, and enhanced FireSmart community interfaces.

Overall, the study provides a scalable global framework for comparing national wildfire EWS maturity, identifying

investment priorities, and guiding future improvements.

4:45pm - 5:00pm

Integrating UAV imagery and deep learning for small-scale land cover classification in post-rehabilitated ecosystems

Ryan Siu¹, Daniel Nelson¹, Yuhong He¹, Jiali Shang²

¹University of Toronto, Canada; ²Agriculture and Agri-Food Canada

This project explores how drones and deep learning can help monitor the recovery of former aggregate and mining sites. Traditional methods for assessing land restoration such as field surveys and satellite imagery are often time-consuming, expensive, and limited in detail. Using high-resolution drone imagery and a compact deep learning model, this study offers a faster and more flexible way to track how vegetation and land cover change over time.

The approach classifies ground surfaces into three simple categories: healthy vegetation, stressed vegetation, and bare soil or rock - providing clear indicators of how well a site is recovering after extraction/rehabilitation. Tested at two rehabilitated sites in southern Ontario, the model showed strong and consistent results across different months of the growing season, even using only standard colour drone imagery.

This work highlights how drone-based monitoring can make ecological restoration assessment more efficient, objective, and repeatable. Once trained, the model can quickly analyze new imagery without the need for extensive fieldwork, allowing land managers and regulators to identify problem areas and track recovery in near real time. Ultimately, this research points toward a future where rapid, data-driven drone assessments play a role in supporting sustainable land rehabilitation and environmental stewardship.

5:00pm - 5:15pm

Anomalous Moisture Signal in Sentinel-2 Imagery Precedes Overwintering Wildfire

Quinn E. Barber¹, Sophie L. Wilkinson², Koreen Millard¹

¹Carleton University, Department of Geography and Environmental Studies, 1125 Colonel By Drive, Ottawa, ON, Canada, K1S 5B6; ²Simon Fraser University, School of Resource and Environmental Management, 8888 University Drive, Burnaby, BC, Canada, V5A 1S6

Deep, persistent drought in 2023 in the Canadian Boreal Plains was associated with wildfires that persisted underground and re-emerged the following spring, a process known as "overwintering" and sometimes called "zombie fires". We analyzed pre-fire Sentinel-2 multispectral imagery of paired 2023-2024 fires to extract any spectral anomalies, with the goal of characterizing conditions conducive to wildfire overwintering. We assessed several spectral indices, including NDVI, GNDVI, EVI, NDMI, TCW, and others relative to a 2016-2022 baseline using the npphen R package.

We found that sites of overwintering fires exhibited moisture anomalies in the spring of 2024, indicating drought conditions that were conducive to the reemergence of overwintering fires. We show how these anomalies were co-located with early season wildfire with an apparent absence of ignition events. Furthermore, we show how in 2024, 25 overwintering wildfires burned 22.8% of the total area burned, while comprising only 1.3% of the total fire count.

3:30pm - 5:15pm

Forum5C: From Science to Action: Advancing Global Agricultural Monitoring for Food Security and Resilience
Location: 716B

3:30pm - 5:15pm

Forum11: Canadian Earth Observation Supersites for Technology Advancement and Research
Location: 717A

3:30pm - 5:30pm

P5: Poster Session 5
Location: Exhibition Hall "E"

Musings on Doctoral Level Geospatial Education: Lessons from the EPSRC CDT in Geospatial Systems

Jon Mills¹, Stuart Barr¹, Jamie Stogden¹, Phyllida Bailey¹, Stuart Marsh², Panos Psimoulis², Esther Little², Doreen Boyd³

¹School of Engineering, Newcastle University, Newcastle upon Tyne, UK; ²Faculty of Engineering, University of Nottingham, Nottingham, UK; ³School of Geography, University of Nottingham, Nottingham, UK

The EPSRC Centre for Doctoral Training (CDT) in Geospatial Systems was established in 2019 with a vision to establish an internationally recognised centre of excellence and an ambition to graduate 50 doctoral students across five annual cohort intakes. Since that time, the CDT has been delivered through a strategic partnership between Newcastle University and the University of Nottingham in the UK, together with c. 40 external partners from global academia, international industry and UK Government. The first doctoral students graduated from the CDT in July 2024, with the final students expected to complete their PhD studies in 2028. This paper provides an overview of the training structure and skills development initiatives implemented and offers critical reflections on the experiences and challenges encountered throughout the CDT’s lifetime to date (February 2026). While the content will be of particular interest to academics and stakeholders involved in any branch of geospatial doctoral training, many of the findings are transferable. As such, the insights presented may also be of value to the wider academic community, particularly those considering the establishment of similar cohort-based doctoral training models.

Building a unified DEM analysis tool for the CO3D mission

Valentine Bellet¹, Romain Hugonnet², Amaury Dehecq³, Alice de Bardonnèche Richard⁴, Marine Bouchet⁴, Valentin Schaffner¹, Emmanuel Dubois¹

¹CNES, France; ²University of Alaska Fairbanks, United States; ³Institut des Géosciences de l’Environnement (IGE), France; ⁴CS GROUP, France

The CO3D mission, launched in July 2025, aims to reconstruct the Earth’s continental surface in 3D using pairs of synchronous satellite images, generating a Digital Surface Model (DSM) at 1 m Ground Sampling Distance (GSD). Assessing the quality of these DSMs requires an inter-DSM comparison tool, leading CNES to collaborate with the GlacioHack collective and join the governance of their open-source software xDEM.

Originally developed for glacier research, xDEM already offered valuable features for DEM analysis including coregistration, uncertainty analysis, geomorphological terrain attributes computation, etc. Recognizing its potential, CNES made the strategic choice to no longer maintain its own tool and instead contribute to xDEM. The main contributions include the ability to rapidly obtain statistics, scalability improvements through tiling, and the introduction of a command-line interface.

This collaboration has created a more robust tool that benefits both the CO3D mission and the broader scientific community. By combining resources and expertise, the project demonstrates how open-source development can drive innovation while reducing duplication of effort.

DINAMIS: The French National online Facility dedicated to Mutualization and Sharing of very high Resolution Satellite Imageries for Non-commercial Applications

Jean-Francois Faure¹, Anna Cristofol², Etienne Berthier³, Nadia Landachoco⁴, Jean-Philippe Caminade¹, Jean-Paul Sempère², Stéphane Dupuy⁵, Pierre Maurel⁶

¹IRD, France; ²IGN, France; ³CNRS, France; ⁴CNES, France; ⁵CIRAD, France; ⁶INRAE, France

DINAMIS is a French national initiative designed to provide streamlined, cost-effective access to very high-resolution satellite imagery for research, public policy, and innovation. Coordinated by major public institutions—including CNES, IGN, INRAE, and several academic partners—DINAMIS acts as a single entry point for users who need high-quality Earth-observation data to support scientific studies, environmental monitoring, land-use analysis, and operational pu-blic-sector missions.

The platform facilitates access to a range of commercial satellite constellations, most notably Pléiades, Pléiades Neo, and SPOT 6/7, which offer imagery with spatial resolutions from sub-meter to a few meters. Users can request both archived scenes and new acquisitions, enabling them to obtain data tailored to their geographic area and temporal needs. DINAMIS also pro-vides standardized licensing conditions that simplify data sharing within research teams and public organizations.

A key objective of DINAMIS is to democratize the use of very high-resolution imagery by re-ducing financial barriers. Academic and public-interest projects often benefit from free or highly subsidized access, encouraging the development of innovative applications in fields such as agriculture, forestry, natural hazards, coastal management, and urban planning.

By centralizing requests, ensuring data quality, and supporting users throughout the process, DINAMIS strengthens France’s Earth-observation ecosystem and fosters collaboration between scientists, government agencies, and technology developers. Ultimately, DINAMIS contributes to a more informed understanding of the environment and helps public authorities make evi-dence-based decisions for sustainable territory management.

TNE-GPSEducation Advanced Skills for Green Sustainable Environment: An Earth Observation Hub pathway (at ENSMR, Morocco)

Raffaella Brumana¹, Marzia Gabriele¹, Tawfik El Moussaoui², Wafae El Harraki², Nicola Genzano¹

¹Politecnico di Milano, Dept. of Architecture, Built Environment and Construction Engineering (DABC), Via Ponzio 31, 20133 Milan, Italy; ²Mines School of Rabat (ENSMR), Department of Mines, Avenue Hajj Ahmed Cherkaoui BP 753, Agdal, Rabat 10100, Morocco

The “Green & Pink for Sustainable Education” (TNE-GPSEducation) project strengthens international cooperation between ten Italian universities and partner institutions worldwide, promoting multidisciplinary training in sustainability. The initiative integrates expertise in natural resource monitoring, socio-environmental resilience, innovative teaching, health, and gender equality. Partner countries—including Brazil, Argentina, Cambodia, Thailand, Palestine, Georgia, Morocco, China, and Vietnam—play strategic cultural and academic roles and are central to recent international efforts to foster joint education, research, and innovation. Through mobility and capacity-building actions, lecturers, staff, and students enhance their skills while acquiring transferable competencies usable across institutions. Italy’s broader cooperation policies, aligned with UN, EU, and CRUI–CUCS strategies, further support partnerships such as the MoUs signed by POLIMI with ENSMR and UIR in Morocco.

Within this framework, WP4 “Advanced Skills” represents the project’s core, merging socioeconomics, Earth Observation (EO), Nature-Based Solutions (NBS), and health. Five Long Life Learning Courses have been modularised to establish an EO Hub at ENSMR, serving as a regional network node. A Call for Applications invites professors and researchers to attend AS-LLLC programmes at POLIMI, covering EO techniques, BIM–XR workflows, NBS design, LULUCF-based EO monitoring, and decarbonisation methods.

The EO Hub Pathway links global-to-local scales through (a) the systematic use of global EO programmes; (b) LULUCF-aligned indicators and multi-decadal satellite analyses; (c) site-specific phenological monitoring for regenerative agriculture; (d) carbon-removal computation through NBS; and (e) XR/VR tools for immersive awareness raising. Together, these elements support adaptive strategies, MRV systems, regenerative practices, and innovative land-management approaches for regions facing degradation and climate challenges.

Geospatial technology application in factorial ecology of human population in Nepal

Umeshkumar Mandal¹, Kabita Kumari²

¹Central Department of Geography,Tribhuvan University, Nepal; ²Associated to Bernhardt College, Kathmandu, Nepal

Exploration of socio-spatial pertinent dimensions of human population and its geo-spatial distribution in Nepal has been a foremost concern of planners and researchers for development. An input data matrix of 75 X 88 representing Nepal’s demographic, socio-economic, and environmental variables were used to investigate spatial pattern of latent fundamental characteristics and to examine their geo-spatial variability by integrated use of RS, GIS, GPS, Factor Analysis, and ANOVA. Six fundamental socio-spatial dimensions of human population explaining 74.0 percent of total variance were investigated. Demographic was the most prominent and significant dimension accounting for 27.0 percent of the total variance spatially clustered in Terai region indicating demographic pressure: old dependency and family size and also evident by Factorial Areas Analysis (FAA). Facility-Education Dimension was the second most dominant accounting for 19.62 percent of total variance spatially having insignificant geographic variability. Maize production and Ethnic Dimension was found as the third dominant dimension and was significantly concentrated in eastern mountain and hill districts, characterized as high dominancy in ethnic and language issue. Mother Tongue- Marriage age was the fourth accounting for 9.47 percent of total variance spatially clustered on EDR significantly spatial variability among development regions. Kathmandu district locating lower-left corner of both axes indicating the free from both pressure of old dependency and large family size. Family size- Wheat production was the least important dimension, significantly different and spatially distributed in Terai Region. The study demonstrates the usefulness of geospatial technology for demographic, and production planning, and sustainable regional policy in Nepal.

Spatiotemporal Assessment of Black and Organic Carbon Deposition Characteristics over Korba, Chhattisgarh

Aman Srivastava, Aditya Kumar Thakur, Rahul Dev Garg, Pradeep Kumar Garg

Indian Institute of Technology Roorkee, India

Black Carbon (BC) and Organic Carbon (OC) are among the most influential aerosol species affecting air quality, radiative forcing, and climate interactions, especially in regions dominated by coal-based industries. Understanding their temporal behaviour and associated deposition processes is critical for assessing pollution dynamics and guiding regional mitigation measures. Korba, located in Chhattisgarh, India, is widely known as the “Power Hub of India” due to its dense cluster of coal-fired thermal power plants, aluminium smelters, and mining activities, making it an ideal location to examine carbonaceous aerosol loading. The primary objective of this study is to quantify monthly variations in BC and OC and evaluate their atmospheric interactions and deposition characteristics during the study period. Methodology involved extracting BC and OC fractions, including hydrophilic (BCPI, OCPI) and hydrophobic (BCPO, OCPO) components, along with dry and wet deposition fluxes and meteorological drivers such as relative humidity, temperature, pressure, and precipitation. The results show that BC ranged from 3.97×10⁻⁹ to 1.00×10⁻⁸, while OC exhibited higher values between 7.68×10⁻⁹ and 2.24×10⁻⁸, indicating dominance of organic aerosols over black carbon. Dry deposition of BC was significantly high (up to 2.29×10⁹), whereas wet deposition remained several orders lower (≈1.75×10⁻¹² to 1.19×10⁻¹¹). Meteorological conditions, including RH (23–87%) and temperature (290–308 K), modulated concentrations and deposition behaviour. Overall, the study highlights substantial BC–OC loading driven by industrial and combustion sources in Korba. The conclusion emphasizes the need for cleaner combustion practices, while future work may integrate chemical transport modelling to identify precise source contributions.

The Application of Unmanned Aerial Vehicle and Lidar in Undergraduate Education of Geographic Information Science in Beijing City University

Yunqing Li, Yuan Meng, Xiaolei Chu, Yanchao Huang, Haiying Mao, Yifei An

School of Urban Construction, Beijing City University, Beijing, People's Republic of China

The school of urban construction in Beijing City University (BCU) is committed to cultivating application-oriented talents who serve for urban planning, urban construction and urban management. The Geographic Information Science (GIS) program in our university began in 2019. It is carried out on the basis of the investigation of the national needs, the industry development, and the actual situation of our university and other universities in Beijing. Based on the above analysis, we have explored Unmanned Aerial Vehicle (UAV) remote sensing technology and LiDAR as two of the training orientations, focusing on the training of data acquisition and processing capabilities using UAV and LiDAR. We have carried out a lot of explorations and practice in curriculum structure and practical teaching. Student's professional ability is obviously improved. Their competitiveness is significantly enhanced.

Digital Imaging Applications or Fabrications: Preserving Academic Integrity in a Geomatics Engineering Technical Elective Course

Ivan Detchev

University of Calgary, Canada

This is an abstract for a paper on best pedagogical practices in engineering education. In particular, the paper will focus on a project-based course involving group work. Post pandemic, the course has been run twice. In both iterations there were serious breaches of academic integrity. This happened even though reasonable measures to prevent cheating had been put into place. The aim for future offerings of the course would be to preventatively tighten those measures and in the unfortunate scenario that cheating happens again to explore tools for its early detection.

EuroSDR e-learning for strengthening capacity in the geospatial domain

Anka Lisec¹, Markéta Potůčková², Joep Crompvoets³, Conor Cahalane⁴

¹University of Ljubljana, Faculty of Civil and Geodetic Engineering, Ljubljana, Slovenia; ²Charles University in Prague, Faculty of Science, Prague, Czechia; ³Public Governance Institute, KU Leuven, Belgium; ⁴Maynooth University, Department of Geography, Ireland

Due to rapidly advancing technology and increasing societal needs, there is a significant demand for capacity building in the geospatial domain, which involves developing the skills, knowledge, and resources of individuals and organisations. The European association EuroSDR, a not-for-profit international network organisation linking National Mapping and Cadastral Agencies (NMCA) with research institutes and universities in Europe, recognised the challenges of skills development in the geospatial domain more than two decades ago. The EduServ annual training programme, organised under the EuroSDR umbrella, is a well-established and internationally recognised series of e-learning courses in photogrammetry, remote sensing, and geospatial information (GI) science. Since its inception in 2002, it has primarily aimed to transfer knowledge from EuroSDR research projects to the wider GI community. In recent years, interest in EduServ courses has increased significantly, and the topics have expanded to address new geospatial technologies and growing societal needs. This paper aims to share EuroSDR’s experience in distance education with the wider scientific community. Rather than limiting EuroSDR expertise to the European GI community, European mapping agencies can share their knowledge and experience with the international GI community.

India’s Geospatial Information Management in the Global Geopolitical Landscape

Nithiyanandam Yogeswaran, Swathi Kalyani

Takshashila Institution, India

The discussion on this topic is particularly relevant, as the changing geopolitical landscape has impacted the dissemination of geospatial data globally, as evidenced by reduced NASA funding for Earth and atmospheric studies, as well as the recent US government shutdown. The political alliances of countries also restrict the data availability during critical periods, such as war or disaster. This reminds countries to invest on sovereign geospatial dissemination portals to sustain research, innovation, and public discourse. At the same time, the emerging global conflicts open a new window of opportunity for India’s “Unified Geospatial Portal,” which is under development to become a predominant source not just for India but for the global community to leverage datasets generated by India's satellites, covering India and beyond.

Heritage at Risk and Pedagogical Approaches: Training Professionals in Digital Documentation for UNESCO World Heritage Sites Under Threat at the Saint-Sophia Cathedral Complex in Kyiv, Ukraine.

Miquel Reina Ortiz¹, Mario Santana Quintero², Serhii Revenko³

¹Université de Montréal, Montréal, Canada; ²Carleton University, Ottawa, Canada; ³UNESCO Antenna Office in Ukraine, Kyiv, Ukraine

This paper presents a tailored pedagogical approach to digital heritage documentation in contexts where heritage is under threat. It was developed during the July–August 2024 UNESCO/ICOMOS mission to Kyiv, Ukraine, within the UNESCO/Japan Funds-in-Trust project “Support for Ukraine in Culture and Education through UNESCO / Emergency response for World Heritage and cultural property: damage assessment and protection,” in relation to the UNESCO World Heritage property “Kyiv: Saint-Sophia Cathedral and Related Monastic Buildings, Kyiv-Pechersk Lavra.” The mission focused on the Metropolitan’s Residence and the Bell Tower of the Saint-Sophia Cathedral Complex. In parallel with the production of documentation for emergency preparedness and future conservation assessment, the mission implemented a dedicated capacity-building programme for the staff of the National Conservation Area “Sophia of Kyiv.” The paper discusses five interconnected components of this training programme: preparation before the mission, structure and content of the sessions, training activities and didactic material, learning outcomes and targeted competencies, and adaptive responses to a conflict-affected environment. The case study shows that integrating training within an active documentation workflow can strengthen both the immediate value of the records produced and the longer-term capacity of local professionals to support conservation, monitoring, and risk preparedness at World Heritage sites under threat.

Cloud-based remote sensing platforms in remote sensing experiment course

Jun LI, Jinjing Deng, Qiaozhi Wang, Xiangyang Zeng, Lin Hu, Yanzhong Liu

Wuhan University of Science and Technology, China, People's Republic of

Processing massive archives of satellite imagery has historically paralyzed traditional desktop-based remote sensing laboratories. The sheer volume of computationally heavy tasks-from bulk atmospheric correction to long-term radiometric calibration-frequently exceeds the hardware capacity of local campus networks and student laptops. To bypass these severe limitations, this study presents a dual-cloud pedagogical architecture that integrates Google Earth Engine (GEE) and Alibaba's AI Earth. This hybrid framework allows students to instantly access petabytes of analysis-ready data while maintaining low-latency processing for complex modelling via domestic servers. We operationalized this setup through four core practical modules: urbanization monitoring, urban heat island analysis, nighttime light assessment, and AI-driven road extraction. By entirely eliminating the overhead of raw data management and environment configuration, students can finally redirect their cognitive focus toward the actual physics and algorithmic logic of remote sensing—such as parameterizing radiative transfer equations and interpreting radiometric time-series. Furthermore, in light of AI Earth's recent policy shift in March 2026, which heavily restricts free access for educational usage, we critically evaluate the long-term sustainability of this curriculum. To maintain unhindered access to cloud-native geoprocessing, our future instructional designs will assess alternative localized platforms and open-source AI frameworks, ensuring the uninterrupted evolution of rigorous Earth observation education.

Web-based tools for synthetic spatial data generation

Paddy Gorry¹, Peter Mooney²

¹Hamilton Institute, Maynooth University, Ireland; ²Department of Computer Science, Maynooth University, Ireland

Web-based tools for synthetic spatial data generation offer flexibility and accessibility to students and educators alike. This abstract takes a brief overview of some of the existing and developing tools to this end.

Complex Adaptive Blended Learning for Higher GIS Education: A Theory-Driven Pedagogy

Yingwei Yan, Chen-Chieh Feng, Lewis Kwek Boon Leong

Department of Geography, National University of Singapore, Singapore

The COVID-19 pandemic reshaped higher education and accelerated the shift toward blended learning (BL). In GIS education, however, most BL practices have emphasized technologies rather than pedagogical foundations. This study introduces a Complex Adaptive Blended Learning System for GIS education (CABLS-GIS) — a theory-driven framework that conceptualizes BL as an interdependent system comprising the learner, teacher, content, technology, learning support, and institutional environment. The framework was implemented in an introductory GIS course at the National University of Singapore through a flexible-mode BL design integrating face-to-face and online components. Survey results from undergraduate and graduate students revealed positive perceptions of the CABLS-GIS approach, particularly regarding learning flexibility, motivation, and conceptual understanding. The findings highlight how theoretically grounded BL design can enhance pedagogical coherence, technological integration, and educational resilience in the post-pandemic era. CABLS-GIS thus provides a holistic and adaptive model for advancing GIS education and serves as a foundation for developing future personalized and data-driven learning strategies.

Climate Change-Induced Rapid Flood Assessment through Landsat-8, Sentinel-2, UAV, and Machine Learning Techniques: 2022 Swat Flood, Pakistan

Hafiza Sayyeda Farah Batool, Dr. Syed Amer Mahmood

Institute of space science, university of the punjab, Lahore, Pakistan

Remote sensing imagery is a crucial resource for evaluating flood-affected areas following inundation events. The integration of optical satellite data and UAV-based drone surveillance enables the development of precise flood extent maps. This research determined inundated areas by applying spectral water indices and classification methods to both Landsat and Sentinel-2 imagery, supplemented by UAV-based damage assessment. To delineate flooded regions, the study utilized the Normalized Difference Water Index (NDWI), the Modified NDWI (MNDWI), and the Water Ratio Index (WRI). Additionally, land use and land cover analysis were conducted using supervised classification with the maximum likelihood algorithm, enabling effective identification and comparison of flood extents across the indices. The flood coverage was estimated at approximately 107 km² via Landsat, 111 km² through MNDWI, and 115 km² using NDWI. By leveraging classification insights from each index, a targeted correction process was implemented to address misclassifications and enhance delineation accuracy. Notably, both MNDWI and NDWI yielded accuracy rates surpassing 90%, reinforcing the reliability of the results. The proposed remote sensing techniques offer a reliable and innovative approach for detecting flood-affected areas, contributing significantly to timely disaster response and targeted relief efforts.

Managing curriculum development and improvement quality

Ritu Khati

Samridhha Commune Development Center, Nepal

The author aims to introduce some concepts and practical tools, which were usefully applied in the curriculum development influenced by the Bologna process and successfully used in the quality improvement practice.

The first part of the paper is dealing with the definition of education/training needs and involvement of stakeholder’s curriculum planning. One of the most important outcomes from these activities is the definition of skills and competences; and stakeholder management plan. The curriculum is a crucial component of any education/training activities, it is a road map to knowledge, and it builds knowledge topology. The implementation of new curricula often needs capacity building for faculty delivering education or training. Faculty of Geoinformatics (GEO) at Tribhuvan University of Kathmandu, Nepal participated or managed in many relevant international projects. The author will share some good educational practices.

The second part is focusing on curriculum and learning material development methods. The competency matrix will be introduced as a tool used to document and compare the required competencies for graduates. It is used in a gap analysis for determining where critical overlaps between courses are or which skills/competencies are not taught deeply enough.

Quality is omnipresent, ubiquitous – like the cloud of computers. Understanding and evaluating the quality of education requires a comprehensive picture of the unique and complex characters of the system that produced them. The third part of the paper is dealing briefly with quality impro issues.

MODERNIZING THE PHOTOGRAMMETRY CURRICULA WITH SMALL UAVs

Ahmed Elaksher

NMSU, United States of America

Photogrammetry has been known for a little less than a century as the art and science of making precise measurements from optical images. In the last few decades, photogrammetry was taught with traditional approaches focusing on using images captured by metric cameras. Recently, new sensors have been adopted in the surveying and mapping communities. Employers are now looking for graduates with the knowledge and skills required to extract accurate and reliable data from these sensors. Therefore, novel approaches are needed to blend essential principles and cutting-edge technologies in the photogrammetric courses. This article outlines the design and implementation of a new syllabus for a photogrammetry class, the experience delivering the material, and student feedback. The new curriculum introduces students to non-metric camera calibration; laser scanning; and satellite image rectification. sUAV flight planning and data processing were the core of the redevelopment; hence, the article focus on blending sUAV in the curriculum. Topics are taught in lectures and then practiced in labs. Comments received from students and academic and industry experts supported the new design and recommended it as part of renovating new surveying programs.

Geomatics-based approach for the geometric characterization of historical masonry towers

Gabriele Bitelli, Giulia Fiorini, Valentina Girelli, Maria Alessandra Tini, Luca Vittuari

Department of Civil, Chemical, Environmental and Materials Engineering - DICAM, Alma Mater Studiorum - University of Bologna, Bologna, Italy

The geometric monitoring of historic masonry towers is a central topic in heritage preservation, where structural safety must be ensured despite complex geometries, heterogeneous materials and deformation processes that evolve over time. This contribution presents an integrated surveying workflow developed by the DICAM Geomatics Laboratory and tested on the Garisenda Tower in Bologna, one of the most emblematic slender structures in Italy. The tower, built in the early 12th century and today inclined by more than 3 m, represents a challenging case study due to its ongoing deformation, dense urban context and the impossibility of establishing forced-centering stations.

The proposed methodology combines the high-precision capabilities of a Leica TS30 total station with the geometric completeness of a Leica RTC360 terrestrial laser scanner. The total station defines a stable local reference system and ensures accurate vertical alignment of the scanning instrument, while the TLS provides detailed three-dimensional reconstruction of the tower’s surfaces. The resulting 3D model enabled the computation of out-of-plumb parameters, wall flatness and local deformation patterns. Validation against TS30 control points confirmed the metric reliability of the integrated approach.

Three Layers of Authenticity in Augmented Reality Heritage: A Case Study from Suzhou’s Twin Pagodas

Anna Vichnevetskaia^1,2, Jiachen Liang^1,2, Yue Li¹, Nicholas Webb², Yi-Wen Wang¹

¹Xi'an Jiaotong-Liverpool University; ²University of Liverpool

Cultural heritage is increasingly reinterpreted and experienced through digital and immersive environments, including Extended Reality (XR) and Augmented Reality (AR) technologies. While these engage visitors in novel ways, the trend raises questions about what constitutes an “authentic” digital experience. This study examines perceptions of authenticity in an AR experience at the Twin Pagodas, a small-scale heritage site in Suzhou, China.

Building on a framework that distinguishes between objective authenticity (the accuracy of content), constructive authenticity (the interpretive meaning conveyed through stories), and subjective authenticity (the personal and emotional experience), the study explores how these dimensions interrelate and are mediated during digital engagement. Data were collected from 108 participants (ages 8–67, Chinese and international visitors) via pre- and post-experience surveys and 20 semi-structured interviews. Participants rated statements capturing each authenticity dimension, and Pearson correlation analysis examined relationships among them. Ethical approval was obtained prior to data collection.

Findings indicate that authenticity in mobile AR heritage experiences operates across multiple interacting layers. Cognitive judgments about historical accuracy shape interpretive meaning-making, while affective engagement forms a relatively independent experiential dimension. This pattern aligns with existing scholarship that emphasizes the interpretive and experiential nature of authenticity in heritage contexts, while providing empirical evidence from a mobile AR implementation at a modest urban heritage site. Limitations include the focus on a single site and AR design, indicating the need for further research across diverse contexts to strengthen generalizability.

Adaptive PCA-Scale Optimization for Edge Extraction from 3D Scanned Cultural Heritage Point Clouds

Tomoya Okajima¹, Satoshi Takatori¹, . Brahmantara², Fadjal Thufail³, Liang Li¹, Satoshi Tanaka¹

¹Ritsumeikan University, Japan; ²Indonesian Heritage Agency, Indonesia; ³Research Center for Area Studies, National Research and Innovation Agency

Digital archiving of cultural heritage using 3D scanned point cloud data requires effective edge-highlighting visualization to analyze fine structures. However, conventional methods often produce edges that are too thick, obscuring fine details.

This study proposes a method for adaptively optimizing the scale (range) used for local statistical analysis. This allows for the extraction of both sharp and rounded soft edges with high visibility. The core idea is to automatically determine the optimal scale for the analysis.

First, an eigenvalue-based feature value is calculated at multiple scales. Next, the scale that yields the minimum sample variance of this feature value across the entire point cloud is found and selected as the optimal scale.

Using this optimal scale, edge regions are extracted using another feature value. Opacity gradation is applied to emphasize soft edges as well. When this method was applied to a complex cultural heritage relief, fine structures such as ship hulls and human figures, which were indistinct with conventional methods, were clearly visible in the visualization results of the proposed method.

Seasonal Hydro-Optical Assessment of NDWI and Satellite-Derived Bathymetry in the Coastal Waters of Goa (2022–2024)

Vijaydeep Singh, Rahul Dev Garg, Pradeep Kumar Garg

Indian Institute of Technology Roorkee, India

Coastal bathymetry and water-clarity assessment using multispectral remote sensing is essential for understanding nearshore dynamics, sediment transport, and environmental variability. Optical indices such as the Normalized Difference Water Index (NDWI) and satellite-derived depth models provide a rapid means of monitoring these changes. This study focuses on the coastal region of Goa, located along the central western coast of India, an area influenced by strong monsoonal cycles, tidal fluctuations, and high sediment exchange from estuarine systems and open-sea interactions. The objective of this work is to evaluate monthly and annual variations in NDWI and satellite-derived bathymetric depth from 2022 to 2024 and to assess their seasonal and statistical relationships. Sentinel-2 imagery was processed to generate monthly median composites, from which NDWI and bathymetry were extracted; monthly mean NDWI and median depth values were calculated to represent surface water conditions and subsurface optical penetration, respectively. Results show clear seasonal contrasts, with NDWI values ranging from –0.02 to 0.33 and depth values varying between –8.5 m (deep, clear water) and +8.4 m (high turbidity). Annual mean NDWI remained relatively stable (~0.15), whereas median depth became progressively shallower from –2.01 m in 2022 to –0.52 m in 2024, indicating declining optical water clarity. Seasonal correlations between NDWI and depth shifted from strongly positive in winter (r = 0.70) to strongly negative during the pre-monsoon period (r = –0.83), reflecting the influence of sediment resuspension and monsoonal turbidity. Future work may integrate turbidity, wave climate, and machine-learning models for enhanced depth estimation.

A five-level LoD concept for modelling of Buddhist statues in 3D with semantic information

Yuehan Pan¹, Yanju Kong², Hongchao Fan³

¹Beijing University of Civil Engineering and Architecture, China; ²The Palace Museum, China; ³Norwegian University of Science and Technology, Norway

The concept of Levels of Detail (LoDs) plays a critical role in 3D semantic modelling by balancing geometric and semantic complexity with application needs. In our earlier work, we proposed a four-level LoD framework tailored to Buddhist statues, ranging from symbolic representation to detailed geometry, aiming to fulfil the needs for about 60 applications. However, when implementing this concept to applications in the cultural heritage domain, it is suggested to introduce an intermediate LoD between LoD2 and LoD3 because some applications need geometries coarser than the LoD3 but more detail than LoD2. In this paper, we present the analysis of these requirements and propose a new LoD for the 3D modelling of Buddhist statues. To verify the updated concept, we conducted a questionnaire among experts in geomatics and archaeology. Feedback from 170 participants confirmed that the five-level LoD concept is more appropriate and the revised framework provides a more comprehensive alignment with tasks in archaeology, conservation, museum exhibition, and risk management, and demonstrates strong potential for standardization within CityGML ADE.

Feature-Enhanced Visualization of 3D Point Clouds of Cultural Heritage in Transparent Virtual Reality

Taishi Matsuo, Satoshi Takatori, Liang li, Satoshi Tanaka

Ritsumeikan University, Japan

In recent years, digital archives using VR technology have been actively created, but most are intended for viewing culutual properties, with few designed for analysis. In this study, we create a VR system for understanding the 3D structure of cultural properties, using the 3D point cloud data of Tamaki Shrine, a World Heritage site in Nara Prefecture, Japan, as an example. As a feature enhancement method, we performed feature enhancement using principal component analysis. Furthermore, by applying it to a transparent VR environment, we aimed to improve the visibility of 3D structures.

Evaluating Multispectral Data Fusion for Dense Instance Segmentation in Vegetation and Artificial Objects Point Clouds

Clodoaldo Faria Junior¹, Isabella Norberto², Marcos Ricardo Maximo¹, Antonio Tommaselli², Mauricio Galo²

¹Aeronautics Technological Institute, São José dos Campos, São Paulo 12228-900, Brazil; ²Faculty of Science and Technology, São Paulo State University (UNESP) at Presidente Prudente, São Paulo 19060-900, Brazil

Multispectral data improves instance segmentation in digital agriculture by combining geometric and spectral information to distinguish complex natural features. While geometric information captures structural details, it often falls short when dealing with complex natural features that exhibit high spectral similarity, rather than due to limitations inherent to geometric representation itself. This work presents a feasibility analysis of instance segmentation using a spectral point cloud. A combination of spectral bands is selected based on class separability and proximity to a normal distribution as estimated by the Shapiro–Wilk test. The aim is to identify the minimum number of bands required to produce optimum results. For the normality analysis, Euclidean magnitude normalisation was applied, and it was also used alongside standard scaling to support the Multilayer Perceptron (MLP) for classification and segmentation. To refine the MLP predictions and consolidate instance labels, a graph-based post-processing step was applied, linking each point to its nearest neighbours and using a majority-voting scheme, resulting in spatially coherent clusters and refining the MLP predictions. The results demonstrate that multispectral data can reliably segment individual objects, with ten spectral bands being sufficient to achieve highly satisfactory segmentation and accurately delineate natural features such as leaves and tree trunks. Further increasing the number of bands improved spectral definition even more, with 14 bands achieving the highest performance across all metrics (mIoU: 96.59%; AP50: 96.14%). These findings highlight the strong potential of multispectral point clouds for precise and scalable object-level segmentation in agricultural environments.

Multi-temporal, Multi-modal UAV and Machine Learning Framework for Early Detection and Mapping of Bacterial Leaf Blight in Rice

Megharani Doddamani¹, Ziyi Wang¹, Florian Ellsäßer¹, Nancy Castilla², Alice Laborte², Andrew Nelson¹, Stephen Klassen², Roshanak Darvishzadeh¹

¹Department of Natural Resource, Faculty of Geo-information Science and Earth Observation (ITC), University of Twente, Enschede, Netherlands; ²International Rice Research Institute (IRRI), Los Banos, Laguna, Philippines

This study presents a UAV-based framework for early detection of Bacterial Leaf Blight (BLB) in rice using multi-temporal and multi-modal data. Conducted at the International Rice Research Institute (IRRI) during the 2023 wet season, the experiment integrated multispectral, thermal, and RGB imagery with crop physiological measurements from both healthy and artificially inoculated fields. Spectral (NDVI, NDRE), thermal (canopy temperature), and textural features were extracted and analyzed using a Random Forest classifier to identify early indicators of BLB infection. Results demonstrated that combining spectral and thermal data enhances early disease detection before visible symptoms appear, supporting precision agriculture and sustainable rice disease management.

The use of geospatial artificial intelligence technologies (geoai) within national mapping agencies: a review

Zineb Didouz^1,2, Imane Sebari², Kenza Ait El Kadi²

¹Agence Nationale de la Conservation Foncière du Cadastre et de la Cartographie; ²Institut Agronomique et Vétérinaire Hassan II

National mapping agencies (NMAs) provide authoritative and authoritative geospatial data for their respective countries. All geospatial agencies face significant challenges, including rapid technological advancements, societal expectations, and environmental pressures.

To produce high-quality geospatial information that meets user needs, NMAs combine image data acquisition from various sensors, field data collection, and manual interpretation and processing.

The use of geospatial artificial intelligence (GeoAI) offers opportunities to optimize workflows and reduce manual workload.

This article presents preliminary results from a study on the applications of GeoAI in the activities of National Mapping Agencies, along with key challenges and ethical considerations.

Fusion of PlanetScope SuperDove and Orthorectified Aerial Images for Tree-Level Stress Monitoring in Boreal Forests

Xiaoyang Zhao, Langning Huo

Swedish University of Agricultural Sciences, Department of Forest Resource Management, 90654 Umea, Sweden

Detecting early-stage vegetation stress at the individual tree scale is a pivotal remote sensing application. The ``green shoulder'' band at 530 nm serves as a key signal for early stress detection due to its sensitivity to carotenoid changes. However, existing remote sensing systems often struggle to simultaneously capture fine-scale canopy structures and stress-sensitive spectral data, making heterogeneous fusion a promising topic. Unlike mainstream supervised methods that rely on prescribed degradation models and high-quality samples, an unsupervised blind fusion framework based on Implicit Neural Representation and low-rank decomposition is proposed in this paper. Guided by orthorectified aerial images, the framework performs per-band super-resolution on PlanetScope SuperDove data to achieve a 0.16-meter resolution. It employs Sinusoidal Representation Networks to learn a continuous joint implicit representation of spatio-spectral information, effectively modeling the non-linear relationship between canopy structure and spectral response.To mitigate high-dimensional feature redundancy during heterogeneous data fusion, low-rank decomposition is integrated to reduce computation overhead. Experimental results show that the proposed method can fuse heterogeneous images effectively, providing a solid solution with practical guidance for subsequent early stress monitoring at the individual tree level.

LLM-Enhanced Semantic Segmentation of Large-Scale Urban LiDAR Point Clouds via Contextual Prompting

Jinlong Wang, Tao Shen, Liang Huo, Fulu Kong, Jiahui Wang, Xuejia Wei

School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, No. 15, Yongyuan Road, Huangcun Town, Daxing District, Beijing, China

Urban LiDAR point clouds provide rich geometric information but pose significant challenges for automated interpretation due to their scale, noise, and semantic complexity. Traditional convolutional and graph-based networks (e.g., PointNet++, RandLA-Net) have made significant strides by focusing on local geometric feature learning. However, they often lack the ability to incorporate high-level, global semantic context. This limitation leads to persistent errors in object boundary delineation and category confusion, particularly for semantically or geometrically similar classes (e.g., 'road' vs. 'sidewalk',or 'low-wall' vs. 'curb').Recent advances in large language models (LLMs) have demonstrated remarkable capabilities in contextual understanding, reasoning, and knowledge retrieval. Inspired by these developments, and motivated by the growing trend of cross-modal alignment in vision-language models, we propose an LLM-enhanced segmentation framework that integrates linguistic priors into the 3D perception pipeline. Our key contribution is the use of contextual prompts—textual descriptions generated or retrieved by an LLM based on 3D scene content—to guide the segmentation network. These prompts provide disambiguating cues, enabling the model to better distinguish between challenging classes and to recognize objects that are rare in the training data.The main contributions of this work are:1.A novel framework that synergistically combines a geometric point cloud encoder with an LLM-based contextual prompter for semantic segmentation.2.A methodology for generating and fusing contextual prompts from point cloud data, bridging the gap between geometric perception and linguistic reasoning.3.Extensive experiments demonstrating superior performance over state-of-the-art methods, particularly on semantically ambiguous and long-tailed object categories.

Developing an Urban Road Dataset: A Multi-Sensor Framework for DT and AI-Based Road Infrastructure Management

Vittorio Scolamiero¹, Piero Boccardo²

¹Sapienza Università di Roma, Italy; ²Politecnico di Torino, Italy

This contribution presents a new multi-sensor dataset of the urban road network of Turin, designed to support research in Digital Twins, AI-based road monitoring, and semantic 3D modelling. The dataset integrates mobile mapping (MMS), aerial LiDAR, imagery, and BIM/IFC models into a unified spatial and semantic framework. It includes detailed point cloud classifications, pavement defect annotations, and metadata to ensure full reproducibility. By combining geometric precision with semantic labelling, the dataset enables applications such as automated defect detection, semantic segmentation, 3D reconstruction, and predictive maintenance. Compared to existing benchmarks, it offers a unique focus on road surface condition and DT interoperability. The contribution outlines the methodology used to structure, validate, and document the dataset, positioning it as a valuable resource for both academic research and operational urban infrastructure management.

Application of LiDAR technology for identifying surface anomalies in concrete structures through reflective intensity analysis

Andrea Nathaly Escobedo Tamez¹, Nallely Salazar González¹, Fabiola Doracely Yépez Rincón¹, Melanie Sepúlveda García¹, Ian Eliseo Cruz Bulmes¹, Milena Mesa Lavista²

¹Department of Geomatics, Faculty of Civil Engineering, Universidad Autónoma de Nuevo León, San Nicolás de los Garza; ²Department of Structural Engineering, Faculty of Civil Engineering, Universidad Autónoma de Nuevo León

Structural inspection is crucial for comprehensive risk management, especially given the accelerated deterioration caused by factors such as climate change and obsolescence. The accurate determination of the percentage of surface damage is fundamental for optimizing maintenance decision-making and the administration of resources for infrastructure preservation. This work presents a methodological exploration to assess the superficial condition of a concrete pedestrian bridge located over an urban river. The study focuses on determining the structural conditions by calculating the percentage of surface damage to evaluate maintenance needs. For data acquisition, Light Detection and Ranging (LiDAR) technology is employed using a Terrestrial Laser Scanner (TLS) Trimble X7 laser scanner, generating a 3D point cloud that models the bridge surface with precise spatial coordinates. The methodology utilizes the reflective intensity of the laser pulses to obtain quantitative information about the surface. This approach allows for the precise identification, demarcation, and quantification of deteriorated areas. The application of this methodology facilitates a non-invasive and detailed diagnosis of the surface condition, providing quantitative and visual information that can enhance the maintenance planning of critical infrastructure such as pedestrian bridges.

Understanding Public Experiences of Urban Greenspace: A Novel Data-driven Multimodal Method based on Online Review Data and Natural Language Processing

Zian Wang¹, Yifan Yang², Peter van Oosterom¹, Steffen Nijhuis¹, Stefan van der Spek¹

¹Faculty of Architecture and Built Environment, Delft Univ. of Technology, Delft, Netherlands; ²Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands

Understanding public experiences in urban greenspace is essential for supporting more human-centric design and management. While traditional survey methods are often time- and labor-intensive, user-generated content (UGC) offers a rapid and scalable alternative for capturing public experiential insights. However, extracting detailed user experience information from this data remains methodologically challenging. This study proposes a novel multimodal analytical framework based on online review data and natural language processing techniques, combining LoRA fine-tuned RoBERTa model with CLIP vision-language model to analyze multidimensional ecosystem service experience patterns in urban greenspace from user-generated text and image reviews. Results demonstrate that the proposed approach achieves more robust extraction and analysis of user experience insights compared to conventional deep learning and lexicon-based methods, exhibiting greater capacity to process contextually embedded experiential information. The multimodal framework enables more comprehensive capture of user experiences than either text or image data alone, with particular gains on dimensions that are difficult to represent through a single modality. Applying the analytical framework to Amsterdam and Rotterdam as case studies, statistical and spatial analysis reveals heterogeneity in user urban greenspace experiences and identifies key experiential bundles alongside their associated synergies and trade-offs. This study offers a novel approach to quantifying urban greenspace experiences from a user perspective, and provides insights for evidence-based urban greening practices.

Capturing, processing and analysing 3D Data in a National Mapping Agency

David Holland, James Gillespie, Chris Heap

Ordnance Survey, United Kingdom

This paper describes the development of a 3D mesh product by Ordnance Survey, Britain's National Mapping Agency. The work originated in the research team and was then taken up by a multi-disciplinary cross-business team which used product development techniques and extensive customer interviews to determine the feasibility (could it be made) and viability (would it generate sufficient revenue) of a potential 3D mesh product. The 3D mesh, generated from nadir aerial imagery already captured for topographic map update, was introduced as a beta product and is currently being tested by potential users.

Leveraging Close-range Photogrammetry and Inverse Rendering Engine for Photorealisitic Material Reconstruction

Han Hu, Xiaolin Guo, Junfan Wang, Qing Zhu

Faculty of Geosciences and Engineering, Southwest Jiaotong University, 611756 Chengdu, China

Photorealistic 3D reconstruction fundamentally requires recovering the intrinsic optical properties of object surfaces. Traditional multi-view photogrammetry, based on Structure-from-Motion (SfM) and Multi-View Stereo (MVS), effectively reconstructs geometry and texture but assumes Lambertian reflectance, failing on non-Lambertian materials with specular highlights and subsurface scattering. While recent implicit representations like NeRF and its extensions have advanced novel view synthesis, their effectiveness is constrained by the inherent coupling of geometric, material, and luminous properties. To overcome these issues, we propose a differentiable rendering method for photorealisitic material reconstruction in close-range photogrammetry, enabling physically accurate forward and inverse rendering of PBR parameters. Experimental results demonstrate that our method achieves high-fidelity reconstruction of object geometry and multi-channel SVBRDF/BSSRDF materials, robustly recovers HDR environment maps under complex indoor and outdoor illumination, can effectively removes indirect illumination artifacts through Monte Carlo ray tracing, and produces editable assets that enable realistic relighting and material editing.

Decoupling Visual and Textual Representation for Remote Sensing Image Segmentation

Ce Zhang, Boyi Li

School of Geographical Sciences, University of Bristol, United Kingdom

The emergence of vision–language models (VLMs) has enabled joint multimodal understanding beyond traditional visual-only approaches. However, transferring VLMs from natural images to remote sensing (RS) segmentation remains challenging due to limited category diversity and significant domain gaps. We propose a training-free framework that decouples visual and textual inputs and performs multi-scale visual–language alignment for RS segmentation. At the global–local decoupling module, we separate text into local class nouns and global modifiers, while images are partitioned into class-agnostic mask proposals via unsupervised mask generation. At visual–textual alignment module, we introduce a context-aware cropping strategy and a knowledge-guided prompt engineering method to enhance text representations, enabling mask classification for open-vocabulary semantic segmentation (OVSS). A Cross-Scale Grad-CAM module refines activation maps using contextual cues from global modifiers, facilitating accurate and interpretable alignment for referring expression segmentation (RES). Evaluations on the benchmarks demonstrate strong performance, highlighting the potential of training-free VLM transfer to the RS domain.

A Geo-Foundation Framework for Retrogressive Thaw Slump Detection Using High-resolution Remote Sensing Data

Mohammad Marjani¹, Masoud Mahdianpari¹, Fariba Mohammadimanesh²

¹Memorial University of Newfoundland, Canada; ²Canada Centre for Remote Sensing, Natural Resources Canada, 580 Booth Street, Ottawa, ON, Canada

Retrogressive thaw slumps (RTSs) are key indicators of permafrost degradation in Arctic regions. Yet, their detection remains challenging due to spectral similarity with surrounding terrain and the limited generalization of conventional deep learning approaches. This study presents a Geo-Foundation framework that integrates pretrained Clay embeddings with high-resolution PlanetScope multispectral imagery, spectral indices, and ArcticDEM data for RTS detection in the Northwest Territory (NWT), Canada. The proposed dual-branch architecture combines high-level geospatial representations with physically meaningful environmental features to improve segmentation performance. The model achieved an F1-score of 0.83 and a mean Intersection-over-Union (mIoU) of 0.75 on the validation dataset. Analysis of patch size indicates that intermediate spatial context provides optimal performance, while feature importance results highlight the dominant role of vegetation-sensitive spectral bands and indices. Qualitative evaluation further confirms accurate boundary delineation and spatial consistency across diverse terrain conditions. The results demonstrate that Geo-Foundation models enhance detection accuracy, reduce dependence on large labeled datasets, and improve generalization across heterogeneous Arctic landscapes. This approach provides a scalable and efficient solution for monitoring permafrost-related disturbances under a changing climate.

Combining and Processing Airborne Laser Scanning and Crowdsourced Terrestrial Images for bilberry high-yield maps

Susanna Hyyppä¹, Josef Taher¹, Matti Hyyppä¹, Leena Matikainen¹, Xiaowei Yu¹, Harri Kaartinen¹, Oskari Saarinen¹, Kirsi Karila¹, Ville Luoma³, Marjut Turtiainen⁴, Birgitta Partanen⁴, Roope Näsi¹, Jyri Maanpää¹, Eetu Puttonen¹, Arttu Kivimäki¹, Mikko Vastaranta⁵, Antero Kukko¹, Hannu Hyyppä², Matti Vaaja², Fabio Remondino⁶, Narges Takhtkeshha⁶, Juha Hyyppä¹

¹Finnish Geospatial Research Institute, Finland; ²Aalto university, Finland; ³University of Helsinki, Finland; ⁴Arctic Flavours Association, Finland; ⁵University of eastern Finland, Finland; ⁶Bruno Kessler Foundation, Italy

Forests provide essential ecosystem services beyond timber, yet locating high-yield areas for non-wood forest products such as bilberries (Vaccinium myrtillus) remains a challenge for both recreational and commercial pickers. By integrating Airborne Laser Scanning (ALS), Geographical Information System (GIS) data, and crowdsourced terrestrial imagery analyzed via deep learning (YOLO), we developed a predictive system optimized for identifying high-yield hotspots. We demonstrate that YOLO detection remains highly accurate, but plant height significantly contributes to berry omission. However, this limitation can be mitigated by selecting the maximum berry count from multi-angle terrestrial images. Using a Random Forest classifier across a 36-km² study area in Nuuksio, Finland, we achieved a precision of 58% for the highest yield category. This represents a 20-fold increase in the probability of encountering a high-yield area compared to random searching. Extensive user testing over two years validated the practical utility of the system, showing a 22.5% increase in harvested yield and a 36.5% reduction in time required to locate hotspots. Furthermore, 97% of users reported that the platform provided an accurate big picture of bilberry yield. These results highlight the potential of combining crowdsourced citizen science with advanced LiDAR metrics to create digital twins of forest ecosystems that enhance human interaction with nature and optimize the sustainable harvest of wild food resources.

A Knowledge Service System for Cultural Heritage Integrating Knowledge Graph and Semantic 3D Model

Jiadong Zhang^1,2,3, Miaole Hou^4,5, Jun Chen^2,3, Xiaoguang Zhou¹, Jiaxin Ren^3,6, Shunxi Yin^2,7, Huiqiang Zhao⁵, Zhihua Zhang⁸, Bingliang Cui⁹

¹School of Geosciences and Info-Physics, Central South University, Changsha 410083, China; ²National Geomatics Center of China, Beijing 100830, China; ³Moganshan Geospatial Information Laboratory, Huzhou 313299, China; ⁴School of Land Engineering, Chang'an University, Xi'an 710054, China; ⁵School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, Beijing 100044, China; ⁶School of Earth Sciences, Zhejiang University, Hangzhou 310058, China; ⁷School of Environment and Spatial Informatics, China University of Mining and Technology, Xuzhou 221116, China; ⁸Shanxi Cultural Relics and Museum Industry Group Co., Ltd., Taiyuan 030001, China; ⁹Guangzhou Alpha Software Information Technology Co., Ltd., Guangzhou 510060, China

Cultural heritage (CH) digitization currently suffers from fragmented multi-source heterogeneous data, insufficient knowledge organization, and limited semantic expression in 3D CH models. Existing knowledge graphs and HBIM in CH field lack unified semantic representation and effective GIS integration, thus restricting intelligent knowledge services. To overcome these issues, a knowledge service approach integrating knowledge graph and semantic 3D models is proposed, enabling the transformation from data process to knowledge-driven services. An extension model for CH (CHADE) is developed using the CityGML ADE mechanism to support the construction of semantically enriched 3D geospatial scenes. Meanwhile, A domain ontology (CHOnto) based on CIDOC CRM is constructed to formalize CH knowledge, and multi-source heterogeneous data are organized into a Cultural Heritage Knowledge Graph (CHKG). By establishing semantic connections between knowledge graph and 3D models, the proposed method achieves integrated representation of geometry, spatial context, and domain knowledge. A prototype system (3DCHKS) is implemented and validated through multiple heritage scenarios. Results demonstrate that the approach enhances semantic connectivity, knowledge organization, and scenario-based representation, supporting intuitive visualization and intelligent application. Although limitations remain in generalizability and knowledge extraction robustness, this study provides a novel framework for integrated CH knowledge services and lays a foundation for scalable, knowledge-driven heritage applications.

Evaluating different satellite-based Aerosol Optical Depth (AOD) in predicting inland daytime PM2.5 using machine learning-based regression approach

Roseanne Ramos^1,2, Mark Joseph Calubad³, Alvin Christopher Varquez¹

¹Department of Transdisciplinary Science and Engineering, School of Environment and Society, Institute of Science Tokyo; ²Department of Geodetic Engineering, University of the Philippines, Diliman, Quezon City, Philippines; ³Department of ICT Integrated Ocean Smart City Engineering,Dong-A University, Busan, South Korea

Aerosols play a critical role in the development of boundary layer and build-up of air pollution in urban environments. Their presence in the atmosphere is calculated and represented by Aerosol Optical Depth (AOD). Satellite sensors observe aerosol quantities and different algorithms are applied to retrieve AOD at varied spatial and temporal resolutions. In air quality monitoring, satellite-based AOD products are useful in modelling particulate matter (PM). This study evaluates AOD products observed by Moderate Resolution Imaging Spectroradiometer (MODIS), Visible Infrared Imaging Radiometer Suite (VIIRS) and Advanced Himawari Imager of Himawari-8 in predicting inland daytime PM2.5 for test sites in Japan and South Korea. Prediction models are constructed using eXtreme Gradient Boosting (XGBoost) regression with input variables from observation datasets matched on PM2.5 station locations. In addition to AOD, seventeen (17) predictor variables were considered to account topographic and meteorological parameters that can influence the formation and transport of PM2.5 near the ground surface. Overall results show that prediction model using MODIS MAIAC AOD generate relatively higher accuracy for daily estimates considering both spatial coverage and prediction skill metrics. For future work, model improvements will be done by exploring additional predictor variables to reduce overfitting and additional statistical tests to generate more accurate estimates of PM2.5.

Learning Height from Geospatial Embeddings: an initial investigation of the Google AlphaEarth dataset

Alireza Hamoudzadeh¹, Hugo De Vecchi², Valeria Belloni¹, Roberta Ravanelli²

¹Geodesy and Geomatics Division, DICEA, Sapienza University of Rome, Rome,, Italy; ²Geomatics Unit, Department of Geography, Faculty of Sciences, University of Liège, Liège, Belgium

Geospatial embeddings represent a promising paradigm for encoding geospatial information into compact and learnable representations that support scalable downstream tasks in remote sensing. Among recent developments, Google’s AlphaEarth embeddings are a dataset of 64-dimensional embeddings, made available globally at 10 m resolution, derived from multimodal inputs, including multispectral and SAR imagery, elevation, gravity and text data. In this study, we explore the feasibility of inferring surface height from AlphaEarth embeddings within a deep learning framework. The analysis focuses on an 8000 km² area in Nouvelle-Aquitaine, France, where a 5 m resolution Digital Surface Model (DSM) is available. A U-Net architecture with a ResNet34 encoder was trained to predict surface heights from the 64 embedding channels using a spatial cross-validation strategy to ensure independence between training and testing subsets. For computational efficiency in this preliminary experiment, both the embeddings (input) and DSM (target) were resampled to 100 m. Results indicate promising agreement between predicted and reference heights, achieving an R² of 0.83 and a Pearson correlation of 0.93 on the test set. However, a systematic bias was observed. These findings highlight the potential of AlphaEarth embeddings to capture height-related features, despite being trained on a broader geospatial domain. Future work will address bias investigation, increase inference spatial resolution, and expand the analysis across diverse geographical regions. Additionally, comparisons with alternative embedding datasets, such as Tessera, will be conducted to better evaluate the strengths and limitations of embedding-based surface height estimation.

Hierarchy-Aware Intent Recognition and Task-Oriented Text Generation for Non-Expert Satellite Instructions

Xinyue Li¹, Chunhui Wang², Hongwei Li², Zifeng Jiang³

¹School of Aeronautics and Astronautics, Zhejiang University; ²College of Information Science and Electronic Engineering, Zhejiang University; ³STAR.VISION Aerospace Group Limited, Hangzhou

With the rapid advancement of large language models, natural-language-based understanding of satellite task requests is becoming increasingly important for improving the accessibility of remote-sensing services. However, satellite commands issued by non-expert users are often conversational, ambiguous, and terminologically inconsistent, leading to a substantial gap between free-form expressions and structured task representations. To address this challenge, we propose a hierarchy-aware framework for intent recognition and task-oriented text generation from non-expert satellite instructions. Specifically, we design a hierarchical annotation scheme that models intent levels, parameter structures, inter-element relations, and execution complexity, and we further construct a hierarchical sequence representation for learning. We then introduce a boundary-aware sample organization method based on semantic similarity and structural proximity, together with a retrieval-augmented multi-type negative-sample reorganization strategy to enhance robustness. Finally, we adopt Qwen3-8B with LoRA for parameter-efficient domain adaptation and unified generation of top-level intents and task-oriented outputs. Experiments on a manually curated dataset of 4,025 non-expert satellite instructions show that the proposed method consistently outperforms multiple baselines on both intent classification and task-oriented generation, demonstrating a resource-efficient and scalable solution for natural-language satellite task interfaces.

A Tracking-Free Automatic Target Recognition (ATR) Radar Methodology for Real-Time Airspace Management in China’s Low-Altitude Economy

Jiangkun Gong^1,2, Yan Jun², Deren Li²

¹Shanghai University, China, People's Republic of; ²Wuhan University, China, People's Republic of

China’s Low-Altitude Economy (LAE) requires robust airspace surveillance for the safe integration of Vertical Take-off and Landing (VTOL) aircraft and Unmanned Aerial Systems (UAS). Traditional radar Automatic Target Recognition (ATR) approaches—both micro-Doppler-based and tracking-based—depend on track accumulation, introducing Detection Response Times (DRT) exceeding 3–5 seconds that are incompatible with real-time low-altitude operations. This paper proposes a tracking-free ATR methodology that restructures the conventional serial “Detection–Tracking–Recognition” chain into a parallel “Integrated Detection and Recognition” (IDR) architecture. The classifier operates independently of the tracker, extracting target attributes from single-dwell echoes within one Coherent Processing Interval (CPI), achieving a DRT below 100 milliseconds—more than an order-of-magnitude improvement over existing systems. The methodology is validated through field trials using a X-band radar, demonstrating reliable identification of VTOL at ranges exceeding 12 km. We further clarify the precise definition of DRT and argue for NATO ATR hierarchy level T3 (Recognition) or above as the minimum performance standard for low-altitude radar sensors.

Beyond Alerts: spatiotemporal Trade-offs in near-real-time Detection Systems for Forest Disturbance in the Brazilian Amazon

Alby Duarte Rocha¹, Alessandra Rodrigues Gomes², Luis Waldyr Rodrigues Sadeck³, Karine Reis Ferreira¹

¹Earth Observation and Geoinformatics Division, National Institute for Space Research (INPE); ²Amazon Spatial Coordenation (COEAM), National Institute for Space Research (INPE); ³Graduate Program in Environmental Sciences, Institute of Geosciences, Federal University of Pará (UFPA)

The Amazon rainforest faces threats from anthropogenic disturbances, which also increase greenhouse gas emissions and contribute to global climate change. In 2004, a system to detect disturbance for the Brazilian Legal Amazon (BLA) was created to mitigate forest loss. The system, Detection of Deforestation in Real Time (Deter), from the National Institute for Space Research (INPE), alerts to seven types of anthropogenic forest disturbances through the visual interpretation of optical satellite imagery from CBERS-4, CBERS-4A and Amazônia-1. Many near-real-time systems currently generate alerts using automated algorithms, primarily leveraging SAR sensors to compensate for the absence of cloud-free images over tropical forests. Deter uses spatial patterns to identify types of disturbances, minimising commission errors, while most algorithms prioritise the temporal dimension for early-stage detections. Discrepancies in space and time across systems and disturbance types, such as omissions, delays, and mismatches, are linked to the selection of sensor technologies, forest masks, and algorithm strategies. Forest disturbances detected between 2020 and 2024 for the entire Brazilian Amazon Biome were extracted from the systems: Deter, Prodes, MapBiomas, SAD, RADD, GLAD, LUCA and TropiSCO. Based on this dataset, we conducted an exploratory analysis revealing agreement and disagreement between detection systems regarding five classes of disturbances (clear-cut, selective logging degradation, fire scars, mining and windthrow). The results emphasise the importance of systems that consider the trade-off between spatial and temporal context to detect different disturbance types, similar to Deter, but using automated near-real-time alert approaches.

An Intelligent Matching Method for Archaeological Pottery Shards Based on the Fusion of Lang SAM and DINO v2

Chang Xinfeng^1,2,4, Gao Zhenhua³, Wei Shengnan^1,4, Hou Miaole^1,4

¹Beijing University of Civil Engineering and Architecture, Beijing, China; ²Pingdingshan University, Henan, China; ³Shanxi Provincial Institute of Archaeology, Shanxi, China; ⁴Beijing Key Laboratory for Architectural Heritage Fine Reconstruction & Health Monitoring, Beijing, China

In archaeology, the long-standing problem of low efficiency and high experience-dependence in manual matching of numerous unearthed pottery shards has been a challenge. This paper presents and develops an intelligent matching and annotation tool for pottery shard images, integrating advanced computer vision technologies. Using 35,159 pottery shard images from Pit H690 at the Daxinzhuang Site in Shandong as the dataset, a comprehensive “segmentation-feature extraction-cross-verification-screening” technical process is established. The core steps are as follows: First, the natural-language-based visual segmentation model Lang SAM is employed to precisely segment individual pottery shards from the original images, obtaining clean front and back images. Second, the self-supervised visual feature model DINO v2 is used to extract deep visual feature vectors of the shards, calculate image similarities for the front and back sides respectively, and generate a Top-N candidate matching list for each shard. Finally, cross-verification is carried out by taking the intersection of the front and back candidate lists, and the final screening is conducted with archaeological metadata. This research demonstrates the great application potential of AI in archaeological fragment assembly, offering an automated, interpretable, and efficient solution for handling massive cultural relic fragments.

Multi-Source Remote Sensing for Maritime Security: A Performance Evaluation of SAR and RGB Imagery for Small-Scale Fishing Vessel Detection

Stefano Chiappini¹, Marsia Sanità¹, Alessandro Galdelli², Anna Nora Tassetti³, Roberto Pierdicca¹, Carmen Ferrà Vega³, Andrea Fiorani²

¹Department of Civil, Building Engineering and Architecture (DICEA), Università Politecnica delle Marche 60131 Ancona, Italy; ²Department of Information Engineering (D3A), Università Politecnica delle Marche, 60131; ³CNR-IRBIM, Institute for Marine Biological Resources and Biotechnology, National Research Council, 60125 Ancona, Italy

Effective maritime surveillance and management of small-scale fisheries remains challenging in coastal waters because small vessels are not systematically tracked and are weakly represented in medium-resolution satellite imagery. Within the AI4COPSEC Horizon Europe framework, this study investigates an object-detection workflow for small-vessel monitoring along the Adriatic coasts of Marche and Puglia, Italy. A multisource dataset was prepared in which Sentinel-2 and PlanetScope optical imagery were manually annotated to enrich an existing SAR and optical imagery training dataset and support a two-stage training strategy. The first stage used a larger, more heterogeneous dataset for robust feature learning, while the second refined the model on a smaller, higher-quality subset to improve domain adaptation and detection performance. The resulting dataset comprised 4,202 image tiles (pretraining) and 706 image tiles (fine-tuning), with 16,096 and 1,716 vessel annotations, respectively, all belonging to a single target class.

Detection experiments were conducted with several YOLOv26 configurations trained under a consistent protocol to assess the trade-off between model complexity, accuracy and computational efficiency. Among the standard variants, YOLOv26-M achieved the most balanced performance, with a Precision of 0.813, Recall of 0.846, F1-score of 0.829, Accuracy of 0.719 and mAP50-95 of 0.306. Pruned and lightweight alternatives showed competitive efficiency-oriented behaviour. Results indicate that, in small-target coastal environments, scaling up model size does not necessarily yield proportional gains, whereas task-oriented architectural design improves the balance between detection quality and computational cost. The workflow provides a practical benchmark for AI-enabled maritime monitoring and supports the advancement of Copernicus-oriented coastal surveillance applications.

Toward IFC-Compatible HBIM Semantics for Component-Level Representation of Architectural Heritage

Wanqin Liu¹, Fabrizio Banfi¹, Marco Pela¹, Angelo Giuseppe Landi², Fabio Roncoroni¹

¹Politecnico di Milano, Dept. of Architecture, Built Environment, and Construction Engineering (ABClab-GICARUS); ²Politecnico di Milano, Dept. of Architecture and Urban Studies (DAStU)

The growing use of artificial intelligence (AI) and data-driven methods in architectural heritage research requires structured and reusable semantic units to support consistent modelling, annotation, and knowledge alignment. In this context, Historic Building Information Modelling (HBIM) can serve as a semantic anchor by linking surveyed geometry with object-based representations and non-geometric information. However, current HBIM workflows remain semantically fragmented: point cloud segmentation often relies on project-specific labels, object modelling adopts inconsistent decomposition and naming logics, and semantic enrichment is frequently implemented through custom parameters without a shared component-level framework. Although Industry Foundation Classes (IFC) provide the most widely adopted canonical structure for interoperability, their standard entities are often too coarse to represent heritage-specific subcomponents. To address this gap, this study proposes an IFC-compatible semantic framework for component-level representation in HBIM. The framework combines a canonical IFC-aligned layer with a heritage extension layer and introduces a mapping strategy for representing semantically meaningful subcomponents without modifying the core IFC schema. A Serliana arch on the church of SS. Paolo e Barnaba in Milan is used as a case study to illustrate the implementation of the proposed approach. The study establishes a preliminary semantic foundation for component-level heritage representation in HBIM, providing both a conceptual basis for structuring heritage subcomponents and an operational basis for their IFC-compatible implementation. This foundation may also support future developments in ontology alignment and cross-modal AI applications, where stable semantic anchors are required for data integration and annotation.

Point Cloud Semantic Segmentation of Thousand-Buddha Niches in Grotto Temples Based on PointNet++ Transfer Learning

Songze Wei¹, Miaole Hou^1,2, Bo Ning³, Yong He³

¹School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, China, People's Republic of; ²School of Land Engineering, Chang’an University, Middle Section, South 2nd Ring Road, Xi'an, Shaanxi, 710054, China;; ³Yungang Research Institute, No. 1, Dong Street, Yungang Town, Yungang District, Datong City, 037007, China

Thousand-Buddha niches on the walls of grotto temples are core carriers of China's Buddhist cultural heritage. Their high-precision digital extraction is a key prerequisite for virtual restoration of cultural relics, stylistic lineage research, and digital display. Currently, close-range photogrammetry is mostly used for digital acquisition of small and medium-sized grotto temples to obtain point clouds. This technology, through non-contact multi-view image collection and matching, can not only retain the fine morphological features of niches but also comply with the core requirement of "non-destructiveness" in cultural relic protection, making it the mainstream method for grotto temple point cloud collection. However, the segmentation of thousand-Buddha niche point clouds still faces two core challenges: first, the sample scarcity bottleneck in cultural relic scenes. Manual annotation of niches requires professional archaeological knowledge, which is time-consuming and labor-intensive, resulting in limited sample size that is difficult to support the full training of deep learning models; second, the segmentation adaptation problem of target characteristics. Niches are densely distributed with similar shapes, and point clouds from close-range photogrammetry are prone to local noise due to lighting differences. Traditional segmentation methods are prone to boundary blurring, misclassification, and missing segmentation. Pure transfer learning without combining the characteristics of cultural relic scenes leads to insufficient segmentation accuracy.

Comparative Study of Stable Diffusion-Based Super-Resolution Methods for Remote Sensing Imagery

Yiran Lu^1,2, Hongjie He^1,2,3, Fei Zhu^1,2, Jonathan Li^1,2,3

¹School of GeoAI and Hinton STAI Institute, East China Normal University; ²Key Laboratory of Geographic Information Science (Ministry of Education), , East China Normal University; ³Department of Geography and Environmental Management, University of Waterloo

Remote sensing image super-resolution aims to recover fine structural and textural details from degraded low-resolution observations. However, conventional methods and early deep learning models often produce over-smoothed results and struggle to reconstruct realistic high-frequency content. Stable Diffusion-based (SD-based) methods offer a promising alternative by using strong generative priors to synthesize more natural, detail-rich super-resolved images. Although many SD-based super-resolution methods have been proposed in computer vision, their use in remote sensing imagery remains limited, and systematic comparative evaluation in this domain is still lacking, leaving insufficient empirical guidance for method development. Therefore, this paper compares four representative SD-based super-resolution methods, namely Stable Super-Resolution (StableSR), Semantics-Aware Super-Resolution (SeeSR), Different Blind Image Restoration (DiffBIR), and Pixel-Aware Stable Diffusion (PASD), on the WHU-Mix remote sensing dataset. The evaluation uses seven metrics: Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM), Learned Perceptual Image Patch Similarity (LPIPS), Frechet Inception Distance (FID), CLIP Image Quality Assessment (CLIP-IQA), Multi-Scale Image Quality Transformer (MUSIQ), and Multi-Dimension Attention Network for No-Reference Image Quality Assessment (MANIQA). Quantitative results show that StableSR achieves the highest PSNR of 23.16 dB, PASD obtains the best SSIM of 0.81 and lowest LPIPS of 0.45, SeeSR achieves the best MUSIQ of 64.57 and MANIQA of 0.46, and DiffBIR achieves the best FID of 110.58 and CLIP-IQA of 0.68 but with weaker full-reference fidelity. These findings indicate that current SD-based methods favor different aspects, including fidelity preservation, perceptual quality, and generative realism, and should be selected according to the target remote sensing application.

Learning-based monocular depth estimation for photogrammetric 3D reconstruction

Chunyu Dou^1,2, Yifei Yu¹, Xin Wang¹, Zongqian Zhan¹

¹School of Geodesy and Geomatics, Wuhan University, China; ²School of Geography, Nanjing Normal University, China

Monocular depth estimation (MDE) infers depth from a single image, offering significant advantages in computational efficiency and memory consumption compared to conventional Multi-View Stereo (MVS) methods. However, most MDE methods suffer from poor multi-view geometric consistency, which limits their application to photogrammetric 3D reconstruction. To address this issue, this paper employs sparse point clouds of Structure-from-Motion (SfM) as extra geometric constraints and proposes a framework that achieves photogrammetric 3D reconstruction using off-the-shelf learning-based MDE models without the need for additional fine-tuning. Specifically, when SfM priors are available during inference, globally geometrically consistent depth maps can be directly predicted. Otherwise, the estimated monocular depths are aligned to a consistent scale using SfM results via a post-correction step. The resulting depth maps are then fused using a truncated signed distance function (TSDF) to generate dense 3D reconstructions. Experiments on photogrammetric datasets demonstrate that the proposed framework effectively improves geometric consistency across depth maps and enables high-quality scene reconstruction. In addition, we systematically analyze the impact of key parameters in depth inference and fusion, including depth map resolution, voxel size, denoising steps, and ensemble size, on reconstruction performance, and further explore the potential of MDE for photogrammetric 3D reconstruction.

From Peaks to Crowns: A Morphology-Based UAV-LiDAR Framework for Individual Tree Segmentation

Yijun Guo¹, Hangpu Li¹, Jingyi Yuan¹, Huiyi Su¹, Tianhui Xu², Honggang Sun², Qin Ma^3,1,4,5

¹School of Geography, Nanjing Normal University, Nanjing 210023, China.; ²Research Institute of Subtropical Forestry of Chinese Academy of Forestry, Hangzhou 311400, China.; ³State Key Laboratory of Climate System Prediction and Risk Management, Nanjing, China.; ⁴Key Laboratory of Virtual Geographic Environment (Nanjing Normal University), Ministry of Education, Nanjing, China.; ⁵Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing, China.

Recognising individual trees has important applications in forest ecology and management. Conventional individual tree segmentation methods tend to favour dominant trees with pronounced canopy surface features but have limited capability in detecting subdominant trees that are partially occluded or have smaller crowns. To mitigate this issue, we propose a morphology-based method for individual tree segmentation. First, a treetop extraction method is developed based on morphological criteria. Candidate treetops are initially detected using local maximum filtering, followed by classification and validation through vertical profile analysis integrated with crown morphological characteristics. Subsequently, the extracted treetops serve as seed points to guide individual tree crown delineation within a Min cut/Max flow graph cut framework, leveraging the spatial relationships among points. Our method enhances the detection of subdominant trees, with detection rates climbing to 90–95%, and achieves an average F score of 0.8 for crown delineation, which outperforms the other methods by 0.24 points. By integrating treetop information with local crown features, the proposed method improves the detection and segmentation accuracy of subdominant trees in complex forest environments, supporting overstory structure analysis and individual tree inventory in intricate forests.

Steel Transmission Towers UAV Photogrammetric reconstruction for Corrosion Quantification supported by Deep Convolutional Neural Networks

Vincenzo Di Pietra¹, Pierclaudio Savino², Fabio Graglia³

¹Department of Environment, Land and Infrastructure Engineering - Politecnico di Torino, Italy; ²Tecne - Gruppo Autostrade per l'Italia, Roma, Italy; ³Rai Way S.p.A., Roma, Italy

This paper presents an automated approach for quantifying corrosion surface areas in steel transmission towers by integrating Unmanned Aerial Vehicle (UAV) photogrammetry and deep convolutional neural networks (DCNNs). Traditional visual inspections for corrosion pose significant challenges to structural safety and maintenance planning due to their complexity, subjective nature, high costs, and safety risks associated with inspecting tall structures. The proposed methodology utilizes a DeepLabv3+ model for the semantic segmentation of corroded areas. The network was trained and validated using a robust dataset of 999 field photographs collected from on-field tower inspections. A comparative analysis of DCNN backbones identified MobileNetV2 as the optimal choice, offering a superior balance between accuracy and computational efficiency. After fine-tuning, the network achieved an acceptable validation accuracy of 90.8% and a validation loss of 0.23. A major contribution of this study is the integration of these deep learning algorithms with metrically accurate photogrammetric products. The trained network was applied to orthomosaics derived from the 3D reconstruction of the South-East tower at the Torino Eremo broadcasting center. Unlike traditional image segmentation which lacks spatial reference, the photogrammetric approach enables the quantification and localization of the corrosion extent in exact physical dimensions. The high accuracy of the orthomosaic was confirmed against ground-truth measurements, achieving a root mean square error of 0.87 mm.

This automated, deep learning-based framework streamlines the detection process, provides reliable and quantitative data for assessing structural integrity, and represents a significant advancement over manual inspections, enhancing the overall efficiency, safety, and accuracy of infrastructure maintenance

Urban Building Function Mapping using AlphaEarth Foundations and OpenStreetMap

Yaoqi Xie, Hao Wu, Lanfa Liu

School of Urban and Environmental Science, Central China Normal University, China

Accurate identification of urban building functions is crucial for smart city planning and sustainable development. AlphaEarth Foundations introduce a new paradigm in remote sensing by providing semantically rich, pre-trained embeddings that integrate multi-sensor, spatiotemporal, and contextual information. In this study, we propose a novel fusion of 64-dimensional AlphaEarth embeddings and OpenStreetMap (OSM) derived building spatial indicators. We use the city of Toulouse as the study area, with the French official OCS GE database providing the ground truth labels. A random forest classification model was constructed, and the classification performance of single-source versus multi-source feature fusion was systematically compared. Results demonstrate that the multi-source feature fusion model achieves optimal classification performance, with an overall accuracy of 72.1\%, significantly surpassing models relying solely on embedding features (68.7\%) or spatial features (53.3\%).

The findings demonstrate the effectiveness and superiority of integrating AlphaEarth embeddings and OSM-derived building spatial indicators for automated urban building function identification, and provide a reliable technical approach for achieving large-scale and high-precision urban functional mapping.

Risk-Aware LLM Agents for Geospatial Data Retrieval: Design and Preliminary Adversarial Evaluation

Kyle Gao¹, Joel Cumming², Jonathan Li^1,3, Linlin Xu⁴, David A. Clausi¹

¹Department of Systems Design Engineering, University of Waterloo, Canada; ²SkyWatch, Canada; ³Department of Geography and Environmental Management, University of Waterloo, Canada; ⁴Department of Geomatics Engineering, University of Calgary, Canada

We present an LLM-driven framework for retrieving remote sensing data from cloud-based geospatial catalogues using natural language queries. The system converts user intent into structured API calls, enabling efficient access to satellite imagery and environmental datasets. The architecture integrates three agents: Guardrail for safety and policy enforcement, General-QA for intent interpretation, and Recommender-Analyst for schema-aware API call generation. This coordinated design ensures reliable, semantically aligned interaction with external data services. The modular framework is portable across platforms through API schema substitution and supports applications in environmental monitoring, disaster response, and climate analysis. It establishes a scalable interface between user intent and geospatial infrastructure, enabling streamlined and automated Earth observation workflows. Preliminary experiments under adversarial multi-turn settings show that prompt-level safety instructions improve robustness, although rare high-impact failures persist in API manipulation scenarios and highlight the need for adaptive, system-level defenses that balance safety, usability, and cost efficiency, which motivates the use of our intercept-level Guardrail agent.

Real-time solar farms defect detection with YOLO based EDGE OVDs using thermal UAV images

Mouad Jabrane^1,2, Aissam Salah¹, Kenza Ait El Kadi^1,2, Imane Sebari^1,2

¹Department of Photogrammetry and Cartography, School of Geomatics and Surveying Engineering; ²Research Unit of Geospatial Technologies for a Smart Decision

This paper introduces the second version of an end-to-end framework, which is the EDGE-Solar Farm Observation System (EDGE-SFOS v2.0). This system was developed for real-time solar farm defect detection with Edge generative detectors using drone images.

Benchmarking and Deep-Learning-Based Bias Adjustment of Gridded Meteorological Datasets for Agricultural Applications

Abdelrahman Saleh, Ali Youssef, Nasem Badreldin

Digital AgroEcosystems Lab, Department of Soil Science, Faculty of Agricultural and Food ScienceUniversity of Manitoba, Canada

This study addresses the critical issue of systematic biases in gridded meteorological datasets, which can lead to inaccurate agricultural predictions and flawed decision-making. The primary objective is to develop a unified, high-accuracy meteorological dataset for Manitoba to support agricultural applications. The study focuses on the 2005–2024 period and on key variables commonly used in agriculture, including minimum temperature, maximum temperature, precipitation, and solar radiation. The methodology involves two main stages. First, four widely used national and international gridded datasets, ERA5-Land, Daymet, CHIRPS, and ANUSPLIN, will be benchmarked by comparing gridded values extracted at the locations of more than 120 Manitoba weather stations with the corresponding station observations. Second, the best-performing dataset for each variable will be selected for bias adjustment. Traditional statistical methods, such as Linear Scaling and Quantile Mapping, will be compared with machine-learning and deep-learning approaches, including Linear Regression, Random Forest, XGBoost, DNN, LSTM, and 1D-CNN. The study is expected to provide a quantified assessment of dataset reliability for Manitoba and to produce an improved bias-adjusted meteorological dataset for regional applications. The resulting dataset is intended to support more accurate agro-climatic assessments, regional yield estimation, and crop modelling, while also offering a scalable framework for similar agricultural regions.

Comparative Assessment of GeoAI-based Frameworks for Automatic Urban Tree Cover

Francesco Pirotti^1,2, Linda Avesani³, Matteo Pianetti⁴, Riccardo Greco³, Catherine Dezio², Davide Quaglia⁴

¹Interdepartmental Research Center in Geomatics (CIRGEO), University of Padova, Italy; ²Department of Land, Environment, Agriculture and Forestry (TESAF), University of Padova, Italy; ³Department of Biotechnology, University of Verona, Verona, Italy; ⁴Department of Informatics, University of Verona, Verona, Italy

Accurate mapping of urban tree canopy is essential for quantifying ecosystem services and assessing the impact of green infrastructure on wellbeing and public health. This study evaluates and compares three Geospatial Artificial Intelligence (GeoAI) frameworks for the automated detection and segmentation of tree cover. The frameworks are YOLO, Detectree, and TreeEyed Utilizing high-resolution aerial imagery (0.2 m and 0.5 m ground sampling distance), the research tests different deep-learning paradigms, including object detection and semantic segmentation. The results indicate that while object-based models like YOLO align closely with statistical baselines (30.83% vs 30.11%), pixel-based models such as Detectree may underestimate fragmented urban vegetation. The study highlights the effectiveness of the TreeEyed QGIS plugin for urban applications and emphasizes the necessity of local LiDAR-derived data for model validation. Further studies would benefit from ad-hoc training with correct co-registration and consistent coordinate reference systems across layers.

MRGF:A robust SLAM Framework based on Millimeter wave Radar and GNSS Fusion in Harsh Environments

Fangcheng Qu¹, Yuheng Zhang³, Xianlang Wei⁴, Youchen Tang⁵, Jian Zhou^2,5

¹Wuhan University, School of Geodesy and Geomatics; ²Hubei Luojia laboratory; ³Wuhan University, College of Earth and Space Sciences; ⁴Wuhan University, School of Electronic Information; ⁵Wuhan University, State Key Laboratory of Information Engineering in Surveying

Maritime vehicles face significant positioning challenges under adverse weather conditions where visual and laser SLAM systems suffer from severe degradation. Millimeter-wave radar offers inherent robustness to weather interference, yet single-band radar cannot simultaneously achieve accurate translation and robust attitude estimation.This paper proposes a complementary fusion framework for multi-band radar odometry.This system leverages W-band radar (CFEAR) for reliable translation estimation and combines it with X-band radar (LodeStar) to improve rotational estimation robustness. The main innovations are as follows:(1) A complementary fusion framework exploiting the complementary characteristics of W-band and X-band radar; (2) A quality-aware adaptive weighting mechanism dynamically computing fusion weights based on sensor data quality assessment; (3) A consistency gating mechanism monitoring inter-sensor agreement and activating protective measures during sensor degradation.Experiments on the MOANA maritime dataset demonstrate that the proposed method achieves stable and reliable local motion estimation, reaching an RTE RMSE of 1.67 m on the Near-Port sequence.

Gaussian splatting for the reconstruction of complex and highly detailed object

Sara Gonizzi Barsanti¹, Dario Billi^2,3, Etienne Sommer², Erika Elefante¹, Arnadi Murtiyoso²

¹Department of Engineering, Università degli Studi della Campania Luigi Vanvitelli 81031 Via Roma 29, Aversa (CE) Italy; ²Université de Strasbourg, INSA Strasbourg, CNRS, ICube Laboratory UMR 7357, Photogrammetry and Geomatics Group, 67000 Strasbourg, France; ³Department of Civil and Industrial Engineering, ASTRO Laboratory, University of Pisa, Largo Lucio Lazzarino, 56122 Pisa, Italy

In recent years, Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) have emerged as advanced methods for photogrammetry-based 3D reconstruction. Since its introduction in 2020, NeRF has gained significant attention due to its capability to generate high-fidelity reconstructions from multi-view imagery. More recently, 3D Gaussian Splatting (3DGS), introduced in 2023, has proposed an alternative explicit scene representation based on a collection of anisotropic Gaussian primitives optimized directly in 3D space. This representation allows efficient rendering and scalable modelling of complex scenes while maintaining high visual quality. This paper analyses the performance of different 3DGS methods when dealing with complex geometry and less-cooperative surfaces compared to standard SfM IM procedures. Included in the comparison is also the Mesh-In-the-Loop Gaussian Splatting for Detailed and Efficient Surface Reconstruction (MILo), a novel meshing method using Gaussian splats. Three Gaussian splatting methods as implemented in the Postshot commercial software were also tested. Our experiments show that MILo shows very promising results in terms of detail reconstruction, while standard Gaussian splatting excels in visualisation but is still plagued by a high rate of noise especially when converted into a geometric point cloud form.

Towards a national geospatial digital twin in Slovenia

Nina Crnigoj¹, Anka Lisec¹, Ana Potočnik Buhvald¹, Krištof Oštir¹, Jernej Tekavec¹, Anže Žnidaršič², Aleš Vidiček², Katja Šušteršič²

¹University of Ljubljana, Slovenia; ²Flycom Technologies d.o.o., Slovenia

In this paper, we present the design and pilot implementation of Slovenia’s national geospatial Digital Twin (DT), coordinated by the Surveying and Mapping Authority of the Republic of Slovenia (GURS). Geospatial digital twins are enriched digital replicas of real world environments, dynamic models capturing past, present, and projected states to support geospatial decision making, location based services, and scenario simulations.

To demonstrate how the Slovenian Geospatial DT can be applied in practice, a prototype for modelling and managing flood hazards was developed. A flood-hazard prototype demonstrates the approach using the August 2023 event. The flood-modelling framework integrates very high-resolution (VHR) geospatial datasets with in situ environmental observations to ensure detailed spatial representation and analytical consistency. It combines ALS-derived terrain models with hydrological time series, meteorological forecasts, and satellite-based water detection from sources such as Sentinel-1/-2 and PlanetScope, enabling three-dimensional simulation and visualisation of flood dynamics.

The results show how a geospatial DT can transform authoritative datasets into operational intelligence for emergency management, spatial planning and climate-risk scenarios. Beyond floods, the architecture generalises to landslides, drought monitoring, infrastructure condition assessment and biodiversity applications.

UAV data fusion approach to assess vegetation recovery dynamics after pipeline construction

Mattia Balestra^1,2, Stefano Chiappini¹, Bojana Petrovic^1,3, Filippo Giusti⁴, Giacomo Quattrini², Simone Pesaresi², Marco Bonacoscia⁴, Roberto Pierdicca¹

¹Department of Construction, Civil Engineering and Architecture (DICEA), Università Politecnica delle Marche, 60131 Ancona, Italy; ²Department of Agricultural, Food and Environmental Sciences (D3A), Università Politecnica delle Marche, 60131 Ancona, Italy; ³Department of Geology and Soil Science, Faculty of Forestry and Wood Technology, Mendel University in Brno; ⁴Hystrix - Società di ricerca, progettazione e consulenza naturalistica ed ambientale, 61032 Fano, Italy

Post-construction vegetation monitoring along linear infrastructures is increasingly required to support evidence-based restoration assessment, yet conventional ground surveys remain spatially sparse and difficult to scale over narrow, heterogeneous corridors. This limitation is particularly critical in recently replanted pipeline clearings, where plant-level restoration outcomes must be inferred under operational constraints and where satellite-based monitoring cannot reliably resolve early post-restoration signals at the scale of individual saplings. This study addresses the problem by developing a UAV data-fusion workflow that integrates UAV laser scanning (ULS), UAV multispectral imagery (UAV-MS), and ultra-high-resolution UAV-RGB observations for sapling-level vitality assessment. The workflow was tested in two restored pipeline corridor sites in the central Apennines (Italy), Ponte Baffoni (4.6 ha) and Ca' Romano (1.4 ha), surveyed in May 2025. ULS data were used to detect and geolocate individual saplings, UAV-MS data were used to extract vegetation-index metrics (NDVI, GNDVI, NDRE), and UAV-RGB imagery supported plot-level expert validation. A PCA-based soft-labelling strategy generated proxy vitality labels, which were then used to train a Random Forest classifier to derive corridor-scale probabilistic maps of sapling vitality, subsequently expressed as ALIVE, DEAD, and UNCERTAIN classes. Random Forest classification achieved balanced accuracies of 0.78 and 0.83, respectively. The resulting corridor-scale maps suggested mortality rates of 48.9% in Ponte Baffoni and 40.0% in Ca' Romano. These results suggest that multi-sensor UAV fusion can provide spatially explicit, sapling-level indicators of restoration performance, complementing field surveys and supporting operational post-construction assessment in narrow restoration corridors.

A pipeline for automatic building reconstruction for Digital Twins in complex urban environments

Laura Candela¹, Luigi D'Amato¹, Alessandro Di Benedetto², Margherita Fiani², Lucas Matias Gujski²

¹Italian Space Agency (ASI), Rome, Italy; ²Department of Civil Engineering, University of Salerno, Fisciano (SA), Italy

Automatic building reconstruction is a strategic component for creating urban Digital Twins (DTs), enabling the generation of accurate and interoperable Level of Detail 2 (LOD2) models. These models provide an essential standard for applications such as Geographic Information Systems (GIS), energy and hydraulic simulations, and urban planning. To address these needs, the MEDUSA (MEDiterraneo: Uso Sostenibile dell’Ambiente) project, promoted by the University of Salerno and funded by the Italian Space Agency (ASI), developed an innovative pipeline. The method was optimized to model areas with complex geometries and articulated roofs, utilizing the Amalfi Coast as a test area. The developed workflow is based on the City3D algorithm, integrating LiDAR (Light Detection And Ranging) data with building footprints derived from the Regional Topographic Database (RTDB). The process involves point cloud segmentation to isolate buildings and the generation of a Triangulated Irregular Network (TIN) mesh. Roof contours are identified using edge detection operators, simplified into polylines, and regularized using geometric constraints like parallelism and orthogonality to ensure LOD2 compliance. Finally, polygons are vertically extruded and optimized through the PolyFit framework, ensuring closed and topologically correct polygonal models. To overcome computational challenges and LiDAR data variability, significant improvements were introduced, including process parallelization, alignment with the Digital Terrain Model (DTM), and batch management of GeoJSON files. These enhancements successfully increased the pipeline's robustness and efficiency. The enriched pipeline produces high-quality LOD2 models, laying a solid foundation for next-generation urban modeling capable of meeting the scalability and interoperability requirements of future smart cities.

Synthetic data generation for architectural typology documentation using diffusion models

Pedro Marco Achanccaray Diaz¹, Leonhard Wesche², Markus Gerke¹, Sebastian Hoyer², Klaus Thiele²

¹Institute of Geodesy and Photogrammetry, Technische Universitat Braunschweig, Germany; ²Institute of Steel Structures, Technische Universitat Braunschweig, Germany

The identification and systematic recording of industrial buildings pose significant challenges for modern monument preservation. In particular, system halls have shaped the industrial landscape since the 19th century but often elude complete documentation because of their widespread distribution. These buildings serve as vital witnesses to technical innovations and economic transformation; however, assessing their architectural value requires a comprehensive inventory to determine the rarity or preservation state of specific building types.

Deep learning (DL) approaches are commonly used for the automatic recording of these buildings in aerial photographs, where the primary obstacle is the scarcity of curated training datasets. We overcome this by employing generative AI, specifically Stable Diffusion (SD), to produce synthetic data. By fine-tuning the SD model with Low-Rank Adaptation (LoRA), we successfully replicate the appearance and textures of various hall types. To resolve the spatial incoherence and geometric inaccuracies inherent in standard text-to-image generation, we integrated ControlNet. This allows for precise structural grounding using semantic masks, where specific colors represent building types, and polygon shapes define their exact locations. The resulting model generates accurate synthetic samples that maintain both spectral authenticity and an accurate spatial layout. Their usability was assessed by training a building detection model on both the real and synthetic datasets, achieving 71.9 and 66.7 mIoU, respectively. Moreover, introducing a few real samples for validation during training increased the mIoU to 82.7. The detection results demonstrate that the synthetic dataset is a reliable source for training, yielding robust generalization.

Crops and Varietal Discrimination using PRISMA Hyperspectral Data

Manik Kalubarme¹, Dharmesh Verma², A.N. Singh³

¹Space Application Centre, Indian Space Research Organization (ISRO), Ahmedabad, India; ²Terrasesnse Intellicrop Pvt. Ltd. New Delhi, India; ³Remote Sensing Applications Centre, Uttar Pradesh (RSAC-UP), Lucknow, India

The PRISMA hyperspectral narrow-band data covering part of the Jind district during the Kharif Season 2024 were acquired to discriminate between two rice varieties, namely High-Yielding Variety (HYV) and Aromatic Basmati. In this study, hyperspectral bands were selected from the 240 hyperspectral bands of PRISMA data using Selective Principal Component Analysis (SPCA), which is specifically useful for crop classification. The subset of 9 PRISMA hyperspectral bands corresponding to the Sentinel-2 MSI bands was selected for rice crop classification and variety discrimination. The main difference between PCA and SPCA is that SPCA chooses only a subset of bands depending on the desired objectives of the study. The first three principal components (PCs) explained over 98 % of the variance of all spectral bands. The scatter plots of PC-1 and PC-2 indicated that there is a clear distinction between HYV and Basmati rice varieties.

In the present analysis, narrow-band hyperspectral red-edge group indices, such as Ratio Vegetation Index (RVIs), Green Normalised Difference Vegetation Index (GNDVI), and Chlorophyll Green Index (Clgreen), were generated to study their effectiveness for rice variety discrimination. The Spectral Angle Mapper (SAM) algorithm was used for supervised classification, and the results were validated using the time series S1 and S2 classified data. The results of validation indicated that using single-date hyperspectral data with 30 m spatial resolution, it was possible to discriminate between Basmati and HYV rice; however, it was not possible to discriminate between traditional and evolved Basmati rice varieties.

Real-Time Visualization of Cadastral Information from German Authorities Using Augmented Reality

Eike Marvin Sippel¹, Andreas Wichmann²

¹Landesamt für Geoinformation und Landesvermessung Niedersachsen (LGLN), Germany; ²Jade University of Applied Sciences, Germany

Real-time visualization of cadastral information through augmented reality (AR) has emerged as a significant challenge for public authorities in recent years. This paper addresses the potential, usage, and challenges of AR in the public sector. The prototype developed for this study demonstrates the visualization of geospatial data from ALKIS (Amtliches Liegenschaftskatasterinformationssystem, engl. Authoritative Real Estate Cadastre Information System), visualizing the boundaries and points of parcels in AR. Field tests conducted within this study assess the accuracy and usability of the AR visualization. As part of the study, existing AR libraries and frameworks were evaluated to select the most suitable platform for the prototype. The research underlines the potential of AR for geospatial applications, although it points out current precision limitations in the absence of external GNSS (Global Navigation Satellite System) receivers. The outcomes demonstrate the capabilities of AR visualization in a geospatial context and provide concrete approaches for optimizing future applications and research initiatives.

Integrating timber stability analysis for building life cycle management and HBIM framework support

Maria Chizhova¹, Junquan Pan¹, Gerold Kempter², Thomas Eissing¹, Thomas Luhmann³, Mona Hess¹

¹University of Bamberg, Germany; ²BauCaD *K+R* Kempter GmbH; ³Jade University of Applied Sciences

Modelling old buildings according to BIM standards is challenging, as historical architecture often features complex geometries and subject-specific information that is difficult to classify. This applies also to historic timber roof structures. The geometric complexity of historic timber structures makes them laborious and time-consuming to model using standard 3D software. In the case of aged heritage wooden beams, a lot of additional information should be parameterised. This information is derived from optical analysis as well as timber geometry and surface features, what is usually omitted in Open BIM.

In this paper we demonstrate a pipeline of data transfer from smartphone-based interface analysing automatically wood strength factor to BIM. This prototype interface allowing wood knottiness estimation for assessment of unknown strength values by aged heritage timbers as well as information connection to BIM framework.

A Multilingual LLM-Based GeoAI Framework for Natural-Language-Driven Remote Sensing Analysis

Hossein Barakati¹, Majid Kiavarz¹, Hamid Kiavarz²

¹Department of Remote Sensing and GIS, Faculty of Geography, University of Tehran, Ira; ²Department of Earth & Space Science & Engineering, York University, Toronto, Canada

The exponential growth of remote sensing data in recent years has underscored the need for intelligent, fast, and user-friendly analytical tools. Despite advancements in platforms such as Google Earth Engine and ENVI, the computation of spectral indices still demands specialized expertise, considerable time, and complex parameter tuning. This study aims to reduce the complexity of spatial data analysis and enhance its accessibility for non-expert users by developing an intelligent system capable of transforming simple natural language commands into automated remote-sensing index calculations. The main innovation lies in integrating Large Language Models (LLMs) with geospatial processing to establish a lightweight, multilingual, and fully automated framework capable of identifying index types and selecting appropriate spectral bands from Landsat data. The system was implemented using the Bloomz-560m language model in combination with open-source image-processing engines and deployed as a web-based interface. Experimental results over Tehran demonstrated that the model outputs were highly consistent with those generated by Google Earth Engine and ENVI, achieving an RMSE of 0.016 and a correlation coefficient of R² = 0.957. The total processing time was under 45 seconds, with the entire workflow executed automatically without user intervention. By simplifying the analytical process and significantly reducing computation time, this framework represents a crucial step toward democratizing remote sensing and spatial analysis. It can be effectively applied to urban surface heat island (SUHI) monitoring, water resource management, and precision agriculture applications.

Urban-Graph: Bridging Local SLAM and Global EO for Fine-Grained LCLU Mapping

MingHao Yu^1,2, Chenyang Wang³, Youchen Tang¹, Xiao Wu⁴, Jian Zhou^1,2

¹Wuhan University, State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, 430070 Wuhan, Chin; ²Hubei Luojia Laboratory, Wuhan University, Wuhan, China; ³State Key Laboratory of Marine Thermal Energy and Power Wuhan Second Ship Design and Research Institute, 430074 Wuhan, China; ⁴Wuhan University of Science and Technology, Key Laboratory of Metallurgical Equipment and Control Technology, Ministry of Education, Wuhan 430081, China

Global Earth Observation provides coarse LCLU maps, classifying complex urban areas as a single Built-Up class. This limits urban modeling and product validation. Conversely, local SLAM offers fine-grained semantic detail but suffers from large-scale drift and lacks a global coordinate system.

We introduce Urban-Graph, a novel AI fusion framework to bridge this gap. Our system centers on a semantic scene graph to manage multi-scale information. It fuses three data sources: satellite imagery as a global prior, vehicle-based SLAM for local semantic detail, and fixed roadside infrastructure for high-precision GNSS anchors. A factor graph optimizer integrates these local, global, and anchor constraints.

This process generates a large-scale, globally-consistent, and geospatially-anchored semantic map. This resulting graph serves a dual purpose. It provides a drift-free map for local systems and functions as a scalable, high-fidelity ground-truth product to automate the fine-grained validation and decomposition of coarse urban LCLU classes.

Using NeRFs for UAV-based 3D reconstruction of complex scenes: A comparison to MVS

Frederik Schulte, Perpetual Akwensi, Lukas Winiwarter

Unit of Geometry and Surveying - University of Innsbruck, Austria

High-resolution 3D documentation of cultural heritage sites is essential for their preservation. While terrestrial laser scanning (TLS) remains the gold standard, it is often cost-intensive compared to photogrammetry. This study evaluates three image-based reconstruction techniques, Multi-View Stereo (MVS), Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS), by applying them to a complex scene featuring a chapel and its surrounding vegetation, sensed from an uncrewed aerial vehicle (UAV). A hybrid TLS/MVS model provides a high-accuracy reference. Using identical interior and exterior camera parameters of the 105 UAV-acquired images, we generate dense point clouds with all methods and assess geometric accuracy and completeness using the M3C2 algorithm. Results show that MVS achieves superior accuracy (standard deviation of all M3C2 distances: MVS = 0.11 m, NeRF = 0.15 m), whereas NeRF attains up to 20% higher completeness, particularly in low-texture and vegetation-occluded regions. The 3DGS point cloud was deemed too sparse and was therefore not used for further analysis. The study highlights the potential of NeRFs to recover partially occluded or sparsely textured geometries that are challenging for MVS and suggests a complementary use of both approaches for cost-efficient documentation of cultural heritage.

Pre-ignition forest fire risk prediction using multi-temporal vegetation indices and machine learning: A case study from California

Jayantrao Diliprao Mohite, Suryakant Ashok Sawant, Ankur Pandit, Dineshkumar Singh

Tata Consultancy Services, India

This study presents a machine learning-driven approach to forecast forest fire risk in California’s high-risk regions, aiming to predict fire-prone areas one month in advance. By integrating static topographical features with dynamic vegetation indices—such as NDVI, NDWI, GPP, and LAI—and their derivative components like trend and Exponential Moving Average (EMA), the model captures critical indicators of vegetation health and moisture. Among several algorithms tested, Logistic Regression (LR) consistently outperformed others, achieving a validation AUC of 0.90 when combining static and dynamic features. A 12-month historical time window proved most effective, enabling the model to learn seasonal and long-term vegetation patterns. Validation on independent datasets showed promising results for 2021 (AUC 0.84), though performance dropped in 2024 (AUC 0.64), likely due to satellite data shifts and ecological changes. These findings underscore the importance of long-term vegetation monitoring and robust feature engineering for accurate fire risk prediction. The study offers a practical tool for early warning systems, while highlighting the need to address data variability and environmental dynamics for sustained performance.

Mapping natural disasters using social media posts with an encoder-decoder model

Nima Ekhtari¹, Andrew Niccolai², Kevin Slocum², Craig Glennie¹

¹University of Houston, United States of America; ²Cold Regions Research and Engineering Lab, Army Corps of Engineers

This work showcases mapping a natural disaster using social media posts of users (tweets) during the ongoing event. We have finetuned an encoder-decoder model and created a model that detects toponyms from tweets very efficiently. Toponyms are then resolved to geographical coordinates and features so that temporal heatmaps can be created effectively mapping the natural disasters through social media posts.

Encoder-decoder models are generally used for machine translation or summarization tasks in NLP. We show that through finetuning with proper data, a lightweight encoder-decoder model deployed locally can generate comparable results to prompting web-deployed large language models.

Enhanced Urban Land Cover Mapping and Green Space Assessment for a Medium-Sized City: A Case Study in Alta Gracia, Argentina

Silvana Palavecino

Mario Gulich Institute for Advanced Space Studies (CONAE–UNC), Córdoba, Argentina

High-resolution mapping of urban land cover and urban green infrastructure (IVU) is essential for medium-sized cities, where global datasets often fail to capture fine-scale patterns. This study presents a local-scale land-cover classification for Alta Gracia, Argentina. The approach integrates medium- and high-resolution imagery with object-based segmentation (SNIC) and Random Forest classification. Six vegetation indices (NDVI, EVI, SAVI, GNDVI, MSAVI, VARI) were used to enhance class separability, while PlanetScope mosaics and local orthophotos improve spatial detail. Accuracy was assessed using overall accuracy, Cohen’s Kappa, and F1 Score.

The resulting land-cover map was used to delineate and quantify urban green infrastructure. Green-cover areas were summarized across city-defined sectors. Results were compared with regional layers from IDECOR and the global Dynamic World product, showing that global datasets underestimate fine-scale vegetation and fail to capture small or fragmented patches. The high-resolution local map substantially improves spatial accuracy and IVU delineation, serving as a baseline for urban planning, green-space management, and climate-resilient strategies.

This study demonstrates the value of combining multi-source imagery, object-based methods, and machine-learning classification to refine local land-cover mapping and IVU assessment. The methodology is reproducible using open-source tools (Google Earth Engine, QGIS, and R) and transferable to other medium-sized Latin American cities with limited data availability. Future work will integrate LiDAR-derived canopy metrics and citizen science to validate and enhance local products.

This contribution links local mapping with broader land-cover/use frameworks, supporting the ISPRS ThS21 global–local dialogue and providing actionable evidence for sustainable urban development.

Single-image estimation of Brown–Conrady distortion in Fringe Projection Profilometry

Osman Ali¹, Dalia Elobaid³, Xiangjun Kong¹, Tibebe Yalew¹, Waiel Elmadih², Samanta Piano¹

¹University of Nottingham, United Kingdom; ²Taraz Metrology, United Kingdom; ³Sudanese Materials Scientists & Engineers

This work presents a hybrid camera calibration approach that combines the strengths of standard photogrammetric camera calibration with data-driven lens distortions correction. Conventional calibration methods, such as those based on Zhang’s model, estimate lens distortions by fitting polynomial functions to the calibration images coordinates. While these methods are well established, they may struggle to fully describe complex or setup-dependent distortions, particularly near image borders or under varying environmental conditions.

To address this, a learning-based model is introduced to directly predict the distortion coefficients from calibration images. The network is trained using real data, allowing it to capture lens- or condition-specific variations that conventional calibration may overlook. The predicted coefficients maintain the same format as those used in standard photogrammetric models, ensuring compatibility with existing calibration toolchains such as OpenCV or MATLAB.

The proposed approach, therefore, aims to automate the estimation of distortion parameters while preserving the interpretability and mathematical foundations of traditional models. Although the primary focus is on camera calibration, the method offers further advantages for optical metrology systems such as fringe projection, where accurate and consistent distortion compensation is essential for depth measurement reliability.

Integrating Advanced AI techniques to assist Urban Digital Twins Generation

Jiaojiao Tian, Mareike Weishaupt, Veronika Gstaiger, Guneet Mutreja, Chaikal Amrullah, Corentin Henry, Thomas Krauß, Pablo d' Angelo, Sandeep Kumar Jangir, Stefan Auer, Franz Kurz, Ksenia Bittner

German Aerospace Center (DLR), Germany

Digital twins play a crucial role in autonomous driving applications and transportation system simulations. The need for large scale and dynamic information has increased interest in generating urban digital twins from remote sensing data.

Aerial high resolution imagery of urban areas serves as the one of the most important data sources for this task. Advances in deep learning and machine learning allow more accurate and automated extraction of urban elements. In recent years, we have developed and integrated advanced deep learning models to extract various land cover types surrounding road networks, including buildings, roads, and vegetation. Furthermore, we have conducted proof of concept studies aimed at detecting and delineating linear landmarks from aerial imagery, including curbstones and road borders. These developments contribute to the creation of more accurate and detailed urban digital twins, which are essential for advanced urban analytics and intelligent transportation systems.

Results from the deep learning models are presented for the Schwarzer Berg district in Brunswick, Germany, which is a test region for the development of mobility services and technologies at the German Aerospace Center (DLR). The AI models are trained using benchmark datasets from other urban regions, indicating that the proposed approaches can be readily transferred and evaluated in other European cities.

Towards visualizing oceanographic Bibliometric Data across Canada

Colin Orian, Poppy Riddle, Zaiyang Zhong, Geoff Krause, Stephen Brooks, Philippe Mongeon

Dalhousie University, Canada

In this work, we demonstrate our early results in geocoding oceangraphic research articles across Canada. Through the use of AI, we extracted locations out of the abstracts of research articles and then assigned a latitude and longitude to those works based off of the locations extracted. The geocoded works are then displayed. Our work allows a user to identify locates across Canada that are being actively researched and find research specialists of those locations. We intend to develop this tool further by collaborating with journalists.

Data-centric approach for land use and land cover classification in Brazil

Glauber J. Vaz^1,2, João Francisco G. Antunes¹, Júlio C.D.M. Esquerdo¹, Alexandre C. Coutinho¹, André S. Tavares¹, Adriane Calaboni¹, Lídia S. Bertolo¹, Filipe C. Felix¹, Anderson Rocha²

¹Embrapa Digital Agriculture, Brazil; ²Recod.ai, Institute of Computing, University of Campinas

Land use and land cover (LULC) classification plays a crucial role in addressing numerous real-world challenges. Hence, we proposed methodological advances in LULC classification from a data-centric artificial intelligence perspective, which prioritizes data quality as a key factor in improving machine learning performance. The main contributions include evaluations of novel approaches for: (i) constructing an accurately labeled dataset based on agreement among existing reliable maps; (ii) curating remote sensing data to improve accuracy, consistency, unbiasedness, relevance, diversity, and completeness; (iii) generating training samples that capture the spatial, temporal, and spectral dimensions of remote sensing data; and (iv) developing a deep learning model designed to leverage multidimensional features. The study evaluates a sample generation method grounded in reference map agreement and multidimensional feature extraction, along with a deep learning model that leverages these features, attaining high accuracy across all LULC classes and providing a robust basis for large-scale, data-centric LULC mapping.

Forest cover dynamics: impact on ecosystem services and environmental sustainability in biodiversity-rich Western Ghats of India

S Abdul Rahaman¹, K R Balaramdas²

¹Sathyabama Institute of Science and Technology, Chennai, India; ²Bharathidasan University, Tiruchirappalli, India

Global forested areas are decreasing at a rapid rate, leading to environmental instability, altered climate patterns, and a decline in ecosystem services. In the present study, the Western Ghat (WG) region is one of the major forest resources in the Indian southern peninsula; it regulates/balances the weather conditions with the unique features of high-rise mountains and tall trees. This mountain chain is recognised as one of the world’s eight ‘hottest hotspots’ of biological diversity. These mountains cover an area of approximately 140,000 km² along a 1,600 km long stretch, traversing the states of Kerala, Tamil Nadu, Karnataka, Maharashtra, Goa, and Gujarat. This region is one of the richest biodiverse hotspots and biosphere reserves identified by UNESCO. The WG region is of immense global importance for the conservation of biological diversity and endemism. This region encompasses a number of protection regimes, ranging from Tiger Reserves, National Parks, Wildlife Sanctuaries, and Reserved Forests. The forests of the WG include some of the world's best representatives of non-equatorial tropical evergreen forests. Around 325 globally threatened species (IUCN Red List) occur in the Western Ghats, of which 129 are classified as vulnerable, 145 as endangered, and 51 as critically endangered.

Leveraging Large Language Models for Automated Assessment and Mapping in Participatory Urban Planning

Mohammad Hasan Zali¹, Mohammad Ghanbarei Baba ahmadi², Meysam Argany³, Mir Abolfazl Mostafavi⁴

¹University of Tehran, Iran, Islamic Republic of; ²University of Tehran, Iran, Islamic Republic of; ³University of Tehran, Iran, Islamic Republic of; ⁴Center for Interdisciplinary Research in Rehabilitation and Social Integration, Université Laval, Québec (Qc), Canada

This research introduces an innovative platform designed to enhance citizen engagement in urban planning and management by integrating emerging technologies such as Artificial Intelligence (AI), Large Language Models (LLMs), and chatbots. Traditional Public Participation Geographic Information Systems (PPGIS) often face challenges in effectively capturing and analyzing citizen input. This platform addresses these limitations by enabling users to articulate urban issues or ideas in natural language, which are then processed through AI-driven Natural Language Processing (NLP) techniques to identify key elements such as location, issue type, and intensity. Furthermore, the platform facilitates interactive dialogues, allowing citizens to inquire about perspectives from other community members, thereby fostering a dynamic exchange of views. In the absence of an initial user base, a dataset comprising 2,000 tweets related to Montreal's public transportation was curated. An LLM was fine-tuned using this data, equipping the model to respond to queries concerning Montreal's public transportation system. The findings demonstrate the feasibility of leveraging AI and LLMs to create a responsive and interactive platform that not only streamlines data collection but also enriches the participatory planning process. This approach has the potential to transform urban governance by making it more inclusive and data driven.

Robust Alignment Learning under incorrectly- and weakly-correlated Relationships for Remote Sensing Image-Text Retrieval

Yuchen Sha¹, Yichi Jin², Cheng Peng³, Yujian Feng⁴, Shuai You⁵, Jing Sun², Fei Wu⁶, Yimu Ji⁷

¹Nanjing University of Posts and Telecommunications, China, People's Republic of; ²Nanjing University of Posts and Telecommunications, School of Computer Science and Technology; ³National University of Singapore, Department of Civil and Environmental Engineering; ⁴Jiangsu University of Technology,School of Computer Engineering; ⁵Wuhan University, School of Computer Science; ⁶Nanjing University of Posts and Telecommunications, College of Automation; ⁷Zhejiang University, State Key Laboratory of Blockchain and Data Security

Remote Sensing Image-Text Retrieval (RSITR) aims to retrieve target textual descriptions from the gallery images, and vice versa. RSITR faces the key challenge of establishing accurate alignment between two heterogeneous modalities. Existing methods typically assume that image-text pairs are semantically aligned, where each textual description corresponds to a single image. However, this assumption does not always hold because factual errors in textual descriptions lead to incorrectly-correlated relationships. Moreover, some samples exhibit weakly-correlated relationships, i.e., an image corresponds to multiple similar texts. These incorrectly- and weakly-correlated relationships hinder effective cross-modal alignment. To address these challenges, we propose the Robust Dual Embedding Alignment (RDEA) network, which improves the robustness of cross-modal alignment by jointly learning both instance-level and feature-level correspondence between image and text modalities. Firstly, we propose an Incorrectly-Correlated Feature Rectification (ICFR) module, which employs a dynamic margin-guided mechanism to adaptively balance original and auxiliary descriptions generated by a large language model, guiding the model to learn correct image-text correspondences at the instance-level. Secondly, a Weakly-Correlated Feature Decoupling (WCFD) module constructs modality-specific intermediate features via learnable distributions, which decouple overlapping semantics across modalities. These intermediate features enable the model to distinguish semantically similar texts, thereby establishing more discriminative and accurate image-text correspondences at the feature-level. We conduct extensive experiments on benchmark datasets, demonstrating that our approach outperforms state-of-the-art methods.

From BIM–SAR Fusion to API-Based Digital Twin Services for Building Deformation Monitoring

Chia-Hsiang Yang¹, Carsten Stemmler¹, Nils Wolf¹, Florian Lahn¹, Boris Idesman², Joana Miguéns², Oliver Buck¹

¹EFTAS Remote Sensing Transfer of Technology, Germany; ²Clarity AI UG, Darmstadt, Germany – EnviroTrust

This contribution presents an operational framework that advances BIM–SAR fusion into a commercial, API-based Digital Twin service for building deformation monitoring. Building on the BIMSAR research project, the system integrates multi-frequency MTInSAR results from Sentinel-1, TerraSAR-X, and PALSAR-2 with IFC-based BIM models to provide semantically structured deformation indicators for individual building components. Persistent and distributed scatterer analyses generate millimetre-scale deformation time series, which are stored in a harmonized database and exposed through a RESTful API that supports standardized queries for deformation values, risk metrics, and metadata. A pilot implementation in Ahlen, Germany, demonstrates the service’s interoperability with existing digital twin platforms and validates the workflow using previously established BIMSAR datasets. Developed jointly by EFTAS Remote Sensing and EnviroTrust, the system showcases the successful transition of research-driven BIM–SAR fusion methods into an operational, cloud-ready monitoring service supporting resilient building and infrastructure management.

TreeCLIP: Unsupervised Tree Species Classification via Multi-view CLIP Feature Fusion

Dening Lu¹, Jonathan Li^1,2

¹Department of Systems Design Engineering, University of Waterloo; ²Department of Geography and Environmental Management, University of Waterloo

Accurate tree species classification is fundamental to forest ecology, biodiversity monitoring, and sustainable resource management. However, large-scale species-level labeling in remote sensing remains challenging due to the need for expert annotation and the limited generalization of supervised models. This study introduces TreeCLIP, an unsupervised framework that adapts the CLIP vision–language model for ecological analysis through multi-view feature fusion.

TreeCLIP renders each individual tree point cloud into multiple orthogonal 2D projections that capture its geometric and morphological characteristics. CLIP’s pre-trained image encoder extracts visual embeddings from each view, which are then L2-normalized and fused into a unified multi-view representation. By applying clustering methods such as K-means and DBSCAN, TreeCLIP achieves species-level grouping without any manually defined textual prompts or labeled training data.

Experiments on multi-platform airborne laser scanning datasets from German forest stands demonstrate that TreeCLIP surpasses traditional machine learning approaches (e.g., Random Forest, SVM) and achieves accuracy comparable to supervised deep models. The results highlight CLIP’s capacity to generalize across domains and reveal the potential of foundation models for fine-grained ecological recognition.

TreeCLIP provides a scalable, annotation-efficient framework for large-scale forest inventory and vegetation monitoring, bridging the gap between general-purpose vision–language models and domain-specific ecological applications.

Interactive 3D Scene Segmentation for Construction Sites via Gaussian Splatting and Foundation Models

Wentao Sun¹, Dayou Mao², Zhengsen Xu³, Quanyun Wu¹, Jingxiang Li¹, Kyle Gao¹, Yiping Chen⁴, Yuhao Chen¹, John S. Zelek¹, Jonathan Li¹

¹University of Waterloo, Canada; ²National Research Council, Canada; ³University of Calgary, Canada; ⁴Sun Yat-sen University, China

Construction sites are complex, dynamic environments that demand accurate, real-time monitoring for progress and safety management. Traditional on-site supervision and image-based UAV monitoring often fall short in providing detailed and timely 3D information. Recent digital twin technologies offer virtual replicas of construction sites, but existing 3D reconstruction methods—typically relying on LiDAR or depth cameras—remain limited by high hardware costs, heavy energy consumption, and extensive manual annotation requirements.

This study investigates the feasibility of applying 3D Gaussian Splatting (3DGS) for 3D scene reconstruction and segmentation in digital twin–based construction monitoring. Leveraging only visual inputs, 3DGS enables high-fidelity modeling while avoiding costly hardware. Combined with foundation models such as the Segment Anything Model (SAM), it supports unsupervised or weakly supervised segmentation adaptable to continuously evolving site conditions. Moreover, integrating 3DGS with large vision–language models allows for interactive segmentation through clicks or natural language prompts, advancing toward intelligent and adaptive digital twins.

We evaluate several Gaussian-based segmentation algorithms on construction-related datasets, assessing their effectiveness in capturing structural details and object semantics. Results show that 3DGS-based methods achieve promising segmentation quality for simple geometric objects but face challenges in complex, cluttered environments. These findings highlight both the potential and current limitations of 3DGS in realizing fully automated, adaptive digital twins for smart construction management.

EarthDaily FM: A Change Detection and Forecasting Foundation Model for Daily Global Multi-Modal Imagery

Chris Rampersad, Seungwoo Lee, Jakob Leben, Peter McElroy, Rob Irwin, Rick Chern, Will Parkinson

EarthDaily, Canada

EarthDaily FM is a foundation model purpose-built for high-frequency change detection and short- to medium-range forecasting across global Earth Observation (EO) time series. It is designed around the forthcoming EarthDaily Constellation (EDC)—a systematic, near-daily mission with 22 VNIR, SWIR, and LWIR bands engineered for AI-ready analytics, high geolocation and radiometric accuracy, CEOS ARD compliance, and spectral compatibility with Sentinel-2 and Landsat. This design enables a single self-supervised model to fuse years of historical S2/Landsat data with new daily EDC observations, closing the temporal gap that constrains existing EO foundation models focused on static scene understanding.

Preliminary experiments using open and proxy datasets demonstrate the model’s capability for diverse forecasting tasks, including harvest date prediction, crop yield estimation, and soil moisture retrieval. Using VENµS imagery as a proxy for EDC’s cadence and 5-m resolution, the model achieves low median errors in harvest date prediction at 50–60-day lead times, while multimodal training with meteorological and radar inputs improves soil moisture estimation. The impact of incorporating EarthDaily Constellation data on forecasting accuracy and model generalization will be demonstrated as new observations become available.

EarthDaily FM represents a practical step toward operational, time-aware EO modeling—integrating optical, radar, and weather data to support forecasting in agriculture, water resources, and environmental resilience.

Improving Planet Fusion Surface Reflectance Gap-filling using Sentinel-1 Backscatter and AMSR-2 Brightness Temperature

Julie Nutini, Rasmus Houborg

Planet Labs PBC, San Francisco, California, USA

We propose an innovative method to improve the reliability of Planet Fusion surface reflectance during periods of extended cloud cover. Planet Fusion offers daily, 3 m, cloud-free data (RGB-NIR) by radiometrically harmonizing all available PlanetScope imagery using the CESTEM algorithm, which employs MODIS/VIIRS and FORCE data for correction, and then uses a spatially and temporally driven gap-filling algorithm to ensure spatial completeness. A critical weakness arises during prolonged cloudiness, where the certainty of Planet Fusion's gap-filled pixels diminishes.

The proposed research directly addresses this weakness by incorporating Sentinel-1 synthetic aperture radar and AMSR-2 brightness temperature data. Both Sentinel-1 and AMSR-2 operate in the microwave spectrum, guaranteeing data acquisition regardless of weather or light conditions. By fusing these multi-sensor, multi-modal datasets into the Planet Fusion workflow, we are able to improve the accuracy of gap-filled pixels during months-long periods of persistent cloud cover. This work not only seeks to increase the reliability of the Planet Fusion product, but also advances the field of multi-modal data fusion, highlighting its necessity for uninterrupted, observation-driven monitoring of land surface change from space.

Bridging Physical and Digital Spaces: Interfaces for Sensor Planning and Situated Analytics

Valerio Signorelli

UCL University College London, United Kingdom

This work presents the development of a web‑based interface designed to support both on‑site and remote exploration of environmental sensor deployments.

The growing accessibility and standardisation of IoT technologies have led to their adoption across diverse fields, including environmental studies, urban planning, architecture, agriculture, archaeology, and museum studies, yet shared challenges persist around planning, deployment, interpretation, and communication of sensor data. When multiple disciplines operate within the same test environment, their activities can affect one another, highlighting the need for interfaces that reduce disciplinary barriers and rely on spatially grounded visualisation rather than domain‑specific terminology. The system builds on principles of Situated Analytics, enabling data to be interpreted directly within its spatial or contextual setting while also supporting remote interaction through proxy representations of real‑world environments.

In this contribution, three modelling techniques, dense point cloud, 3D Tiles, and Gaussian Splatting, were generated from drone images and integrated into a Babylon.js platform. A WebAR application, developed with 8th Wall, allowed sensor locations to be placed in situ, with data visualised through a shared information layer using MQTT to stream live or simulated readings.

The results indicate promising developments for cross‑disciplinary knowledge exchange through accessible, device‑agnostic web tools. Ongoing work explores the improvements to point‑cloud handling, AR localisation accuracy, and the long‑term collection of historical environmental data.

A multi-scale attention and texture enhancement method for ancient mural inpainting

Bing Xia, Dandan HE, Chaokui Ning, Haifeng Hu, Aiwan Fan

PINGDINGSHAN UNIVERSITY, China, People's Republic of

To address the common deterioration of ancient Chinese murals—including pigment loss, texture blurring, and color fading—this paper proposes a deep learning-based approach integrating multi-scale attention and texture enhancement modules for high-fidelity virtual restoration. The model employs a multi-scale attention mechanism to maintain structural continuity and a dedicated texture enhancement module to recover fine details often lost in conventional methods. The restoration process consists of three stages: multi-scale feature extraction using partial convolutions, feature reconstruction that transfers statistical properties from intact regions, and a texture refinement module for detail completion. Evaluated on the Dunhuang mural dataset, the method outperforms existing techniques in PSNR, SSIM, and FID scores, producing visually coherent and stylistically consistent results. This approach offers a scalable and adaptable solution for digital conservation, supporting customizable restoration levels tailored to various degrees of damage.

AgriFM: A Multi-source Temporal Remote Sensing Foundation Model for Agriculture Mapping

Wenyuan Li, Shunlin Liang, Yuxiang Zhang

Department of Geography, University of Hong Kong, Hong Kong, China

Climate change and population growth intensify the demand for precise agriculture mapping to enhance food security. Such mapping tasks require robust modeling of multi-scale spatiotemporal patterns from fine field textures to landscape context, and from short-term phenology to full growing-season dynamics. Existing methods often process spatial and temporal features separately, limiting their ability to capture essential agricultural dynamics. While transformer-based remote sensing foundation models (RSFMs) offer unified spatiotemporal modeling ability, most of them remain suboptimal: they either use fixed windows that ignore multi-scale crop characteristics or neglect temporal information entirely. To address these gaps, we propose AgriFM, a multi-source, multi-temporal foundation model for agriculture mapping. AgriFM introduces a synchronized spatiotemporal downsampling strategy within a Video Swin Transformer backbone, enabling efficient handling of long and variable-length satellite time series while preserving multi-scale spatial and phenological information. It is pre-trained on a globally representative dataset comprising over 25 million samples from MODIS, Landsat-8/9, and Sentinel-2 with land cover fractions as pre-training supervision. AgriFM further integrates a versatile decoder specifically designed to dynamically fuse multi-source features from different stages of backbone and accommodate varying temporal lengths, thereby supporting consistent and scalable agriculture mapping across diverse satellite sources and task requirements. It supports diverse tasks including agricultural land mapping, field boundary delineation, agricultural land use / land cover mapping, and specific crop mapping (e.g., winter wheat and paddy rice) with difference data sources. Comprehensive evaluations show that AgriFM consistently outperforms the general-purpose RSFMs across multiple agriculture mapping tasks.

Digitizing Bamboo Scaffolding for Sustainable Construction: Structure-aware Mapping and Stock Analysis

Linghao Kong, Wen-Jun Cao

The University of Hong Kong, Hong Kong S.A.R. (China)

An AI-driven framework for structural identification and stock analysis of bamboo scaffold systems to enable lifecycle management for firms, regulators, and workers. The method addresses irregular geometry, dense packing, and occlusions through three components. First, Node-guided Pole Fitting detects bamboo nodes and poles; the Bamboo of Building dataset trains a neural network to generate a Node Candidate Set. Within each node’s bounding box, Line Segment Detector (LSD) extracts linear features; representative segments are clustered, connected, and curve-fitted to model a pole. Second, multi-view 3D reconstruction maps the scaffold; cross-image matching projects poles into a unified space, refining NCS into Real Node Set and Fake Node Set for reliable topology. Third, a digital model estimates member lengths/diameters to quantify stock and potential CO2 reductions.

Does remote sensing-based Solar-Induced Chlorophyll Fluorescence (SIF) data enable agricultural drought detection in Germany?

Fatemeh Saba, Nima Ahmadian, Olena Dubovyk

University of Hamburg (UHH), Institute of Geography, Germany

Agricultural drought is one of the most damaging natural hazards, causing ecological disruption, economic losses, and reduced crop yields. Recent extreme droughts in Central Europe, particularly after 2018, have underscored the need for reliable, spatially explicit drought monitoring. Traditional ground-based indices often fail to capture crop-specific physiological responses, while commonly used remote-sensing indicators, such as NDVI, are limited by soil background effects and saturation in dense vegetation. Sun-Induced Chlorophyll Fluorescence (SIF) directly reflects plant photosynthesis and responds sensitively to water and heat stress, making it a promising alternative for drought assessment.

Despite its potential, SIF-based drought monitoring remains largely unexplored in Germany. Most studies focus on specific regions or individual crops and rely on other remote-sensing indices rather than SIF. To fill this gap, this study evaluates whether multi-temporal SIF data can detect agricultural drought signals across Germany and how consistently these signals relate to crop yield anomalies. Using the Soil–Climate Regions (SCRs) of Germany as an ecologically meaningful spatial framework, we examine spatial correlations between SIF and yield across SCRs, and compare time-series SIF anomalies with average yield anomalies.

This research highlights the potential of SIF as an early and robust indicator of agricultural drought, offering insights for improved drought monitoring and crop management strategies in Germany.

A new training-, marker-, and calibration-free vision framework for structural 3D displacement measurement with UAV-oriented design

Qingyu Xian, Berend Jan van der Zwaag, Ozlem Durmaz Incel

Pervasive Systems Research Group, Faculty of EEMCS, University of Twente, Enschede, The Netherlands

Vision-based displacement measurement offers a promising pathway toward UAV-enabled structural monitoring, where contact-free, lightweight, and rapidly deployable sensing is essential. However, existing vision approaches typically estimate only 2D motion or require model training, artificial markers, or complex calibration, which hinders their applicability on real structures.

To address these limitations, this paper presents a new training-, marker-, and calibration-free vision-based framework designed with future UAV deployment in mind for structural 3D displacement measurement. Leveraging the reasoning capability of a state-of-the-art vision foundation model, the proposed method achieves millimeter-level 3D displacement accuracy without any scene-specific training, calibration, or fine-tuning.

To support rigorous evaluation, we establish a compact multi-modal dataset collected from two full-scale bridges, including synchronized stereo videos, accelerometer measurements, and an evaluation protocol. Experiments on real bridges demonstrate that the proposed framework delivers accurate, robust, and practical in-situ 3D displacement measurement under uncontrolled field conditions.

The system is inherently suited for airborne visual sensing, and integrating the framework with UAV-based data acquisition constitutes the next step of this research.

Integration of Crowd-Sourced Community and Cloud-Based Google-Earth-Engine Data for Spatiotemporal Mapping of Invasive Pests: A Case of Desert Locust Invasion in Kenya

Eliud Gachie Baraka^1,2, Guido D’Urso³, Oscar Belfiore³

¹Sapienza University of Rome, Italy; ²Ministry of Agriculture in Kenya; ³University of Naples Federico II

Invasive pests such as the desert locust are both detrimental to people and the environment. The desert locust is documented as one of the most destructive polyphagous plant pests. This study, about the integration of crowd-sourced field dataset and Google Earth Engine (GEE) satellite data, demonstrates how community-based initiatives and freely available cloud-based earth observation resources can be used to provide innovative, evidence-based and data-driven decision support insights that are of critical use to government agencies in desert locust crisis management. The study integrated 160,810 desert locust field survey records collected from January 2020 to December 2021, with vegetation and water indices time-series computed from Sentinel 2 bands B2, B3, B4, B8 and B11 on GEE. The results indicate that the peak of desert locust mature adult (67) and hopper (75) incidents coincided with the highest spectral index values in June 2020. However, the peak of desert locust immature adult (70) incidents in February 2021 coincided with low spectral index values. This means that spectral indices can be used to identify suitable breeding areas for desert locusts, but may not reliably identify all the areas where the pest might be present. Among the assessed indices, the Modified soil adjustment vegetation index (MSAVI) produced the best prediction with a β=0.703, t=6.983 and p=<0.001. The study concludes that, because Hotspot 1 denotes arid and semi-arid lands (ASAL), MSAVI would be the most suitable for monitoring desert locusts in this area, as the index accounts for soil brightness in the deserts.

GLARS - Remote sensing over the Great Lakes basin

Brian Huberty

SharedGeo, United States of America

This paper reviews the evolution, achievements, and future direction of remote sensing across the Great Lakes Basin (GLB), emphasizing the unique binational collaboration between the United States and Canada. Beginning with post–World War II aerial photography, remote sensing in the region rapidly expanded through pioneering work in forestry, water quality mapping, and early satellite-based observation. The formation of the Great Lakes Alliance for Remote Sensing (GLARS) marked a major step toward coordinated, cross-border environmental intelligence. Enabled in part by the Great Lakes Restoration Initiative (GLRI), GLARS brought together federal agencies, universities, and private partners to deliver high-resolution, multi-temporal products supporting natural resource management.

Key achievements include production of 2-meter digital surface models for the entire basin using petascale computing; integrated optical and SAR approaches for dynamic wetland mapping; multi-year RADARSAT-2 monitoring of seasonal wetland saturation; InSAR applications for water-level change detection; and successful classification of invasive species such as Phragmites australis using multi-sensor datasets.

Looking ahead, the paper identifies priorities such as harnessing new SAR missions (RCM, NISAR), expanding daily high-resolution multispectral monitoring, building fully automated analysis pipelines, and formalizing binational data-sharing systems. Continued integration of AI, cloud computing, and stakeholder-driven design is essential for climate-resilient management of the world’s largest freshwater system.

A National Application for assessing Rooftop Solar Potential in Israel

Elisha Goldstein, Eran Keinan, Gila Droshkowitz, Hagi Ronen, Nir Alush, Saar Ben-Zvi, Uri Dorozko

Survey of Israel, Israel

This work details the development of a comprehensive national assessment application for rooftop solar photovoltaic (PV) potential in Israel, designed to support the national target of 30% renewable electricity generation by 2030. Faced with limited land and increasing electricity demand, Israel's policy prioritizes PV installations on existing building rooftops.

The technological approach integrates solar radiation modeling, Deep Learning (AI) obstacle segmentation, GIS, and governmental data. The system utilizes advanced models incorporating DSM data, shading, and meteorological variables to calculate solar radiation. Crucially, multiple Convolutional Neural Network (CNN) models (U-net, Mask RCNN) were trained on high-resolution aerial imagery to accurately segment and deduct rooftop obstacles, such as existing PV systems, solar collectors, and vegetation, achieving over 95% IoU.

The final assessment feeds into a two-pronged system:

A Public Application allowing citizens and businesses to receive address-specific estimates of usable roof area, expected electricity production, and economic return on investment.

A National Management System and Dashboard for policymakers and local authorities, enabling spatial examination, progress monitoring, and data-driven strategy formulation (e.g., targeted encouragement campaigns).

This multi-level system, combining remote sensing, machine learning, and governmental data, provides an adaptable, data-driven framework for facilitating the renewable energy transition across all stakeholder levels.

VGGT-SLAM for 3D Reconstruction of Low-altitude Remote Sensing Data: Feasibility and Limitations

Quanyun Wu, Wentao Sun, Kyle Gao, Jingxiang Li, Yuhao Chen, Jonathan Li

University of Waterloo, Canada

Low-altitude remote sensing using unmanned aerial vehicles (UAVs) has become a crucial method for large-scale 3D reconstruction in various applications, including urban planning, environmental monitoring, and disaster management. However, due to issues such as proportion blurring, projection distortion, and failed loop closure, obtaining precise and dense 3D point cloud maps from monocular RGB cameras remains challenging. Recent advances in feed-forward 3D scene reconstruction, such as VGGT (Visual Geometry Grounded Transformer), which generates dense point clouds and camera poses from uncalibrated RGB images, offer potentially promising solutions. VGGT-SLAM extends this capability to large-scale scenes by aligning local submaps optimized on the SL(4) manifold, which addresses projective ambiguity that similarity transformations (Sim(3)) cannot resolve. The enhanced large-scale reconstruction capability of VGGT-SLAM is precisely what is needed for 3D reconstruction of remote sensing datasets. This study investigates the feasibility of applying VGGT-SLAM to UDD (Urban drone datasets) and highlights its limitations in real-world scenarios.

A Robust Two Stage LiDAR–Camera Extrinsic Calibration Framework via Monocular Depth Assisted Joint Optimization

Junhua Kang¹, Jiwen Jia¹, Haotian Wu¹, Yuan Tan²

¹College of Geological Engineering and Geomatics,Chang'an University, China,; ²Shanghai Algebra Rhythm Technology Co., LTD, China

Accurate LiDAR–camera extrinsic calibration is crucial for reliable multi-sensor fusion in robotics, autonomous navigation, and UAV photogrammetry. This study presents a robust two stage LiDAR–camera calibration framework that integrates geometric and monocular depth assisted information constraints within a unified joint optimization scheme.

In the initial stage, geometric features from both LiDAR and camera views are extracted and aligned via Singular Value Decomposition (SVD) to provide stable initialization. The refined stage introduces a hybrid optimization that combines spatial distance constraints with a Normalized Mutual Information Distance (NID) term between LiDAR-measured depth and monocular depth estimation (MDE) results. The deep learning–based MDE provides dense and metrically consistent depth maps, effectively bridging the modality gap between 3D point clouds and 2D images. This dual-constraint formulation enhances calibration robustness against LiDAR sparsity and texture deficiencies. Experimental evaluations using a circular calibration target demonstrate mean rotational errors below 0.3° and translational errors under 3 cm, surpassing traditional FastCalib methods. Qualitative visualizations further confirm precise alignment between LiDAR projections and image contours. The proposed framework eliminates the need for precise calibration targets and manual initialization, achieving automatic, high-accuracy extrinsic calibration adaptable to complex outdoor environments

A Machine-Learning Based Landslide Susceptibility Modelling and Runout Analysis Framework in the Nolichucky River Gorge of East Tennessee Following Hurricane Helene 

Grace Braver, Arpita Nandi

East Tennesseee State University, United States of America

Extreme rainfall from Hurricane Helene (September 2024) triggered widespread landslides across the southern Appalachian region, highlighting the need for rapid landslide susceptibility assessments that capture both landslide initiation and downstream runout. Traditional susceptibility models often focus solely on initiation zones, limiting their ability to identify which slopes will generate destructive landslides or where material will travel. This study addresses that gap by (1) integrating Geographic Information System (GIS)-based machine learning susceptibility modeling using ArcGIS Pro: Maximum Entropy (MaxEnt) and Random Forest-Based and Boosted Classification and Regression (FBBC) and (2) the U.S. Geological Survey (USGS) Grfin (Growth, Flow, and Inundation) runout toolbox. The study focuses on the Nolichucky River Gorge in eastern Tennessee and western North Carolina, where intense rainfall (4-20 in;10.1-50.8 cm) triggered numerous shallow landslides. Results provide a framework for emergency response along TN-107 and US-19W corridors, infrastructure vulnerability assessments, and hazard planning in Unicoi and Carter counties.

Automated building extraction from airborne laser scanning data on national scale – Slovenia's approach

Primož Kete¹, Katja Šušteršič², Marjana Duhovnik³, Andreja Švab Lenarčič³

¹Geodetic Institute of Slovenia, Slovenia; ²Flycom Technologies d.o.o., Slovenia; ³Surveying and Mapping Authority of the Republic of Slovenia, Slovenia

The Surveying and Mapping Authority of the Republic of Slovenia (GURS) carried out nationwide airborne laser scanning project (CLSS) between 2023 and 2025, with a minimum spatial resolution of ten points per square metre across the entire territory of Slovenia. In 2025, the project for automated building extraction from the acquired LiDAR data was initiated, with the objective of systematically processing approximately one third of Slovenia’s territory per year. The automatically extracted building data (2.5D building footprints and 3D building models) will serve as a fundamental topographic dataset, a key source for detecting and monitoring changes in the Real Estate Cadastre, and a foundational dataset for property valuation at scale. Moreover, this initiative represents a pivotal step towards the establishment of a geospatial digital twin of Slovenia. The production workflow is based on an integrated processing method that combines a classified LiDAR point cloud (GKOT) and True Orthophoto imagery (POF) from CLSS. The quality evaluation is conducted in accordance with the international standard ISO 19157 — Geographic Information — Data Quality.

Mapping Wildfire Risk under Future Climate Scenarios in Scania’s Forests, Sweden

Maya Soraya Ghadiri¹, Sadegh Jamali²

¹Department of Human Geography, Lund University, Sweden; ²Department of Technology and Society, Faculty of Engineering, Lund University, Sweden

Climate change is expected to significantly alter environmental conditions in southern Sweden, increasing the risk of natural hazards such as wildfires. This study assesses wildfire susceptibility in forest areas of Scania under projected climate conditions corresponding to the Representative Concentration Pathways RCP8.5 scenario. Using Geographic Information Systems (GIS) and a fuzzy multicriteria decision analysis (MCDA), climatic variables (temperature, precipitation, wind speed) and forest type data were integrated to generate a continuous fire risk map. Forest types were reclassified based on fire susceptibility, and fuzzy membership functions were applied to climatic variables, with a fuzzy gamma overlay (γ = 0.6) used to combine criteria. Results indicate that several coastal and fragmented forest areas exhibit high wildfire risk, while northern inland regions show relatively lower susceptibility. The fuzzy approach enables a nuanced representation of risk gradients, providing valuable spatial information for climate adaptation and hazard mitigation planning. Despite limitations in input data and parameter quantification, the produced map highlights priority areas for monitoring and management under future climate scenarios.

CO3D - Shaping the Future of Optical Earth Observation and Its Applications

Erwin Willem Johan Bergsma, Dimitri Lallement, Delphine Leroux, Jean-Baptiste Henry, Christian Hummer, David Youssefi, Stephanie Artigues, Laurent Lebegue, Olivier Queyrut, Sophie Le Gac

CNES, France

The debut of the Constellation Optique en 3D (CO3D) in July 2025 represented a significant advancement in Earth observation. This state-of-the-art satellite mission captures the earth in breathtaking three dimensions by using four satellites in a novel out-of-phase tandem arrangement that mimics mammalian vision. CO3D produces high-resolution Digital Elevation Models (DEMs) at one-meter grid spacing with previously unheard-of accuracy—one-meter relative height precision and four-meter absolute height precision.

Synchronous stereo imaging enables tracking of moving objects even in the dark, and each CO3D satellite provides 0.50-meter resolution images in the red, green, blue, and near-infrared bands. This innovative technology, which offers cutting-edge capabilities for coastal monitoring, disaster response, urban planning, and climate research, helps the scientific, defense, and civil communities equally.

Applications for CO3D are numerous, ranging from improving post-disaster evaluations and urban resilience to tracking glaciers and coastal erosion. CO3D enables governments, businesses, and researchers to tackle important issues with unmatched accuracy by offering almost worldwide 3D data. Welcome to the era of CO3D, the future of Earth observation.

Automatic mapping of marine oil slicks in SAR images: How can foundation models help tackle the lookalike challenge?

Julien Vadnais, Benjamin Aubrey Robson

University of Bergen, Norway

The oil slick look-alike challenge occurs when natural ocean phenomena reduce synthetic aperture radar (SAR) return in the same backscatter range as mineral oil. We revisit this challenge through the lens of geospatial foundation models (FMs), large neural networks which are a current frontier in automatic, deep learning-based mapping methods. In their benchmark evaluations, FMs promote state-of-the-art performance across a wide range of downstream tasks including segmentation. In contrast, our findings suggest that, in their current state, FMs do not outperform other neural network backbones for segmentation in an unconventional remote sensing modality such as SAR imaging of oceans. Surprisingly, backbones that were partly pretrained on SAR data do not show improved segmentation over those pretrained on natural images (here ImageNet). Rather than improving model backbones for segmentation, we argue that the breakthrough made by FMs may well lie elsewhere, such as in data management and pruning techniques. We make available the dataset used in our experiments, consisting of Sentinel-1 IW images annotated for semantic segmentation of oil slicks.

Foundation Models for improved live Fuel Moisture Content Estimation

Tony Boston, Marta Yebra

Australian National University, Australia

This study will evaluate whether the analysis-ready, global, cloud-free, annual, 10 m resolution embedding field layers of the Google AlphaEarth and Tessera foundation models can be used to improve estimation and prediction of biophysical variables such as live fuel moisture content, as well as contributing to an understanding of the global transferability of developed models to different regions.

High resolution earth observation quantifies insect-based biodiversity intactness across Africa

Tobias Landmann, Komi Agboka, Vincent Magomere

International Centre of Insect Physiology and Ecology (ICIPE, Kenya

Quantifying biodiversity intactness—a central indicator of ecosystem health and resilience—remains difficult across Africa due to scarce standardized baseline data and limited biodiversity monitoring. Traditional indicators based on vertebrates or vegetation provide only partial insights, as they respond more slowly to environmental change and have limited spatial coverage.

This study presents a novel, continent-wide framework that integrates multi-sensor Earth Observation (EO) data (Sentinel-2, GEDI, and TerraClimate) with extensive in situ insect occurrence records to derive an insect-based biodiversity intactness index (IBI). Insects, which dominate terrestrial biodiversity and respond rapidly to microclimatic and habitat changes, are used as sensitive ecological proxies for ecosystem condition. Their ubiquity and fine-scale environmental sensitivity make them particularly suited to detect patterns of habitat degradation and recovery that other taxa may overlook.

By coupling EO-derived indicators of vegetation structure, productivity, and climatic variability with insect diversity models, the framework provides spatially explicit, continuous estimates of ecosystem integrity across Africa. The resulting IBI fills a major information gap in biodiversity monitoring by offering a harmonized, scalable, and policy-relevant assessment tool.

The approach directly supports reporting needs under the Kunming–Montreal Global Biodiversity Framework (GBF) and African Union ecosystem restoration goals. It demonstrates how EO and biodiversity data integration can operationalize continent-wide monitoring of ecosystem condition—helping countries to track progress toward conservation and sustainable land-use targets through an ecologically grounded, insect-based lens.

Comparing deep and traditional machine learning models for countrywide classification of dominant tree species

Urša Kanjir¹, Ana Potočnik Buhvald², Aleš Marsetič^1,3, Mitja Skudnik^4,5

¹ZRC SAZU, Slovenia; ²University of Ljubljana, Faculty of Civil and Geodetic Engineering, Slovenia; ³Space-SI, Slovenia; ⁴University of Ljubljana, Biotechnical Faculty, Slovenia; ⁵Slovenian Forestry Institute,

Tree-species classification from multispectral remote sensing has advanced rapidly with the improved spatial and spectral capabilities of sensors such as Sentinel-2, enabling accurate discrimination of forest taxa across large areas. This paper deals with two approaches for tree species classification at the national scale using multi-temporal S2 imagery. We compare a machine learning algorithm (LightGBM) and a deep learning transformer-based model (ForestFormer) to classify dominant tree species in Slovenia based on seasonal characteristics. The resulting classifications are validated against National Forest Inventory datasets, provided by the Slovenian Forestry Institute.

BetaEarth: Embedding Sentinel-2 and Sentinel-1 with a little Help of AlphaEarth

Mikolaj Czerkawski

Asterisk Labs, London, United Kingdom

This work explores the practicalities of emulating a closed-source Earth embedding AI model from a large set of its pre-computed outputs. It also demonstrates how behaviour of a multi-modal multi-temporal embedding dataset can be probed using individual observational inputs. The framework is tested using Major TOM Core datasets with Sentinel-2 and Sentinel-1 data and an existing global dataset of AlphaEarth Foundations embeddings.

Exploring the temporal transferability of AlphaEarth satellite embedding for land cover classification

Yawogan Jean Eudes Gbodjo¹, Dino Ienco²

¹VTT Technical Research Centre of Finland, Finland; ²INRAE, UMR TETIS, INRIA, EVERGREEN, University of Montpellier, France

In an ever-changing global context, accurate and up-to-date land use and land cover (LULC) information becomes critical to understanding the dynamics of the Earth surface and managing natural resources. Nowadays, a common workflow for LULC classification involves training a supervised machine learning model using satellite image time series (SITS) and a collection of ground truth (GT) samples. Unfortunately, GT data are not always available across years due to costly and time consuming field campaigns or restrictions on field access. For this reason, the possibility of transferring a model learned on a particular year (with GT data available) to another mapping year (without GT data) has received traction, recently. To cope with such temporal transfer scenario, unsupervised domain adaptation (UDA) has been considered in order to address possible data distribution shifts originating from different acquisition conditions affecting mapping years. In recent years, self-supervised learning has emerged as a promising paradigm to mitigate the reliance on large amounts of GT data through the learning of general purpose and robust feature representations, enabling the development of geospatial foundation models (GFM) in Earth observation. GFM, trained on large volume of multi-modal geospatial data can provide embeddings that encode rich spatio-temporal, spectral, and semantic information. A notable example is AlphaEarth satellite embedding, released lately on a global scale and annual basis for the seven past years. In this study, we propose to evaluate its potential for temporal transfer scenarios in LULC classification, using a multi-year open dataset collected in Burkina Faso, West Africa.

UniTS: Unified Time Series Generative Model for Earth Observation

Yuxiang Zhang, Shunlin Liang, Wenyuan Li

University of Hong Kong, Hong Kong S.A.R. (China)

One of the primary objectives of Earth observation is to capture the complex dynamics of the Earth system using satellite image time series. This process encompasses tasks such as reconstructing continuous cloud-free image sequences, identifying changes in land cover types, and forecasting future surface evolution. However, existing methods typically require specialized models tailored to different tasks, lacking unified modeling of spatiotemporal features across multiple time series tasks. In this paper, we propose a Unified Time Series Generative Model (UniTS), a general framework applicable to various time series tasks, including time series reconstruction, time series cloud removal, time series semantic change detection, and time series forecasting. Based on the flow matching generative paradigm, UniTS constructs a deterministic evolution path from noise to targets under the guidance of task-specific conditions, achieving unified modeling of spatiotemporal representations for multiple tasks. Furthermore, we construct two high-quality multimodal time series datasets, TS-S12 and TS-S12CR, filling the gap of benchmark datasets for time series cloud removal and forecasting tasks. Extensive experiments demonstrate that UniTS exhibits exceptional generative and comprehension capabilities in both low-level and high-level time series tasks. More details can be found on the project page: https://yuxiangzhang-bit.github.io/UniTS-website/

Mapping Cocoa Mosaic Landscape in Ghana using High Resolution Remote Sensing Data and Machine Learning Models

Matthew Abutunghe, Booker Ogutu, Jadu Dash

University of Southampton, United Kingdom

Advancements in remote sensing technologies and spatial data analytics have continued to transform how we map and monitor landscapes, including urban and agroforestry systems. Land use and land cover (LULC) analysis provides useful insights for sustainable land management, especially for agricultural stakeholders. Cocoa production is an agricultural system that benefits greatly from appropriate land use management. The system provides economic stability for millions of households worldwide through job creation, livelihoods, and raw materials for confectionery industries. However, its sustainability faces growing threats from environmental and socioeconomic challenges, such as climate change, land use conflicts, and extensive deforestation. One serious threat to cocoa production, particularly in West Africa (which supplies over 70% of the world's cocoa), is the widespread occurrence of the cocoa swollen shoot virus, among other pests and diseases that substantially decrease annual yields.

Therefore, accurate and current maps of cocoa farms are required for managing deforestation, supporting disease monitoring, and guiding climate-resilient agricultural strategies in the region. Previous efforts in mapping cocoa landscapes with remote sensing have not achieved the desired results, partly due to their spectral similarity to forests and shrublands, especially where they are part of agroforestry systems. This study aims to overcome this challenge by developing a robust methodology for detecting full-sun cocoa plantations using high-resolution satellite imagery and machine learning techniques for sustainable land utilisation.

A Multi-Modal Feature Fusion Framework for Pattern Classification of Cultural Relic Textiles

DANdan HE, Bing Xia, Hong Li, Haifeng Hu, Aiwan Fan

PINGDINGSHAN UNIVERSITY, China, People's Republic of

This research addresses the challenges in classifying patterns of textile cultural relics by developing a multi-modal feature fusion approach. Current methods struggle with fine-grained classification and cultural-period analysis due to fragmented data and insufficient feature integration. The proposed framework integrates high-resolution images, historical documents, and spectral data through Vision Transformers and BERT models, enhanced by a Feature Enhancement Fusion Module. Validation on Han and Tang dynasty textiles demonstrates 3-5% accuracy improvement in fine-grained classification while maintaining model size under 300MB. This research establishes a new paradigm for digital heritage preservation, enabling precise pattern recognition and cultural evolution analysis with practical applications in museums and digital curation.

Impact of Personal Laser Scanning Schemes on the Estimation Accuracy of Individual Tree Attributes in Lowland Pedunculate Oak (Quercus robur L.) Forest

Andro Kokeza¹, Albert Seitz¹, Damir Medak², Ivan Balenović¹

¹Division for Forest Management and Forestry Economics, Croatian Forest Research Institute, Cvjetno naselje 41, HR-10450 Jastrebarsko, Croatia; ²Faculty of Geodesy, University of Zagreb, Kačićeva 26, HR-10000 Zagreb, Croatia

This study examines the impact of various personal laser scanning (PLS) schemes on the accuracy of individual tree attribute estimation in lowland pedunculate oak (Quercus robur L.) forests in central Croatia. Using a FARO Orbis PLS system, three scanning schemes were tested on sample plots with different densities: (i) a walking scheme with a planned trajectory, (ii) a static flash-scanning scheme with multiple fixed positions, and (iii) a combined scheme integrating walking and static scans. For each plot, multi-scan terrestrial laser scanning (TLS) was first conducted and used as a reference for diameter at breast height (DBH) and tree height (H). All PLS point clouds were processed using a consistent workflow, which included filtering, normalisation, individual-tree segmentation, and attribute estimation, and then compared against TLS-derived values. Preliminary results indicate that, although the static scheme yields denser point clouds and higher measurement precision, it does not consistently improve DBH and H accuracy compared to the walking scheme and can even increase errors in denser plots. The combined scheme performs similarly to the walking scheme. These findings indicate that well-designed walking-based PLS schees can provide accurate, operationally efficient estimates of individual-tree attributes in structurally complex deciduous stands, supporting wider adoption of PLS in forest inventory practice.

IMU propagation as preintegration

Jianzhu Huai

Wuhan University, China, People's Republic of

Despite its popularity, IMU preintegration is often perceived as requiring a dedicated implementation that is separate from conventional IMU propagation. In practice, however, many codebases already contain a reliable propagation module, often tied to a particular state or error-state definition. This raises two practical questions. First, does adopting IMU preintegration require reimplementing the IMU model from scratch? Second, how can one validate that a preintegration implementation, especially its bias Jacobians and covariance, is correct?

This note shows that IMU preintegration and IMU propagation can be viewed as two equivalent realizations of the same underlying computation. We first describe both in a way that is not tied to a particular perturbation convention. We then show that the preintegrated measurement, its Jacobian with respect to the initial IMU bias, and its covariance can all be obtained by wrapping an existing IMU propagation routine. Conversely, a preintegration module can be used to recover state-transition matrices and propagated covariances. This view also clarifies how to adapt preintegration across different error-state definitions without re-deriving bias Jacobians and residual covariances from scratch.

We validate the analysis by converting an RK4-based IMU propagation implementation to and from the GTSAM preintegration modules. In experiments with random IMU sequences, the recovered Jacobians, covariances, and transition matrices closely match those produced by GTSAM's tangent and manifold preintegration. These results suggest that a robust propagation implementation can serve both as a simple path to preintegration and as a practical reference for validating preintegration code.

Evaluation of Two QSM Reconstruction Methods for Tree Volume Estimates using PLS Data

Albert Seitz¹, Andro Kokeza¹, Krunoslav Indir¹, Damir Medak², Ivan Balenović¹

¹Croatian Forest Research Institute, Cvjetno naselje 41, HR-10450, Jastrebarsko, Croatia; ²Faculty of Geodesy – The university of Zagreb, Kačićeva 26

Accurate information on tree structure is fundamental for forest management, biomass estimation, and carbon accounting. Personal Laser Scanning (PLS) has recently emerged as an efficient method for capturing detailed three-dimensional representations of trees under operational field conditions. At the same time, Quantitative Structure Models (QSMs) have become an important tool for deriving structural attributes such as diameter at breast height (DBH), tree height, and total tree volume directly from point cloud data. Despite increasing use of these approaches, systematic comparisons of different QSM reconstruction methods applied to PLS data remain limited.

This study evaluates two QSM workflows, PyTLidar and AdQSM, using PLS point clouds collected for pedunculate oak and European beech trees in leaf-off conditions. Data were acquired with the FARO Orbis system using both continuous mobile scanning and stationary flash scans, enabling the creation of mobile-only, flash-only, and combined point cloud variants. After preprocessing and single-tree extraction, each tree cloud was reconstructed separately with both QSM approaches. Key structural attributes were derived from each reconstruction to assess how the methods differ in estimating tree volume.

The comparison employs statistical measures that quantify natural variability among trees relative to variability introduced by each workflow. This allows the study to identify situations in which the two QSM methods produce consistent results and where their outputs diverge. The findings will support improved understanding of QSM behaviour when applied to PLS data and contribute to ongoing efforts to strengthen digital tree modelling for forest monitoring and ecological applications.

Optimization of LIDAR Point Size to Simulate Shortwave Radiation in Savanna Canopies

Cameron Proctor¹, Sarah Baker¹, Ethan Sutts¹, Pracithaa Vijayakumar¹, Bin Wang²

¹University Of Windsor, Canada; ²State Key Laboratory for Vegetation Structure, Function and Construction, Yunnan University, Kunming, China

LIDAR point clouds combined with canopy-light extinction software can provide 2D simulations of shortwave radiation to identify crucial microclimates that control the overall water balance in savanna ecosystems. However, the point size necessary to accurately depict the wide range of tree species and forms that temperate savannas contain is largely unknown. To determine the optimal point size, hemispherical canopy imagery and field measured insolation will be compared to synthetical hemispherical imagery derived from LIDAR point clouds at different point sizes. The optimal point size will be validated against FLApy predictions and Hobo MX2022 measured illumination across 20 sample plots. The index of agreement between observed and predicted values will quantify systemic biases. Accurate point size is needed to assess tree removal scenarios and equip ecologists with the tools needed to understand the long-term implications for tree removal choices and how to best restore the tree canopy for long-term savanna resilience.

ProbGLC: A Generative Cross-view Geolocalization Approach for Rapid Disaster Response

Hao Li¹, Steffen Knoblauch²

¹National University of Singapore, Singapore; ²Heidelberg University, Germany

As Earth’s climate changes, it is impacting disasters and extreme weather events across the planet. Record-breaking heat waves, drenching rainfalls, extreme wildfires, and widespread flooding during hurricanes are all becoming more frequent and more intense. Rapid and efficient response to disaster events is essential for urban climate resilience and sustainability. A key challenge in disaster response is to correctly and quickly identify diaster locations for timely decision-making and resources allocation. In this paper, we propose a Probabilistic Cross-view Geolocalization approach, called ProbGLC, exploring new pathways towards generative location awareness for rapid disaster response. Herein, we combine the probabilistic and deterministic geolocalization models into a unified framework to simultaneously ensure model explainability and state-of-the-art geolocalization performance. Designed for rapid diaster response, the ProbGLC is able to address cross-view geolocalization across multiple diaster events as well as to offer unique features of model explainability and uncertainty quantification.

Date: Saturday, 11-July-2026

8:30am - 10:00am

ThS21: The Global-local Exchange Loop: Coupling Earth Observation and Citizen Sciences for LCLU Mapping
Location: 713A

8:30am - 8:45am

OntoLULC-SOTA: An ontology based approach to make systematic reviews for LULC data

Martin Cubaud¹, Ana-Maria Olteanu-Raimond¹, Cidalia C. Fonte^2,3, Diogo Duarte^2,4, Jacinto Estima⁵, Linda See⁶, Nicolas Gonthier¹, Laurence Jolivet¹, Clément Mallet¹, Arnaud Le Bris¹, Vyron Antoniou⁷

¹Univ Gustave Eiffel, Géodata Paris, IGN, LASTIG, F-77454 Marne-la-Vallée, France; ²Institute for Systems Engineering and Computers at Coimbra (INESC Coimbra), 3030-290 Coimbra, Portugal; ³University of Coimbra, Department of Mathematics, Apartado 3008, EC Santa Cruz, 3001-501 Coimbra, Portugal; ⁴Department of Electrical and Computer Engineering, Polo 2, 3030-290 Coimbra, Portugal; ⁵University of Coimbra, CISUC, Department of Informatics Engineering, Rua Sílvio Lima, 3030-290 Coimbra, Portugal; ⁶International Institute for Applied Systems Analysis (IIASA), Laxenburg, Austria; ⁷Hellenic Army Geographical Directorate, 15561 Cholargos, Greece

Land Use (LU) and Land Cover (LC) data allow us to understand the physical and human activities associated with a given land. Thus, LULC is a dynamic and highly researched field. LULC review papers are numerous and provide high-level insights about the proposed approaches, the data used, the study cases, the strengths and limitations, and the identification of new research gaps. Nevertheless, these reviews are not systematic and reproducible. The goal of this work is to propose an ontology to help the research community conduct systematic and shareable literature reviews and comparable analytical analyses of scientific papers. To achieve this, we formalize their metadata, content, strengths, and weaknesses. In particular, we consider the scientific paper as the central element of our ontology and we define formal semantics for all relevant items (data process, LULC life cycle and scientific paper). We hope to open the path to more efficient synthesis, discovery, and reuse of research outcomes from the literature. To facilitate the instantiation process and make it accessible to a broader range of researchers, we designed a tabular-based template. We used our template to simulate the process of conducting a literature review on three use cases: building function, global land cover mapping, and multi-class change detection.

8:45am - 9:00am

Manual Annotations meet Fine-Tuned Foundation Models: a Comparison on Tree Crown Segmentation Task

Rewanth Ravindran, Janik Steier, Samer Karam, Dorota Iwaszczuk

Technical University of Darmstadt, Germany

Accurate segmentation of individual tree crowns (ITCs) from remote-sensing imagery is essential for forest monitoring and ecological analysis, yet remains challenging due to overlapping canopies and structural variability. The Segment Anything Model (SAM) shows strong generalization capabilities but requires effective prompting and domain adaptation for remote sensing applications. In this study, we investigate a lightweight fine-tuning strategy using Low-Rank Adaptation (LoRA) to adapt SAM for ITC segmentation on the BAMFORESTS dataset. The impact of different prompting strategies is evaluated, including manually annotated point and bounding box prompts, as well as automatically generated bounding boxes derived from a pre-trained tree detector. SAM is fine-tuned with instance-level ITC masks, enabling prompt-aware segmentation of multiple tree crowns per image. Performance is assessed before and after fine-tuning using standard instance segmentation metrics, including IoU and F1-score. Results show that LoRA-based adaptation improves mask delineation and robustness to prompt variability, with bounding box prompts consistently outperforming point-based inputs. Automatically generated prompts enable a fully automated workflow, although their effectiveness depends on detection quality. Evaluation on an independent validation site with manually annotated ITC labels shows that the fine-tuned LoRA-SAM model achieves performance comparable to manual annotations, while significantly reducing annotation effort. These findings highlight the importance of prompt design in adapting foundation models for remote sensing tasks and demonstrate that parameter-efficient fine-tuning provides a practical pathway toward scalable ITC segmentation.

9:00am - 9:15am

Evaluation of the IGN FLAIR-HUB Model Transferability Performance for Land Cover Mapping in Iasi, Romania

Ana-Maria Loghin¹, Loredana-Mariana Crenganis¹, Constantin Stoian², Ana-Maria Olteanu-Raimond², Anatol Garioud², Valeria-Ersilia Oniga¹, Bogdan Rusu¹

¹quot;Gheorghe Asachi" Technical University of Iasi, Romania; ²Univ. Gustave Eiffel, IGN-ENSG, LaSTIG – Saint-Mande, France

This research rigorously evaluates the transferability of the pre-trained FLAIR-HUB deep learning model, developed by the French National Institute of Geographical and Forest Information (IGN), in terms of spatial generalizability and multi-resolution robustness, when transferred from its native French domains to the complex urban-agricultural landscape of Iasi, Romania.

The core objective of this investigation is to test the model's performance stability across severe multi-resolution domain shifts and temporal scenarios. The model architecture is applied to orthophotos acquired over Iasi in 2019 (at 0.5 m resolution) and 2024 (at 0.2 m and at a very high resolution of 0.084 m), enabling a comprehensive assessment of cross-resolution and temporal robustness.

A novel validation framework is introduced, combining conventional 2D raster-based evaluation with a 3D point-wise assessment using semantically labeled UAV-derived point clouds. The results demonstrate strong performance for dominant classes such as buildings and herbaceous vegetation, with improved accuracy at higher spatial resolution, while stable classes such as buildings and impervious surfaces show a comparatively robust performance, confirming the model’s capability to consistently represent invariant land cover types. However, performance decreases for heterogeneous and vegetation-related classes due to seasonal variability and class complexity. The 3D validation reveals slightly lower but consistent results, highlighting its role as a more rigorous evaluation approach. Overall, the study confirms the potential of transferring pre-trained semantic segmentation models to new geographic contexts, while emphasizing the importance of spatial resolution, temporal consistency, and validation strategy.

9:15am - 9:30am

Towards efficient Giant Tree Inventories: Deep Learning with crowdsourced Training Data

Yu-Hui Wang¹, Chi-Kuei Wang¹, Chung-Cheng Lee¹, Rebecca Chia-Chun Hsu²

¹Dept. of Geomatics, National Cheng Kung University, Chinese Taipei; ²Forest Ecology Division, Taiwan Forestry Research Institute, Chinese Taipei

Airborne Laser Scanning (ALS) data have been used to identify giant trees in Taiwan, yet current workflow included volunteers to visually inspect ALS profile images. This study proposed to replace the volunteer-based verification step by applying deep learning to ALS profile images. Candidate treetop locations were first extracted from a Canopy Height Model (CHM) using a 65 m threshold and local maxima filtering. For each candidate, a representative ALS profile image was generated following an automated angle-selection method based on terrain fitting.

An EfficientNetV2-S model was trained using volunteer-labelled profile images from previous nationwide surveys. After label cleaning, a refined dataset was constructed, and a hybrid resampling strategy was applied to address class imbalance. The final model achieved 99.0% overall accuracy, 98.1% precision, and 100% recall on the independent test set, successfully detecting every true giant tree.

To evaluate generalization, the model was applied to 97,487 candidates from the latest national ALS survey. Predictions exhibited a strongly bimodal confidence distribution, demonstrating stable between true and false positives and effectively reducing the manual inspection workload.

This study shows that deep learning can reliably replace crowdsourced verification, enabling scalable, supporting efficient updates of large-scale forest inventories.

9:30am - 9:45am

The Global-Local loop: what is missing in bridging the gap between geospatial data from numerous communities ?

Clément Mallet, Ana-Maria Olteanu-Raimond

Univ Gustave Eiffel, IGN, Géodata Paris, LASTIG, France

We face a unprecedented amount of geospatial data, describing directly or indirectly the Earth Surface at multiple spatial, temporal, and semantic scales, and stemming from numerous contributors, from satellites to citizens. The main challenge in all the geospatial-related communities lies in suitably leveraging a combination of some of the sources for either a generic or a thematic application. Certain data fusion schemes are predominantly exploited: they correspond to popular tasks with mainstream data sources, e.g., free archives of Sentinel images coupled with OpenStreetMap data under an open and widespread deep-learning backbone for land-cover mapping purposes. Most of these approaches unfortunately operate under a "master-slave" paradigm, where one source is basically integrated to help processing the "main" source, without mutual advantages (e.g., large-scale estimation of a given biophysical variable using in-situ observations) and under a specific community bias.

We argue that numerous key data fusion configurations, and in particular the effort in symmetrizing the exploitation of multiple data sources, are insufficiently addressed while being highly beneficial for generic or thematic applications. Bridges and retroactions between scales, communities and their respective sources are lacking, neglecting the utmost potential of such a "global-local loop". In this paper, we propose to establish the most relevant interaction schemes through illustrative use cases. We subsequently discuss under-explored research directions that could take advantage of leveraging available data through multiples scales and communities.

8:30am - 10:00am

ByA2: ISPRS Best Young Author Award Papers
Location: 713B

Practical Implementation and Adaptation of Rainforest-Based Inter-calibration for ESCAT-ASCAT Scatterometer Data Records

Clay Taylor Harrison¹, Sebastian Hahn¹, Roland Lindorfer¹, Thomas Melzer¹, Raffaele Crapolicchio², Wolfgang Wagner¹

¹TU Wien, Austria; ²Serco Italia SpA - for European Space Agency, Rome, Italy

C-band scatterometers have been collecting radar backscatter data since 1991, providing valuable long-term records for environmental monitoring applications such as soil moisture and vegetation dynamics. However, differences in sensor calibration between missions introduce biases that compromise the continuity of these data records. This paper presents the practical implementation and adaptation of Reimer's (2014) rainforest-based inter-calibration approach for ESA's ERS satellites (ESCAT) and MetOp/ASCAT instruments. We implement the method as a modern, open-source Python framework and apply it to the newly complete ERS data record (including ERS-1 data not available in the original study). The resulting calibrated backscatter data record will enable improved long-term monitoring of land surface dynamics with reduced mission-to-mission variability in bias and slope response over incidence angle.

Impact of geometric priors: advanced fine-grained airplane detection with geometric details in high-resolution satellite images

Tobias Traiser, Hai Huang, Helmut Mayer

Universität der Bundeswehr München, Germany

Improved availability and quality of high-resolution satellite imagery allow for reliable airplane detection. Yet, fine-grained classification, especially of commercial airliners, remains a formidable challenge. Besides common difficulties, such as varying image

artifacts and occlusions, the main challenge lies in the strong visual similarity between airliner families. This paper presents a geometry-aware classification that enhances oriented object detectors by integrating absolute measures and geometric features – fuselage length, wingspan, wing sweep angle, engine count, and fuselage width – in the form of priors into a Bayesian maximum a posteriori (MAP) estimation. The proposed pipeline is detector-agnostic by updating class posteriors without retraining the main detector. On the Gaofen Challenge dataset, it results in consistent improvements based on untuned baseline detectors, which outperform the top scores of the sophisticated fine-tuned models. An oracle experiment reveals the potential of the approach with an upper limit of the overall mean Average Precision of up to 0.96 and 0.98 for Gaofen and SuperView data, respectively. Furthermore, the impact of the employed geometric attributes is quantitatively evaluated.

Query2Property: Semantic retrieval of IFC properties for natural language BIM queries

Rabindra Lamsal, Sisi Zlatanova

University of New South Wales, Australia

IFC models store detailed building information, but their complex schema and deeply nested property sets make querying difficult for non-expert users and challenging for large language models (LLMs) to handle directly. Current LLM-based approaches are inefficient because prompts often include entire IFC schemas, many properties of which are irrelevant to the user’s query, leading to higher inference costs and potential errors. This paper presents Query2Property, a semantic retrieval system that maps natural language queries to the most relevant IFC properties. By embedding both property descriptions and user queries in a shared vector space, the system retrieves contextually relevant properties for dynamic and concise prompt construction in LLM-driven workflows. Evaluation on 55 representative BIM queries achieves a top-1 accuracy of 87.3% and top-3 accuracy of 100%, demonstrating effective alignment with user intent. Query2Property simplifies LLM-based workflows over BIM data, supporting semantic search and natural language exploration of complex building information.

Domain-Adaptive Object Detection for Enriching Semantic 3D City Models with Building Storeys from Street-View Images

Lukas Arzoumanidis, Al Maimun As Samee, Elmehdi Kanna, Huynh Duc An Son Nguyen, Youness Dehbi

HafenCity University Hamburg, Computational Methods Lab, Germany

Semantically rich 3D city models play a vital role in a variety of applications, such as urban planning. Enhancing these models with currently unavailable attributes, such as building storey numbers, can unlock new opportunities to address pressing challenges, including sustainable urban development. In this work, we present an end-to-end pipeline for the automatic estimation of the number of storeys to semantically enrich 3D city models. We employ volunteered geographic information street-view imagery

from Mapillary, using a COCO-pretrained object detection model to identify windows in façade images as key visual indicators for inferring building storey counts. Our detection pipeline, based on the YOLOv3 architecture, estimates storey numbers using an ensemble of clustering methods including Gaussian Mixtures and DBSCAN and enables the automatic augmentation of CityGML-based 3D city models by filling in missing attributes. This enrichment supports advanced applications, such as assessing building-scale energy demand, evaluating vertical urban growth patterns or population density estimations. We validated the feasibility of our approach with unfiltered Mapillary and applied it to a district in the city of Heidelberg, Germany. The paper also includes a detailed discussion of learning process quality, integration workflows, and visualization of the enriched 3D city model. The developed code is available at: https://github.com/hcu-cml/citydb-buildingstoreys-ai.

8:30am - 10:00am

WG III/7C: Remote Sensing of the Hydrosphere and Cryosphere
Location: 714A

8:30am - 8:45am

Spatial and Temporal Constraint One-Step Estimation of Terrestrial Water Storage Anomalies from GRACE/-FO Monthly Gravity Field Models

Yunzhong Shen, Lin Zhang

Tongji University, China, People's Republic of China

This study introduces a Spatial constraint One-step Approach (SOA) and Temporal constraint One-step Approach (TOA) to improve the estimation of Terrestrial Water Storage Anomalies (TWSA) from GRACE and GRACE-FO satellite data. Traditional three-step or two-step post-processing methods sequentially apply spectral filtering and leakage correction, often causing signal attenuation, spatial leakage, and reliance on external models.

In contrast, the proposed one-step framework simultaneously estimates all signal components—including trends, seasonal cycles, and non-seasonal signals (NSS)—directly from unfiltered TWSAs within a region of interest. It incorporates full error covariance and models NSS using spatiotemporal constraints: TOA employs a Multi-Order Gauss-Markov process for temporal correlation, while SOA uses spatial covariance functions and a buffer zone to reduce boundary effects. Tikhonov regularization ensures solution stability.

Validation across major river basins and regions like Southeastern China shows that SOA/TOA outperforms conventional filters (e.g., DDK, IPF), reducing errors and improving agreement with mascon products and climate indices. The method also better identifies hydrological extremes (e.g., droughts, floods) and links them to climate drivers like ENSO, enhancing the monitoring and understanding of global water storage dynamics.

8:45am - 9:00am

Predicting groundwater dependent ecosystem habitats in boreal Alberta, Canada using remote sensing and machine learning modelling

Michael Merchant¹, Michael Wendlandt², Jennifer Hird¹, Andrew Underwood², Cris Gray¹, Marshall McKenzie², Emily Herdman², David Evans¹, Jacqueline Dennet¹, Murdoch Taylor¹, David Roberts^1,2, Cynthia McClain¹

¹Alberta Biodiversity Monitoring Institute; ²InnoTech Alberta

Groundwater dependent ecosystems (GDEs) are sustained by direct or indirect access to groundwater, relying on its flow or chemistry for their water needs. These ecosystems span aquatic, terrestrial, and subterranean realms, providing critical ecological functions, maintaining water quality, and supporting biodiversity and Indigenous land use. In Alberta’s boreal region, GDEs are abundant yet remain poorly mapped, limiting understanding of their extent and sensitivity to industrial development and hydrological change. Developing consistent, spatially explicit mapping tools is therefore essential for effective monitoring and management. This research develops and evaluates a remote sensing and machine learning (ML) framework for predicting GDE habitats across boreal Alberta, Canada, as part of a broader provincial effort toward consistent, high-resolution GDE mapping. Multi-sensor Earth observation and geospatial datasets were integrated using ensemble ML modelling to identify groundwater-dependent habitats. Specifically, the study aimed to (1) evaluate the performance of multiple ML algorithms and ensemble approaches for GDE prediction, (2) assess whether aquatic and terrestrial GDEs can be effectively modelled within a unified framework, and (3) identify the most influential environmental and remote sensing variables driving GDE occurrence. The resulting model ensemble achieved high predictive accuracy (AUC = 0.90), with wetland and hydrological variables emerging as dominant predictors. The approach provides a scalable, transferable methodology for regional GDE mapping to support groundwater management, ecosystem monitoring, and cumulative effects assessment across northern Alberta.

9:00am - 9:15am

Enhancing supraglacial lake segmentation with hydrological features and FiLM-based two-stream U-Net

Sehoon Oh, Jiwan Hong, Geonu Park, Daegeon Woo, Joon Heo

Yonsei University, Korea, Republic of (South Korea)

This study presents a hydrology-informed deep learning framework for supraglacial lake segmentation on the Greenland Ice Sheet using Sentinel-2 imagery. Traditional approaches to lake mapping rely primarily on spectral cues, which often struggle in regions with weak contrast, shadowing, or surface melt variability. To address these challenges, we incorporate physically meaningful hydrological features—flow accumulation, distance-to-drainage, and surface depressions—derived from high-resolution DEMs to guide the segmentation process. The proposed FiLM-based two-stream U-Net consists of an RGB stream for spectral–textural representation and a hydrology stream encoding surface meltwater routing patterns. Feature-wise linear modulation is applied at multiple levels of the RGB encoder–decoder to dynamically condition spectral features on hydrological context and improve spatial coherence.

Experiments on the SIGSPATIAL 2023 GISCUP dataset demonstrate that this architecture improves segmentation accuracy over a Sentinel-2-only baseline and a simple channel-concatenation model, particularly for small, fragmented, or spectrally ambiguous lakes. The combined use of hydrological cues and deep feature modulation reduces false positives in regions where meltwater is unlikely to accumulate and strengthens delineations along complex lake boundaries. These improvements highlight the value of integrating physically informed geospatial descriptors with modern segmentation networks for robust supraglacial lake detection.

Beyond methodological gains, the results support downstream applications including meltwater routing analysis, supraglacial drainage characterization, and improved understanding of seasonal lake evolution. Ultimately, this framework contributes to more reliable ice-sheet mass balance assessments and sea-level rise projections by enhancing the consistency and physical realism of supraglacial lake mapping at scale.

9:15am - 9:30am

Glacial Lake Dynamics and Bathymetry Assessment Using Satellite Observations

Vaibhav Garg, Samiksha Mehra, Pankaj R. Dhote

Indian Institute of Remote Sensing, India

The rapid retreat and thinning of glaciers in the North-western Himalayas due to climate change have led to a significant increase in the number and size of glacial lakes. These high-altitude lakes, often dammed by unstable moraines, pose a growing threat of Glacial Lake Outburst Floods (GLOFs), which can cause catastrophic flash floods and endanger downstream communities. Accurate estimation of glacial lake bathymetry is crucial for GLOF risk assessment, but direct measurement is challenging due to inaccessibility and harsh conditions. This study presents a methodology for evaluating glacial lake bathymetry using remote sensing data, focusing on the Panikhar glacier lake in Ladakh, India. Time series analysis was conducted to map the lake's water spread from 2015 to 2024 using optical and synthetic aperture radar data. Three approaches were employed to estimate bathymetry: a radiative transfer model (RTM) based on multispectral reflectance, a topographical model using high-resolution digital elevation models, and empirical equations relating lake area to depth. The RTM approach relies on the optical properties of water, while the topographical model leverages the surrounding terrain to infer underwater topography. Empirical equations were drawn from established literature. Results were validated against physical bathymetry survey observations. Among the methods, topographical modeling demonstrated the highest potential for accurate depth estimation, as it directly incorporates the lake's topographic features. This study highlights the importance of integrating remote sensing techniques for effective GLOF hazard assessment in remote, high-altitude regions, offering a scalable solution for monitoring and mitigating risks associated with glacial lakes in the Himalayas.

9:30am - 9:45am

Wildfire Drives Widespread and Decadal Change in Boreal Lake Colour

Qianyu Chang^1,2, Simon Zwieback², Aaron Berg¹

¹Department of Geography, Environment and Geomatics, University of Guelph, Canada; ²Geophysical Institute, University of Alaska Fairbanks, US

Wildfires are an increasingly dominant disturbance in boreal and Arctic Canada, a trend projected to continue under a changing climate. The ecological and hydrological impacts of wildfires cascade into the abundant inland lakes in these interconnected northern landscapes, leading to post-fire changes in lake quality and colour. Previous in-situ studies on post-fire lake water quality in boreal regions have yielded inconsistent results, preventing a regional-scale understanding of the prevalence, magnitude, and duration of fire impacts on boreal lakes.

Here, we use harmonized Landsat time series to quantify fire-driven lake colour change and its controls across western boreal Canada. We studied 83 fires that burned 13,968 lakes during 2005 - 2015 and quantified lake colour dynamics through surface reflectance in the red wavelength, a proxy for suspended sediments and turbidity. Using a Difference-in-Difference approach, we found pervasive and long-lasting increases in lake colour driven by fire disturbance, beginning in the first post-fire summer and persisting for at least ten years, indicating sustained elevated suspended sediment concentrations and turbidity regardless of physiographic variations. The magnitude and temporal patterns of these changes varied, with burn severity and physiography as important controls. Severe burns in the Taiga and Shield zones underlain by extensive permafrost led to greater and more prolonged changes in lake colour.

These findings underscore the critical and growing role of wildfires in boreal lake quality change, with important implications for aquatic habitats and water resources in a fire-prone future.

8:30am - 10:00am

WG V/3: Open Source Promotion and Web-based Resource Sharing
Location: 714B

8:30am - 8:45am

An Open Source Framework for Routing and Event Management in University Campuses

Gökdemir Çardak^1,2, İbrahim Topcu³, Berk Anbaroglu³

¹Graduate School of Science and Engineering, Department of Geomatics Engineering, Hacettepe UniversityHacettepe University, Türkiye; ²General Directorate of Mapping, Ankara, Türkiye; ³Department of Geomatics Engineering, Hacettepe UniversityHacettepe University, Türkiye

An Open Source Framework for Routing and Event Management in University Campuses

8:45am - 9:00am

Demonstrating the importance of curriculum-focussed content: learnings from a collaborative STEM outreach partnership in second level schools in Ireland.

Eimear McNerney¹, Audrey Martin¹, Barry Doyle², Zerrin Dogança Küçük³, Angela Rickard³, James Lonergan⁴, Patricia Cleary⁵, Helen Cullen⁶, Sasha Brown⁶, Conor Cahalane⁶

¹School of Surveying and Construction Innovation, Technological University Dublin; ²Geospatial Strategy and Services, Tailte Éireann, Phoenix Park, Dublin 8. D08 F6E4, Ireland; ³Department of Education, Maynooth University, Co. Kildare, Ireland; ⁴Society of Chartered Surveyors Ireland, D02 EV61 Dublin, Ireland; ⁵Esri Ireland, D15 NP9Y Dublin, Ireland; ⁶Department of Geography, Maynooth University, Co. Kildare, Ireland.

5*S: Space, Surveyors & Students is a collaborative STEM outreach project lead by Maynooth University, in partnership with the Irish National Mapping Agency, Tailte Éireann, Technological University Dublin, Esri Ireland and the Society of Chartered Surveyors Ireland. Funded by Research Ireland and the European Space Education Research Office (Esero) Ireland, these groups have a shared interest to encourage student enrolment on 'geo' courses at university from under-represented groups and also to preempt a looming skills-gap. 5*S provides interactive and engaging educational content and training to teachers and students (12 to 18 years old) who are interested in learning more about satellites, spatial data and SDGs. Leveraging a combination of ArcGIS StoryMaps, a bespoke Augmented Reality app (SatelliteSkill5 - free to download on PlayStore and AppStore) and the National Geospatial Data platform, Geohive - students and teachers are provided with curriculum-focussed content that help teach how to harness the power of spatial data to solve a set of challenges. Framed around the United Nations Global Geospatial Information Management 14 Fundamental Geospatial Data Themes, each core piece of 5*S content topic is tailored to fit into a packed school curriculum and has been trialled in almost 20% of second level schools in Ireland. The learnings from this tailored content have been recorded and evaluated through a series of quantitative and qualitative respondent questionnaires and teacher focus groups/one-on-one interviews. The findings suggest cross curricular potential, value-add for schools and confirm the importance of this for encouraging data literacy and supporting teacher agency.

9:00am - 9:15am

TorchGeo 1.0: Satellite Image Time Series, and Beyond!

Adam J. Stewart^1,2, Sieger Falkena³, Isaac A. Corley⁴, Harald Kristen⁵, Keenan Eves⁶, Yi-Chia Chang⁷, Chandni Nagda⁷, Frederick Schindlegger⁸, Yang Mu¹, Nils Lehmann^1,2, Arindam Banerjee⁷, Xiao Xiang Zhu^1,2

¹Technical University of Munich, Germany; ²Munich Center for Machine Learning, Germany; ³Shell Information Technology International B.V., The Netherlands; ⁴Taylor Geospatial, USA; ⁵Joanneum Research, Austria; ⁶Independent Researcher, USA; ⁷University of Illinois Urbana-Champaign, USA; ⁸University of Münster, Germany

TorchGeo is a Python library bringing support for geospatial data to the PyTorch deep learning ecosystem. First released over four years ago, TorchGeo has always had strong support for 2D satellite image data. The upcoming TorchGeo 1.0 release will add complete time series support, including 1D through 4D data, requiring a complete rewrite of all GeoDatasets and GeoSamplers. This talk describes the 1.5 years of open source work required to enable full time series support and the backwards-incompatible changes coming to TorchGeo. It also demonstrates the power and simplicity of TorchGeo through a series of case studies: 1D) air pollution, 3D) change detection and land cover mapping, and 4D) weather forecasting and climate modeling. TorchGeo is open source and released under an MIT license, with over 140 built-in datasets, 130 foundation model weights, and 120 contributors from around the world.

9:15am - 9:30am

Empowering the Next Generation: ISPRS Student Consortium's Global Initiatives in Education, Networking, and Capacity Building

Laxmi Thapa¹, Chukwuma John Okolie², Saicharan Vasala³

¹Aston University, United Kingdom; ²African Centre for Cities, School of Architecture Planning and Geomatics, University of Cape Town, South Africa; ³Sharda University, Uttar Pradesh, India

The International Society for Photogrammetry and Remote Sensing Student Consortium (ISPRS SC) serves as the official representation of students and young professionals within ISPRS, connecting a global network of more than 900 active members from 64 countries as of November 2025. This paper presents a comprehensive overview of ISPRS SC activities during the 2022-2025 Board of Directors tenure, highlighting significant expansion in educational outreach and capacity building initiatives. Key achievements include facilitating 15 summer schools across seven countries, providing hands-on training in emerging geospatial technologies, and organizing more than 40 webinars through partnerships with 10 ISPRS Working Groups, demonstrating substantial growth from 2 webinars in 2022 to 24 in 2025. The consortium successfully launched 11 Student Chapters worldwide, establishing localized networks that promote inclusive access to geospatial education across diverse regions. Through quarterly publication of the SpeCtrum newsletter, maintenance of active social media presence across four platforms reaching over 10,000 followers, and organization of networking events at major ISPRS symposia, the consortium has strengthened its communication, networking and professional development opportunities. The paper also discusses operational challenges including funding constraints, geographic representation gaps, and Board capacity limitations, while outlining future initiatives including a mentorship program, virtual symposium, and comprehensive Congress 2026 activities. These efforts underscore ISPRS SC's evolving role in developing the next generation of geospatial professionals equipped to address global sustainability challenges.

9:30am - 9:45am

Evaluating the Rover-Side Performance of a Low-Cost GNSS Network for High-Accuracy Positioning and ZTD Estimation

Milad Bagheri¹, Paolo Dabove¹, Andrea Masiero², Ilaria Ferrando³, Wafi Mahdi Eldoud Mahdi³

¹Polytechnic University of Turin, Italy; ²University of Padova, Italy; ³University of Genoa, Italy

The densification of GNSS Continuously Operating Reference Station (CORS) networks in mountainous regions is constrained by the high cost of geodetic-grade equipment. Low-cost (LC) multi-frequency GNSS receivers offer a viable alternative, yet their performance in challenging high-altitude Alpine environments remains largely unexplored. This study evaluates the rover-side positioning performance and tropospheric delay estimation capability of a newly installed LC permanent station at Prali (2200~m elevation), in the Alpine region of Piedmont, Italy. The station, based on a u-blox ZED-F9P receiver with a broadband LC antenna and a Raspberry Pi computer, was assessed using Virtual Reference Station (VRS) corrections from the SPIN3 professional CORS network. Six independent two-hour RTK sessions across a full diurnal cycle were processed using RTKLIB in forward-only kinematic mode to emulate real-time conditions. Results demonstrate that the LC station achieves centimetre-level horizontal precision (8--11~mm) with fix rates up to 97\% and time to first fix below 3~minutes under favourable conditions. A diurnal performance variability was observed and characterised across the six sessions. Zenith Tropospheric Delay estimation via CSRS-PPP with 92\% fixed ambiguities yielded physically consistent values (mean ZTD~=~1811~mm, ZWD~=~41~mm), consistent with dry winter conditions at altitude. These results confirm that LC GNSS stations can deliver reliable centimetre-level positioning and meaningful tropospheric products in demanding Alpine environments, supporting their deployment for CORS network densification in regions where geodetic-grade infrastructure is economically or logistically prohibitive.

9:45am - 10:00am

Development of VR/AR applications to support geospatial education

Dimitrios Bolkas¹, Jeffrey D. Chiampi¹, Jason Herrera¹, Kyle E. Jones¹, Matthew B. Sharr¹, Matthew O'Banion², Erica I. Parisi³, Ivan Detchev⁴

¹Pennsylvania State University, United States of America; ²United States Military Academy, West Point; ³University of Florence, Italy; ⁴University of Calgary, Canada

Over the last few years immersive technologies have experienced rapid advancement providing several solutions in geospatial education such as improving student preparedness, enhancing student learning of theoretical concepts and practical procedures, and even supporting remote learning. However, several educators cannot utilize such immersive technologies because many of the

existing applications are not suitable for geospatial learning. Use of immersive technologies in education often necessitates specialized software and application development with the total investment (in terms of cost and time) becoming a barrier. This project is spearheaded by Working Group V/1 of ISPRS, and it is also supported by the Education and Capacity Building Initiative (ECBI) 2024 grant to provide sample experiences to educators. This project developed two immersive experiences relevant to geospatial education that can be used to enhance lab delivery and learning. The first experience uses a simplified GNSS receiver for

topographic mapping in virtual reality (VR). The second experience uses a tablet and an external GNSS receiver to visualize 3D objects in augmented reality (AR). To design these two applications the research team distributed a global questionnaire to professionals and educators. The questionnaire assisted in understanding the participant’s experience with immersive technologies, their attitude and beliefs towards these tools, and the potential benefits that immersive technologies can bring in education and industry. The results from the VR/AR implementation indicate that interactive environments can effectively support student preparation and reveal common misconceptions in topographic data collection, highlighting their value as both training and diagnostic tools in geospatial education.

10:00am - 10:15am

Modern online teaching formats for geodetic reconstruction methods in Ukraine

Iuliia Horkovchuk¹, Denys Gorkovchuk¹, Mykola Tregub², Maria Chizhova³, Darius Popovas⁴, Till Sieberth⁴, Thomas Luhmann⁴

¹Kyiv National University of Construction and Architecture; ²Dnipro University of Technology; ³Otto-Friedrich Universität Bamberg, Digital Technologies in Heritage Conservation; ⁴Institute for Applied Photogrammetry and Geoinformatics, Jade University of Applied Sciences, Oldenburg, Germany

The GeoRek project, funded by the DAAD within the German-Ukrainian University Network, aims to strengthen geospatial education in Ukraine through digitalization and international cooperation. Implemented by Jade University of Applied Sciences (Germany) together with Kyiv National University of Construction and Architecture (KNUCA), Dnipro University of Technology, and the University of Bamberg, the initiative develops innovative e-learning tools and micro-credential systems for geodetic reconstruction and high accuracy documentation.

A central element of the project is the VRscan3D - virtual laser scanner simulator — an educational platform that enables realistic training in terrestrial and mobile laser scanning without the expensive equipment. The system supports interactive learning, gamified exercises, and data export for advanced processing.

GeoRek further establishes micro-certificates in key subjects such as terrestrial laser scanning, photogrammetry, and 3D/BIM data processing, aligning with European standards (ECTS, EQF) to promote flexible and lifelong learning. The project’s applied component includes real-life case studies on the digital documentation for reconstruction of war-damaged buildings in Ukraine.

Overall, GeoRek exemplifies how modern digital education can strengthen academic resilience, support reconstruction, and deepen long-term German-Ukrainian cooperation in geospatial sciences.

8:30am - 10:00am

ThS23B: Towards Large Cultural Heritage Foundation Models: Datasets, Semantic Alignment, and Component-Level Annotation
Location: 715A

8:30am - 8:45am

Research on Hyperspectral-Based Feature Set Construction and Machine Learning Inversion for Mixed Salts Characteristics in Murals

Wenxuan Lin^1,2, Shuqiang Lyu^1,2, Bo Ning³, Lihong Li³, Miaole Hou⁴

¹Beijing University of Civil Engineering and Architecture; ²Beijing Key Laboratory for Architectural Heritage Fine Reconstruction & Health Monitoring; ³Yungang Research Institute; ⁴Chang’an University

To enable non-destructive quantitative identification of mixed salts in mural plaster layers, hyperspectral data were collected from Na₂SO₄-CaCl₂ mixed-salt samples. Based on these data, a method integrating spectral preprocessing, feature-set construction, and machine-learning inversion was proposed. First, the original spectra were preprocessed using Savitzky-Golay smoothing and multiplicative scatter correction. A 0.6-order fractional-order derivative (FOD) was then introduced to enhance subtle salt-related spectral features. Subsequently, 30 single-band features were selected using a two-step strategy involving competitive adaptive reweighted sampling for preliminary screening and variable importance in projection for secondary screening. On this basis, dual-band and tri-band spectral indices were further constructed, and a combined-band feature set was formed by integrating the three feature sets. Gaussian process regression (GPR) was used to compare the inversion performance of different feature-input strategies for Na₂SO₄ and CaCl₂ contents. The results showed that the 0.6-order FOD achieved a favorable balance between feature enhancement and noise suppression. Among the evaluated feature-input strategies, the combined-band model showed the best predictive performance for both Na₂SO₄ and CaCl₂. These results indicate that integrating complementary information from feature sets with different dimensions can improve the stability and accuracy of mixed-salt inversion, providing a useful reference for the hyperspectral non-destructive quantification of mixed salts in murals.

8:45am - 9:00am

Research on Deacidification Treatment for Addressing the Acidification Crisis of Map Archives

Ming Li¹, Fen Hu², Ju Bai¹, Lifei Feng³, Hao Yu¹, Yunlu Peng¹

¹National Geomatics Center of China; ²Land Satellite Remote Sensing Application Center, Ministry of Natural Resources of P.R. China; ³Sichuan Ruili Heritage Preservation Technology Co., Ltd.,

Map archives, serving as crucial cultural heritage documenting historical spatial information, face severe challenges in long-term preservation. To evaluate the feasibility of deacidification technology in the conservation of map archives, this study utilized 41 severely acidified early 20th-century map archives as samples. These were treated using a specific non-aqueous deacidification technology, and changes in pH value, color difference (ΔE), and inks stability before and after treatment were analyzed. The results indicate that after deacidification, the paper pH value significantly increased from an average of 4.48 to a range between 8.24 and 8.87. The color change was minimal, with an average color difference ΔE of only 1.62. This study verifies that the deacidification technology is suitable and effective for the deacidification treatment of acidified paper-based map archives, providing a safe and reliable method for preserving their cultural value.

9:00am - 9:15am

High-Precision Registration of Grotto Point Clouds Using Multi-Source Data Fusion

jinru li¹, miaole hou^1,2, bo ning³, yong he^1,3

¹Beijing University of Civil Engineering and Architecture, China, People's Republic of; ²Chang'an University; ³Yungang Researeh Institute

To address the challenges of large initial pose discrepancies in grotto point clouds acquired from multiple sources, complex local geometric structures, significant noise interference, and the tendency of traditional ICP algorithms to fall into local optima, a high-precision point cloud registration method is proposed by integrating feature extraction with the collaborative optimization of coarse and fine registration. This method first performs point cloud preprocessing through voxel downsampling and outlier removal; it then extracts stable feature regions based on normal vector estimation and curvature analysis, and constructs feature representations using FPFH descriptors; building on this, the K-4PCS algorithm is employed to perform coarse registration and obtain optimal initial transformation parameters, followed by fine registration using an improved ICP algorithm combined with KD-tree-based search optimization. The proposed method was validated using the STANFORD DRAGON dataset and the point cloud of the Buddha head statue from Cave 18 of the Yungang Grottoes. The results indicate that the proposed method effectively improves the convergence speed and accuracy of point cloud registration. It demonstrates good stability and applicability in complex cave heritage scenarios and can provide methodological support for the fusion of multi-source point clouds in the digital preservation of cultural heritage.

9:15am - 9:30am

Automatic Line Drawing Generation for Grotto Wall Surfaces Based on Depth Map and Normal Map Fusion

Jiteng Wang¹, Miaole Hou^1,2, Yong He^1,3, Bo Ning³

¹School of Geomatics and Urban Spatial Informatics, Beijing University of Civil Engineering and Architecture, No. 15, Yongyuan Road, Huangcun Town, Daxing District, Beijing, 102627, China; ²School of Land Engineering, Chang’an University, Middle Section, South 2nd Ring Road, Xi'an, Shaanxi, 710054, China; ³Yungang Research Institute, No. 1, Dong Street, Yungang Town, Yungang District, Datong City, 037007, China

To address the lack of suitable methods for automatic 2D line drawing generation from grotto wall mesh models, as well as the difficulty of existing methods in balancing structural representation and detail preservation, this paper proposes a line drawing generation method based on depth map and normal map fusion. The method first orthographically projects the 3D model into a depth map and a surface normal map, then constructs an initial line drawing pipeline based on projected edge fusion. A layered optimization strategy is further introduced to improve detail representation and result stability. Experiments on the mesh model of the north wall of Cave 18 at the Yungang Grottoes show that the projected edge fusion method is more suitable for overall structural representation, while the layered optimization method performs better in preserving weak structures and fine details. The proposed method effectively improves the quality of automatic 2D line drawing generation for grotto wall surfaces.

9:30am - 9:45am

An Automated Recognition Framework for Surface Deterioration Features of Stone Sculptural Artifacts in the Yungang Grottoes based on Deep Learning

Zhenyu Lin^1,2, Yuan Cheng^1,3, Hanling Li¹, Jizhong Huang^1,3, Guanpeng Xia¹, Yue Zhang^1,3, Wanfu Wang⁴, Bo Ning⁵, Hongbin Yan⁵

¹Institute for the Conservation of Cultural Heritage, School of Cultural Heritage and Information Management, Shanghai University,Shanghai, China; ²School of Materials Science and Engineering, Shanghai University, Shanghai, China; ³Key Laboratory of Silicate Cultural Relics Conservation (Shanghai University), Ministry of Education; ⁴National Research Center for Conservation of Ancient Wall Paintings and Earthen Sites, Dunhuang Academy, Dunhuang, Gansu, China; ⁵Yungang Research Institute, Datong, Shanxi, China

Rock-cut cave temples, such as the UNESCO World Heritage site of Yungang Grottoes, represent invaluable cultural heritage facing severe deterioration. Traditional monitoring methods are often slow, subjective, and inadequate for large-scale, long-term analysis, creating a critical gap in effective conservation.To address this challenge, we developed an automated framework for identifying surface deterioration features on stone carvings using deep learning. Our approach leverages a novel multi-source image dataset, combining historical and modern imagery of the Yungang Grottoes. We propose an enhanced model based on the YOLO architecture, featuring a synergistic semantic and spatial perception mechanism that significantly improves the detection of subtle features like peeling and cracks.The model was trained to recognize three key deterioration types: peeling, crack, and human damage. On-site deployment and testing in the authentic cave environments demonstrated excellent performance, achieving high recognition confidence for cracks (87.5%), peeling (85.2%), and human damage (81.3%). This study provides a powerful new tool for the quantitative monitoring of stone carvings, offering a scientifically-informed pathway for practical and proactive conservation strategies at heritage sites worldwide.

9:45am - 10:00am

Hyperspectral Analysis of Pigment Identification and Abundance Inversion in the Dome of China’s Yungang Grotto 7

Junlin Yan^1,2, Shuqiang Lyu^1,2, Bo Ning³, Lihong Li³, Miaole Hou⁴

Most of the grotto temples have undergone long-term weathering and multiple repainting campaigns, so accurate identification of the composition and spatial abundance of surface pigments is an important foundation for pigment characterization and conservation research. This study focuses on the dome of Yungang Grotto 7. Data were acquired using a three-dimensional (3D) hyperspectral multimodal digital acquisition system and the Analytical Spectral Devices (ASD) field spectroradiometer. The workflow consisted of two stages: pigment identification and abundance inversion. In the pigment identification stage, a normalized weighted identification method integrating Spectral Angle Mapper (SAM) and the Normalized Difference Spectral Index (NDSI) was proposed based on mineral pigment reflectance curves measured by the ASD field spectroradiometer. In the abundance inversion stage, Fully Constrained Least Squares (FCLS) was applied to estimate pigment proportions in mixed pixels under non-negativity and sum-to-one constraints. The results show that the green pigments are most likely malachite and Paris green, the red pigments are most likely hematite and laterite, and the black pigment is most likely carbon black. The interwoven distribution of Paris green and traditional mineral pigments provides material-science evidence for modern repainting and restoration in this area. Nonlinear mixing may occur on rough and weathered grotto surfaces. However, under the current data conditions, its influence on abundance inversion remains unclear. Therefore, Kernel Fully Constrained Least Squares (K-FCLS) was additionally introduced as a reference nonlinear model for qualitative comparison with FCLS.

8:30am - 10:00am

WG V/1: Education and Training through Curricula Development and Enhanced Learning Practices
Location: 715B

8:30am - 8:45am

Earth Sensing in the Dolomites: A Summer School for Capacity Building and Collaboration on Geomatics for Environmental Applications

Larissa Maria Granja^1,2, Erico Kutchartt^1,3, Enrico Magazzino^1,2, Thomas Zieher⁴, Rasoul Eskandari⁵, Michele Crosetto⁶, Caterina Balletti⁷, Andrea Martino⁷, Mattia Balestra^8,9, Roberto Pierdicca⁹, Lindo Nepi¹⁰, Carlos Cabo¹¹, Anna Iglseder¹², Mauricio Acuna¹³, Borja García Pascual^13,14, Francesco Pirotti^1,2

¹Department of Land, Environment, Agriculture and Forestry (TESAF), University of Padova, Viale dell’Università 16, Legnaro, PD 35020, Italy; ²Interdepartmental Research Center of Geomatics (CIRGEO), University of Padova, Corte Benedettina, Via Roma 34, Legnaro, PD 35020, Italy; ³Forest Science and Technology Centre of Catalonia (CTFC), Carretera de Sant Llorenç de Morunys, Km 2, 25280 Solsona, Spain; ⁴Department of Natural Hazards, Austrian Research Centre for Forests (BFW), Rennweg 1, 6020 Innsbruck, Austria; ⁵Department of Architecture, Built Environment and Construction Engineering, Politecnico di Milano, via Ponzio 31, 20133 Milano, Italy; ⁶Centre Tecnològic de Telecomunicacions de Catalunya (CTTC/CERCA), Geomatics Division, Av. Gauss, 7, E-08860 Castelldefels (Barcelona), Spain; ⁷Università Iuav di Venezia, Santa Croce, 191, Venezia, VE 30135, Italy; ⁸Department of Agricultural, Food and Environmental Sciences (D3A), Università Politecnica delle Marche, Ancona, 60131, Italy; ⁹Department of Civil, Building and Architectural Engineering (DICEA), Università Politecnica delle Marche, Ancona, 60131, Italy; ¹⁰Department of Information Engineering (DII), Università Politecnica delle Marche, Ancona, Italy; ¹¹Department of Mining Exploitation and Prospecting, University of Oviedo, Campus de Mieres, 33600 Mieres, Oviedo, Spain; ¹²Department of Geodesy and Geoinformation, TU Wien, Wiedner Hauptstraße 8-10, 1040 Vienna, Austria; ¹³Forest technology and Wood Material Solutions, Natural Resources Institute Finland (Luke), 80100 Joensuu, Finland; ¹⁴School of Forest Sciences, University of Eastern Finland, 80101 Joensuu, Finland

Capacity building is a key element in promoting and training in spatial technologies and in fostering a network of early-stage researchers for future collaborations. The first edition of the Earth Sensing Summer School, organised by the University of Padova with support from ISPRS Education and Capacity Building (ECBI) funds, was held from 7 to 13 September 2025 in the Dolomite area of the Alpine region. This article illustrates specific aspects of the organisation and discusses the return on investment in terms of training and networking. It highlights the methodology used for selecting participants and conducting the training, which included a balanced combination of seminars, fieldwork, data analysis, dissemination to peers, and a final defence of results. We discuss the outcomes and feedback from the almost 40 participants and provide ideas for future improvements, aiming to offer insights for fellow researchers who might want to replicate a capacity-building activity of this kind.

8:45am - 9:00am

Implementing Team-based Learning in Geomatics Education: enhancing hard and soft Skills in multicultural academic Contexts

Andrea Ajmar¹, Fabio Giulio Tonolo²

¹Interuniversity Department of Regional and Urban Studies and Planning, Politecnico and Università di Torino, Italy; ²Department of Architecture and Design, Politecnico di Torino, Torino, Italy

This practice paper presents the design, implementation, and first evaluation of Team-Based Learning (TBL) activities in university-level Geomatics courses taught in English to multicultural and international student groups. The study documents a structured pathway for adapting TBL to technically demanding subjects, including GIS suitability analysis, network analysis, remote sensing classification, and heat-risk assessment. Its main contribution lies in showing how a pedagogical model widely discussed in general higher education can be translated into software-based Geomatics teaching while supporting both disciplinary learning and intercultural collaboration. The paper also identifies the main organizational conditions for successful adoption, including team formation, workload calibration, and suitable classroom settings. Results from 12 TBL implementations involving 187 students and 470 total participations show clear benefits of teamwork: average team test performance was markedly higher than individual performance, repeated participation was associated with improved results, and student satisfaction increased after the introduction of TBL. Qualitative evidence further indicates gains in communication, teamwork, and intercultural interaction. Although the first implementation required substantial preparation effort, the approach proved replicable and scalable in subsequent editions, making TBL an effective instructional model for Geomatics education.

9:00am - 9:15am

ISPRS SC Summer School: A Global Initiative on Capacity Building and Education Outreach in the Field of Photogrammetry, Remote Sensing and GIS

Laxmi Thapa¹, Nicolas Pucino², Miguel Luis Rivera Lagahit³, Chukwuma John Okolie⁴

¹Aston Business School, Birmingham, United Kingdom; ²Climate Friendly, Sydney, New South Wales, Australia; ³Dynamic Map Platform Co., Ltd., Tokyo, Japan; ⁴African Centre for Cities, School of Architecture Planning and Geomatics, University of Cape Town, South Africa

The ISPRS Student Consortium (SC) Summer Schools are one of the fundamental initiatives that ISPRS SC jointly organises with interested institutions to advance education outreach and capacity building in photogrammetry, remote sensing, and geospatial information sciences. Since their start in 2004, these programs have provided students and young researchers with immersive learning opportunities, combining technical lectures, hands-on sessions, and cultural experiences. Grounded in Experiential Learning Theory, the Summer Schools emphasise real-world application, reflective observation, and collaboration. This paper explores their evolution, global outreach, and educational impacts. Drawing on recent ISPRS SC Summer Schools, including the BUCEA Summer School 2024 on Smart Cities and the Summer School 2024 on AI for Geospatial Applications, the analysis highlights their integration of theory and practice, networking benefits, and transformative cultural exchange. Challenges such as financial barriers and technological gaps are discussed together with recommendations for sustaining and enhancing these initiatives. This study underscores the critical role of ISPRS SC Summer Schools in fostering a global community of geospatial practitioners to address real-world challenges.

9:15am - 9:30am

Legal aspects in photogrammetric curricula: navigating property rights and airspace boundaries

Salvatore Marsico¹, Dimitrios Bolkas¹, Henrique Candido de Oliveira², Matthew Sharr¹

¹Penn State University, United States of America; ²Universidade Estadual de Campinas (Unicamp)

This paper discusses the importance of integrating legal aspects into UAS mapping courses and related curricula providing a framework for integration in an introductory photogrammetric course with sample questions and assignments. Curricula focus is placed on two interrelated issues: first, the extent to which property owners maintain a reasonable expectation of privacy from UAS intrusion within the “immediate reaches” of their airspace; and second, the potential for UAS-mounted sensors to inadvertently capture imagery or point cloud of neighboring properties while operating in compliance with Federal Aviation Administration (FAA) regulations. The discussion concludes by identifying strategies to mitigate these legal and operational challenges, giving students the knowledge and tools to address similar situations in real-life scenarios, and ensuring that aerial surveying practices respect both regulatory compliance and property rights.

9:30am - 9:45am

Building Capacity in Satellite-Based Earth Observation and HQP Training: Canada as a Use Case

Ashraf Elshorbagy, Scott Mitchell, Koreen Millard

Carleton University, Canada

Remote sensing (RS) and especially earth observation (EO) have been used extensively for decades in environmental monitoring, infrastructure asset management, urban planning, emergency response, mapping and many others. The pace of technology

advancements in big data, cloud computing, Geospatial AI (GeoAI) and Geospatial Foundation Models (GeoFM) causes a paradigm shift on how to and who can maximize the potential of remote sensing technology. This paradigm shift challenges traditional geomatics education and pedagogical methods. Additionally, the gap between geomatics graduates’ skills and market needs is widening. The pace of disruptive technology advances like GeoAI and GeoFM often outpaces developments in geomatics education content or suitable pedagogical methods and formats. To address these skills gaps in geomatics courses and courseware, an initiative has been developed

between the Canadian Space Agency and Carleton University, involving more than a dozen different partners spanning industry, government, academia and NGOs. We have been gathering information through qualitative and quantitative techniques to obtain

insights about the soft and hard skills that are valuable and/or lacking in contemporary geomatics graduates, to forecast trends and future needs, and plan how to optimize the introduction of new technology and techniques into the educational content. Based on the mapped feedback, university-level geomatics courses are being redeveloped and updated, and novel course modules, mini-courses and micro credential programs are being developed and tested.

9:45am - 10:00am

Advancing Earth Observation in Africa : Achievements of the WG Africa Copernicus Training of trainers program in three languages

Linda Tomasini¹, Ali Arslan Nadir², Agnès Begue³, Nico Bonora⁴, Catarina Duarte⁵, Maria Daraio⁶, Jean-François Faure⁷, Philippe Gisquet⁸, Carlos Gonzales Inca⁹, Benjamin Palmaerts¹⁰, Michal Krupinski¹¹, Cécilia Leduc¹², Marc Leroy¹³, Marietta Papakonstantinou¹⁴, Cristina Lira¹⁵, Carolina Sa¹⁶, Dimitra Tsoutsou¹⁷

¹CNES, France; ²FMI, Finland; ³CIRAD, France; ⁴ISPRA, Italy; ⁵Air Centre, Portugal; ⁶ASI, Italy; ⁷IRD, France; ⁸Visioterra, France; ⁹University of Turku, Finland; ¹⁰ISSEP, Belgium; ¹¹CBK PAN, Poland; ¹²IDGEO, France; ¹³Space4Dev, France; ¹⁴NOA, Greece; ¹⁵IPMA, Portugal; ¹⁶PT Space, Portugal; ¹⁷PRAXI network, Greece

The WG Africa project is a collaborative initiative bringing together 12 national institutions from 8 European countries. Its objective is to support and strengthen the use of Copernicus data and services in Africa through a training-of-trainers program funded by the European Commission under the Framework Partnership Agreement on Copernicus User Uptake (FPCUP) and implemented in French, Portuguese, and English. To widely support the Copernicus products uptake, the primary goal is to collaborate with African academic and private-sector trainers by integrating Copernicus-based modules into their training programs or curricula. This initiative complements other capacity-building efforts in space-based Earth Observation in Africa, such as GMES & Africa and the Global Gateway European initiative.

10:00am - 10:15am

Geospatial UK Higher Education – status, challenges, and outreach initiatives

Henny Mills, Craig Robson, Alex Linighan, Stuart Edwards, Chris Larkin

Newcastle University, United Kingdom

Geospatial education in the UK is facing a critical decline, despite the increasing relevance of 3D reality capture and spatial technologies across sectors. While industry recognises the value of geospatial skills, the absence of coordinated national policy or incentives has led to the closure of key undergraduate programmes. Notably, Newcastle University closed its geospatial UG programme in 2023. The University of East London remains the only UK institution offering a dedicated undergraduate surveying degree, supplemented by an industry-linked apprenticeship. To address the skills gap, several universities now offer postgraduate conversion courses in geospatial science, primarily within geography or environmental science departments.

Outreach has emerged as a vital strategy to raise awareness and inspire future talent. GeospatialUK.org, developed at Newcastle University with industry support, provides accessible resources on careers, study pathways, and classroom activities aligned with UK education curricula at high school level. Its exercises—ranging from mapping hazards, wildfires, census data to GNSS-based calculations—bridge advanced research with school-level learning. It also offers insight into geospatial relate careers and links to possible job opportunities. The platform has gained international traction and continues to attract users.

This paper highlights the urgent need for national coordination in geospatial education and showcases GeospatialUK.org as a scalable model for outreach. Without intervention, the UK risks a shortage of skilled geospatial professionals, undermining its capacity to address pressing societal challenges

8:30am - 10:00am

WG IV/3: Geo-computation and Geo-simulation
Location: 716A

8:30am - 8:45am

A Framework for Mapping Recreational Boating: Inferring Vessel Behaviour from Mobile Phone Data and Sentinel-2 Imagery

Katarzyna Sila-Nowicka¹, Richard Beauchamp¹, Murray Ford¹, Ian Tuck², Bruce Hartill²

¹University of Auckland, New Zealand; ²Ministry of Primary Industries, New Zealand

Recreational fishing supports economies, wellbeing, and connection to the marine environment but can pressure fish stocks. Traditional monitoring in New Zealand is costly, sporadic, and self-reported. This study evaluates integrating mobile phone data (MPD) and satellite-based object detection (YOLO on Sentinel-2 and sub-meter imagery) to improve monitoring. MPD provides temporal coverage but is biased, while satellite imagery offers spatial validation but provides only snapshots. Combining these datasets mitigates biases and gaps, enabling more accurate, representative estimates of fishing activity. This is the first study to integrate these approaches, demonstrating the potential of hybrid methods for scalable, cost-effective recreational fisheries monitoring.

8:45am - 9:00am

Building Footprint Aggregation with Preservation of Edge Orientations

Alexander Naumann, Samuel Bergé, Jonas Sauer, Jan-Henrik Haunert

University of Bonn, Germany

The aggregation of building footprints is a key task of cartographic generalization, which is an important topic in geoinformation science. It has been approached from various angles, ranging from heuristics and optimization algorithms to machine learning. Given a set of input polygons that represent the building footprints, the task is to generate a set of polygons that provide a coarser representation of the input. The problem has applications in the visualization of settlement areas in small-scale maps, as well as settlement classification and analysis. A popular solution approach is to construct a subdivision of the plane and then build a solution by selecting faces from the subdivision. Often, a triangulation is used for the subdivision. However, this can cause the orientations of the boundary edges in the solution to differ drastically from the input polygons, which leads to a loss of information about the underlying settlement structure. We explore an alternative method that constructs the subdivision by extending the input building edges, thereby automatically preserving their orientations. To make the approach scalable to large instances without substantially decreasing the solution quality, we propose different methods of reducing the complexity of the subdivision. Our experimental evaluation on real-world data shows that our method is able to aggregate towns containing up to approximately 10 000 building footprints while preserving input edge orientations much better than state-of-the-art methods.

9:00am - 9:15am

Lane-level Dynamic Information Updating for High-Definition Maps Based on Crowdsourced Data

Yuan Guo¹, Zhangcai Yin¹, Liangcun Jiang¹, Bijun Li², Han Zhou¹, Quanhua Dong³, Shen Ying⁴

¹School of Resources and Environmental Engineering, Wuhan University of Technology; ²State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University; ³Institute of Remote Sensing and Geographical Information Systems, School of Earth and Space Sciences, Peking University; ⁴School of Resources and Environmental Sciences, Wuhan University

Timely updates of lane-level dynamic information are crucial for intelligent vehicle path planning and driving safety. Most existing crowdsourced map update methods lack sufficient analysis of the reliability and uncertainty of perception results, making it difficult to ensure the accuracy of map updates. We propose a novel method for updating lane-level dynamic information in HD maps based on crowdsourced data. First, a hybrid modelling multi-object detection method is used to reliably perceived lane markings and traffic cones. To address the issues of false detection and missed detection in single-vehicle perception, a multi-vehicle probabilistic fusion algorithm is proposed, which explicitly models perceptual uncertainty to effectively mitigate the impact of missed and false detections, enabling accurate, robust, and real-time detection of dynamic information such as temporary lane closures.To validate the effectiveness and accuracy of the proposed method, we conducted experiments in the Intelligent and Connected Vehicle Demonstration Zone in Wuhan.Experiments comparing single-vehicle and multi-vehicle fusion results demonstrate the effectiveness of the proposed method in enhancing detection performance.

9:15am - 9:30am

Maximum entropy for climate change and variability impact assessment on seabirds: use case on Eudyptula minor little penguins

Priyanka Singh¹, Sameer Saran², Amulya Saxena¹

¹Dept. of Natural and Applied Sciences, TERI School of Advanced Studies, Delhi, India; ²Regional Remote Sensing Center-North, ISRO, New Delhi, India

This study uses machine learning and geospatial science to investigate how climate change may affect the foraging and habitat suitability of little penguins Eudyptula minor in Australia and New Zealand. An innovative modeling approach was followed here to identify favorable climatic conditions for the species across both regions. The model trained on Australian occurrence data was projected to New Zealand, and vice versa, to assess cross-regional habitat suitability and potential range shifts under changing climate conditions. This is to further evaluate adaptive potential and determine whether transoceanic relocation would be feasible in the event of local extinction. The study evaluated habitat suitability using the ML model and climate variables from the WorldClim dataset. The findings showed that the healthy habitat of little penguins is significantly shaped by temperature-related bioclimatic variables, especially temperature annual range. According to the models, the habitat suitability of little penguins varies between the two nations, with Australia offering the little penguins of New Zealand less hospitable conditions. But the New Zealand is predicted to offer relatively better habitat to Australia-based little penguins. This study offers vital information for conservation strategies by highlighting the possible changes in penguin populations brought on by climate change. A promising tool for comprehending how the climate affects marine ecosystems is provided by this study.

9:30am - 9:45am

Parametric Modelling and GIS Integration for Multi-Criteria Decision-Making: An Application to the Einstein Telescope Underground Research Infrastructure

Wissam Wahbeh¹, Maria Marsella², Francesco Rossi²

¹FHNW University of Applied Sciences and Arts Northwestern Switzerland, Switzerland; ²Sapienza University of Rome, Department of Civil, Building and Environmental Engineering, Italy

This paper presents an advanced computational framework developed to support decision-making for the placement of the underground Einstein Telescope, a third-generation gravitational-wave observatory. The system aims to automate the search for an optimal location through a multi-criteria analysis approach. Because the ET is extremely sensitive to environmental noise sources—including seismic, thermal, and anthropogenic vibrations—its design prioritises underground construction. This strategy, also adopted for the Japanese KAGRA detector and in contrast to surface-based observatories such as LIGO and Virgo, minimises interference from surface activities while ensuring subsurface stability.

The proposed methodology integrates Geographic Information System (GIS) data, incorporating a Digital Surface Model (DSM) to spatially represent relevant factors. The dominant site-selection criteria were identified and weighted according to their scientific and strategic importance in collaboration with the ET scientific community. An interactive parametric model was developed to interface directly with the GIS data, enabling evaluation of key factors and providing real-time analytical feedback on placement scenarios. Using an evolutionary algorithm combined with a composite fitness function, the system balances competing objectives and delivers optimised solutions, offering a robust decision-support tool for the early planning stages of the Einstein Telescope project.

Although the Sardinia site is currently considered a preliminary case study, the methodology is generalisable and applicable to other candidate sites to host ET

9:45am - 10:00am

Kinematic Characteristics and Risk Analysis of Potential Rockfall based on 3D Point Clouds

Wei Yuan¹, Changqing Liu^1,2, Han Bao², Weihang Ran⁴, Zhongyuan Yang¹, Xiuxiao Yuan³, Ryosuke Shibasaki⁴, Shunichi Koshimura¹

¹Tohoku University, Japan; ²Changan University, China; ³Wuhan University, China; ⁴The University of Tokyo, Japan

In fractured rock slopes, the geometric configuration and spatial arrangement of unstable rock blocks are fundamentally governed by the intersection of multiple joint sets. The mechanical weakening along these joints markedly reduces the integral strength of the rock mass and establishes potential kinematic release boundaries. This study establishes an in-situ hazardous-rock detection and characterization framework utilizing high resolution three-dimensional point cloud acquired under realistic topographic conditions. This method first examines the spatial interaction between joints and slope morphology, and incorporates explicit kinematic criteria to automatically identify structural combinations capable of different failures. Consequently, the spatial positions and distribution patterns of potentially unstable blocks are delineated within the point cloud. Subsequently, point cloud differencing is employed to achieve volumetric extraction and statistical classification of block sizes, enabling quantitative characterization of block volume and elevation across the source areas. Representative blocks are then selected as initial release elements, with their actual geometrical and volumetric attributes incorporated into rockfall simulations. This allows for the computation of key kinematic parameters including rockfall frequency, bounce height, velocity, and kinetic energy. Overall, the presented approach delivers a scalable pathway for rapid detection, quantitative assessment, and hazard evaluation of structurally controlled rockfalls in complex mountainous terrain. The results provide technical support and decision insights for the safe operation and disaster-resilient planning of transportation infrastructure.

10:00am - 10:30am

Morning Coffee Break
Location: Exhibition Hall "E"

10:30am - 12:00pm

WG III/1G: Remote Sensing Data Processing and Understanding
Location: 713A

10:30am - 10:45am

YOLOv8m-CCFM-GSConv: Research on Lightweight Marine Oil Spill Target Detection Based on Improved YOLOv8m Model

Junjie Lu¹, Qingyang Wang^1,2,3, Bo Song¹, Jianwu Jiang^1,2,3, Bin Yang¹, Chen Jiao¹

¹College of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541004, China; ²Guangxi Ecological Spatiotemporal Big Data Perception Service Laboratory, Guilin 541004, China; ³Guangxi Key Laboratory of Spatial Information and Geomatics, Guilin 541004, China

In the application of target detection for marine oil spills, deep learning methods are gradually replacing traditional remote sensing image recognition approaches. While complex models designed for higher accuracy may compromise recognition speed, they often fail to meet the rapid response requirements of terminal device applications (Chai et al, 2025). Therefore, developing a lightweight detection model that balances high accuracy and real-time performance is crucial for enhancing marine oil spill emergency response capabilities (Liang et al, 2024). Based on the yolov8m model, this study introduces GSConv (Li et al, 2024) lightweight convolution and CCFM (Guo et al, 2025) cross-scale feature fusion module, which significantly improves the adaptability of multi-scale target detection and recognition accuracy in complex backgrounds while maintaining model lightweightness, thereby offering a novel and effective solution for marine oil spill target detection.

10:45am - 11:00am

Detecting moving vehicles on Sentinel-2 imagery using semi-automatic labeling from S2A/S2C tandem phase

Guillaume Buthmann¹, Florentin Poucin¹, Jérémy Anger^1,2

¹Kayrros SAS; ²Université Paris-Saclay, ENS Paris-Saclay, CNRS, Centre Borelli, 91190, Gif-sur-Yvette, France

During the commissioning phase of ESA's Sentinel-2C, tandem images with Sentinel-2A were acquired with a delay of 30 seconds. We present a novel, automated method for labeling moving vehicles in Sentinel-2 images, leveraging the temporal offset between these tandem acquisitions. We propose a filtering process that isolates pixels corresponding to vehicles that moved between the two acquisitions. We generate a training dataset based on this process, removing the need for a large manual labeling phase. The dataset is used to train a standard deep-learning-based vehicle detection model. Experimental results, as well as a validation study using ground-truth data from California, highlight the quality of the proposed labeling method, and show that a vehicle detection model can be successfully trained from quasi-simultaneous acquisitions.

11:00am - 11:15am

LAD-Enhancer: A Lightweight All in One Aerial Detection Enhancer Under Adverse Weather

Yu Wan¹, Jie Li¹, Liupeng Lin², Zaiyan Zhang¹, Qiangqiang Yuan¹, Huanfeng Shen²

¹School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China; ²School of Resource and Environmental Sciences, Wuhan University, Wuhan 430079, China

With the rapid development of aerial imaging technology, aerial target detection has become a research hotspot with broad applications in intelligent transportation, agricultural monitoring, and military surveillance. However, the performance of aerial detection models is often degraded under adverse weather conditions such as fog, sandstorms, and low illumination. In such environments, aerial images typically suffer from reduced contrast and color distortion, which significantly affects the model’s ability to accurately identify targets. To this end, a Lightweight All-in-One Aerial Detection Enhancer Under Adverse Weather (LAD-Enhancer) has been proposed. The designed enhancer processes and restores degraded aerial images, thereby enhancing the detection model’s ability to perceive potential targets. Unlike conventional image restoration models, LAD-Enhancer integrates detection labels as additional supervision during training to ensure that enhancement is detection-oriented rather than purely visual. Furthermore, a three-stage training strategy and a Mixture of Experts (MoE) framework are employed to adaptively classify and process images captured under different degradation conditions. Experimental results demonstrate that, with an increase of fewer than 3K parameters, the proposed LAD-Enhancer significantly improves detection performance under adverse weather conditions while maintaining almost unchanged performance on clear-weather images.

11:15am - 11:30am

A Collaborative Detection Method of Small Unmanned Aerial Vehicle Target via Multi-modal Feature Fusion in Complex Background

Wen Jiang, Keyi Zhang, Yanping Wang, Yun Lin, Fukun Bi

North China University of Technology, Beijing, People's Republic of China

Currently, the state-of-the-art methods for detecting small unmanned aerial vehicles (UAVs) continue to struggle in complex urban settings due to several persistent challenges, namely, frequent target occlusion, high similarity in thermal radiation signatures between UAVs and their surroundings, and the inherently low visual saliency of small UAV targets, all of which contribute to degraded detection performance. To tackle these issues, this paper introduces a novel multi-modal feature fusion collaborative detection (MFFCD) framework grounded in learnable spatial mapping. The architecture consists of three key components: firstly, a multi-branch parallel feature extraction module (MBPFE) that simultaneously processes infrared, visible, and radar range-azimuth images, complemented by a feature fusion module (FFM) designed to enhance both intra-modal and inter-modal feature interactions; then, an adaptive spatially-aware dynamic detection head module (DDH) that dynamically recalibrates feature weights to strengthen target representation and boost detection accuracy; and a feature collaborative enhancement module (FCE) that employs a learnable affine transformation to align and fuse multi-modal features, thereby producing more robust and reliable detection outcomes. Extensive experiments show that the proposed MFFCD framework substantially outperforms existing methods under challenging urban conditions, achieving a 56.89% gain in Mean Average Precision (mAP) for small UAV detection.

11:30am - 11:45am

Infrared-Visible Image Fusion Method Based on Differential Feature Enhancement and Cross-Modal Attention

Huang Zhang¹, Lina Xu¹, Qing Zhou¹, Tiyou Zhou², Siyu Liu¹, Xincai Chang¹, Hao LI¹

¹Hubei Subsurface Multi-scale Imaging Key Laboratory, School of Geophysics and Geomatics, China University of Geosciences, Wuhan, 430074, China; ²State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, 430079, China

Infrared and visible remote sensing image fusion is crucial for improving scene perception in complex environments, but existing autoencoder-based methods suffer from insufficient information interaction between modalities, inadequate deep feature fusion, and ineffective loss functions in extreme scenarios. To address these issues, this study proposes a Differential Feature Enhancement and Cross-modal Fusion (DFECF) method. The DFECF adopts an end-to-end architecture consisting of dual-stream encoders, cross-modal fusion modules, Transformer global perception modules, and decoders. Specifically, the Differential Enhancement (DE) module extracts differential information between infrared and visible features, combined with spatial and channel attention to enhance feature representation. The cross-modal fusion module adaptively integrates deep features based on channel attention, adjusting feature weights according to scene characteristics. The Transformer module supplements the global receptive field to capture long-range feature dependencies, and a joint loss function is designed to optimize fusion performance. Experimental results on public datasets show that the proposed method outperforms existing state-of-the-art methods in both subjective visual effects and objective evaluation metrics, especially in extreme environments such as strong light and thick smoke. It effectively improves the integrity of scene perception and provides high-quality data support for practical applications such as forest fire prevention, mining area monitoring, and autonomous driving.

10:30am - 12:00pm

ThS27: From Photogrammetry, Remote Sensing, and AI to Climate Action
Location: 713B

10:30am - 10:45am

Google Earth Engine Apps – a novel method for highlighting the role of satellite-derived bathymetry (SDB) to non-specialists and citizens – a case study for Irish bays

Margarida Victor¹, Xavier Monteys², Gema Casal³, Isaac Obour Mensah¹, Conor Cahalane¹

¹Department of Geography, Maynooth University, Co. Kildare, Ireland; ²Geological Survey of Ireland, Dept. of the Environment, Climate and Communications, Blackrock, Dublin, Ireland.; ³Oceanographic Centre of A Coruña, IEO-CSIC, Spain

This research addresses the need for accurate updates to the seabed datasets in coastal areas under environmental and human pressure. It uses Google Earth Engine (GEE) to develop a cloud-based application for Satellite-Derived Bathymetry (SDB) of the Irish bays using Sentinel-2 and Landsat-8 imagery. For the validation, the OPW Pilot Coastal Monitoring and INFOMAR datasets were used. The research refines semi-empirical algorithms and introduces an Earth Engine App (EEA) using the JavaScript API specifically tailored for and non-specialist public use. The methodology employed included pre-selecting high-quality satellite images based on the higher R-squared and lower RMSE to ensure reliability and better performance. In the initial phase, 18 bays were assessed, and the results showed that five bays (Dublin, Dungarvan, Portrane, Rosses, and Tramore) performed better across the evaluated metrics. he development and use of this application support a wide range of marine applications, especially for capacity building, as part of the pilot research led by Maynooth University and Geological Survey Ireland (GSI).

10:45am - 11:00am

High-resolution Arctic Wetland Methane Flux Modeling using a Geofoundational Deep Learning Model and Multispectral Satellite Data

Ali Radman¹, Masoud Mahdianpari^1,2, Fariba Mohammadimanesh³

¹Memorial University of Newfoundland, St. John's, Newfoundland, Canada; ²C-CORE, St. John’s, Newfoundland, Canada; ³Canada Centre for Remote Sensing, Natural Resources Canada, 580 Booth Street, Ottawa, Ontario K1A 0E4, Canada

Accurate estimation of methane fluxes across Arctic wetlands is essential for understanding carbon–climate feedbacks, yet remains difficult due to sparse ground measurements, strong spatial heterogeneity, and the coarse resolution of most existing bottom-up inventories. To address these limitations, we develop a high-resolution methane flux modeling framework that integrates multisensor Earth observation data with a geofoundational deep-learning approach. The study leverages 30 m Harmonized Landsat–Sentinel (HLS) imagery, together with environmental predictors from SMAP and ERA5, and daily eddy-covariance methane flux measurements from Arctic sites after 2015. Following data filtering and quality control, the dataset comprises more than 7,600 daily observations from 45 wetland sites across northern high latitudes. A hybrid model architecture is constructed by combining the Prithvi geospatial foundation model for HLS feature extraction with a lightweight feature-wise attention encoder processing 48 auxiliary environmental variables. Fused latent representations are used to predict daily methane flux at 30 m resolution. The model demonstrates strong performance on an independent test set, capturing key spatial and temporal patterns of methane emissions. By enabling fine-scale flux estimation far beyond the resolution of conventional 0.1°–0.5° inventories, the framework offers new opportunities for detailed Arctic methane monitoring and improved characterization of wetland-driven emissions.

11:00am - 11:15am

Automatic Levee Extraction along Rivers from High Resolution Terrain Models

Wilfried Karel¹, Carolina Damm¹, Martin Wenk², Markus Hollaus¹, Norbert Pfeifer¹

¹TU Wien, Department of Geodesy and Geoinformation, Austria; ²Federal Ministry of Agriculture and Forestry, Climate and Environmental Protection, Regions and Water Management, Austria

To plan nature restoration of fluvial corridors on a national level an inventory of existing man-made levees is mandatory. We suggest an automatic method for a river-wise extraction of levees from a high resolution terrain model based on profiles perpendicular to the river axis. In this course we present a method to cover corridors with non overlapping profiles with a given maximum distance. Levee detection is based on a mathematical formulation of the protective function of levees. In an evaluation of 150 km river length distributed over nine different rivers in Austria the method detected 98% of manually extracted levees, and 68% of their length.

11:15am - 11:30am

Urban Temperature Simulation for resilient City Planning based on a single high resolution Satellite Stereo Data Scene

Thomas Krauß¹, Pablo d'Angelo¹, Erwin Lintermeir¹, Chiara Richiardi², Markus Tum³

¹DLR - German Aerospace Center, Germany; ²ENEA Bologna Research Centre: Bologna, IT; ³RIWA GmbH

Temperatures in urban areas are rising due to the climate change. Together with increasing urbanization and densification reducing cooling green spaces in cities this leads to so called urban heat islands (UHI) with increased surface- and air-temperatures in urban areas relatively to the surrounding areas. Since high temperatures are the reason for many exceed deaths municipalities are forced to protect their citizens. Satellite earth observation allows to monitor the development of urban heat islands to warn inhabitants early from dangerous heat. An other important way is increasing the resilience of cities to heat waves. For this we developed a simple but efficient method for the simulation of urban surface- and air-temperatures from single very high resolution stereo satellite images. In this paper we present the improved workflow for the simulation of urban temperatures together with the calibration and validation. Further we compare the results to in-situ-measurements in the city of Memmingen in southern Germany, to LandSat thermal mapper imagery and existing works on urban heat islands. Additionally we show how modifying the digital twin e.g. by adding trees or water areas allow the simulation of different scenarios to support decision-makers on their path towards resilient cities.

11:30am - 11:45am

Assessment of bud flush and damage in young Norway Spruce trees through high-resolution multispectral UAV images

Louisa Eurich¹, Sara López Fernández², Malin Elfstrand², María Rosario García-Gil³, Jonas Bohlin¹, Eva Lindberg¹

¹Department of Forest Resource Management, SLU, Umeå, Sweden; ²Department of Forest Mycology and Plant Pathology, SLU, Uppsala, Sweden; ³Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, SLU

Scandinavia is facing climate change, with mean temperatures projected to rise by 2-4°C. To prepare Swedish forests for this challenge, the Swedish tree breeding program aims to selects trees adapted to a range of biotic and abiotic conditions. Key variables in this selection process include spring phenology, damage, and overall tree vitality.

Traditionally, these data have been collected through manual field assessments, a resource-intensive approach that constrains both the number of trees that can be evaluated and the frequency of measurements. Remote sensing offers an alternative: high-resolution multispectral drone imaging enables the scoring of greater numbers of trees in less time, captures multiple data points across the growing season, and reduces the risk of human error through algorithmic measurement.

This project aims to develop methods suitable for integration into the Swedish tree breeding program by using multispectral drone imagery to assess spring phenology, shoot damage, and vitality in young Norway Spruce. Field campaigns were conducted during spring 2023 and 2024. Bud flush is modeled from spectral values of tree crowns, using manual assessments of a subset of trees as training data. To capture the full progression of bud flush at high temporal resolution, images were acquired before the vegetation season and up to twice weekly during the period of most rapid development. The same modeling framework is applied to assess damage and vitality.

11:45am - 12:00pm

Decadal Evolution of the Nansen Ice Shelf, Antarctica, from Historical Aerial Photography and Landsat Imagery

Yujie Bian^1,2, Yuan Cheng^1,4, Philip Christopher Reid³

¹Key Laboratory of Silicate Cultural Relics Conservation, Shanghai University, Shanghai, China; ²School of Mechanics and Engineering Science, Shanghai University, Shanghai, China; ³The Marine Biological Association (MBA), The Laboratory Plymouth, UK; ⁴School of Cultural Heritage and Information Management, Shanghai University, Shanghai, China

Antarctic ice shelves regulate ice sheet mass balance through their "buttressing effect", with major implications for global sea level rise. This study focuses on the Nansen Ice Shelf in Victoria Land, East Antarctica, which exhibits complex topography and sensitivity to environmental changes. Previous research has primarily centered on its significant collapse event in 2016; however, systematic evolutionary patterns over longer timescales remain unclear. This study integrates multi-source remote sensing observations from 1948 to 2025 to systematically reconstruct changes in the Nansen Ice Shelf's geometric characteristics (crevasse width, area) and dynamic parameters (ice flow velocity). Findings reveal distinct activity differences between the northern and southern regions of the ice shelf, closely linked to their respective boundary conditions and structural features.

10:30am - 12:00pm

WG III/3B: Active Microwave Remote Sensing
Location: 713B

10:30am - 10:45am

Evaluating the potential and added value of interferometric coherence in flood mapping across various environments

Sulaiman Fayez Hotaki¹, Mahmud Haghshenas Haghighi¹, Mahdi Motagh^1,2

¹Institute of Photogrammetry and GeoInformation, Leibniz University Hannover, Germany; ²GFZ German Research Centre for Geosciences, Department of Geodesy, Section of Remote Sensing and Geoinformatics, Potsdam, Germany

Flood mapping is one of the most important applications of Synthetic Aperture Radar (SAR) because it can monitor the earth's surface under all-weather, day-and-night conditions. While SAR intensity has been widely used for flood mapping, the potential and added value of interferometric coherence, especially its temporal behavior in different environments, remains mostly unexplored. In this study, we assess the potential and added value of interferometric coherence from Sentinel-1 time series for flood mapping in three contrasting regions: the urban area of Valencia (Spain), the arid region of Sistan and Baluchestan (Iran), and the agricultural area of Hannover (Germany). Our analysis of multi-temporal coherence shows that coherence provides clear flood indicators in arid regions through strong temporal decorrelation, but its performance is less reliable in vegetated and urban areas. In agricultural regions, pre-flood (baseline) coherence is inherently low due to vegetation phenology and temporal decorrelation, making any additional decrease due to flood inundation often indistinguishable. In urban areas, coherence generally remains stable, with only slight decreases observed in specific cases; therefore, the detectability of flooded areas using coherence-based approaches is limited in both agricultural and urban environments. In contrast, coherence in arid regions is high before flooding and drops significantly during flood events, making floods easy to detect in such regions. These findings demonstrate that, for flood mapping, interferometric coherence is a valuable but environment-dependent indicator, with the highest benefit seen in arid regions where intensity-based methods are limited.

10:45am - 11:00am

Leveraging Polarized Ku- and C-band Radar Backscatter Time Series for Sea Ice Thickness Prediction using Random Forest

Mehran Dadjoo¹, Dustin Isleifson²

¹Centre for Earth Observation Science (CEOS), University of Manitoba, Canada; ²Department of Electrical & Computer Engineering, Centre for Earth Observation Science (CEOS), University of Manitoba, Canada

Arctic sea ice thickness has been declining over recent decades due to climate change, making accurate prediction increasingly critical for environmental monitoring and climate modeling. Microwave remote sensing combined with machine learning has emerged as a promising approach for estimating sea ice thickness. This study investigates the prediction of lab-grown sea ice thickness, ranging from 27 to 47 cm, using time-series backscatter data collected from surface-based Ku- and C-band scatterometers in three polarizations (VV, HH, and HV). A Random Forest model was applied to the time series, incorporating Normalized Radar Cross-Section (NRCS) values and statistical features (mean and standard deviation) across various temporal variables (lead and lag times). The model achieved high prediction accuracy, with the lowest error recorded at RMSE = 0.03 cm. Feature importance analysis using the Permutation Importance method revealed that co-polarized C-band features (C-VV and C-HH) were the most influential in predicting sea ice thickness. These findings underscore the potential of integrating microwave remote sensing with Random Forest models to enhance sea ice thickness prediction and provide valuable insights for future research and real-time monitoring in Arctic regions.

11:00am - 11:15am

Flood Depth Mapping from SAR Imagery Using CS-Mamba with DEM Sensitivity Analysis

Zhongyuan Yang¹, Wei Yuan¹, Weihang Ran², Changqing Liu¹, Bruno Adriano¹, Ryosuke Shibasaki³, Shunichi Koshimura¹

¹Tohoku University, Japan; ²The University of Tokyo; ³Reitaku University

Operational flood monitoring demands both accurate extent delineation and quantitative depth estimation, yet existing research addresses these objectives separately. This study presents an integrated SAR-to-depth framework combining state space model segmentation with DEM-based geometric depth estimation to deliver comprehensive flood intelligence from Sentinel-1 SAR imagery and digital elevation models. We propose CS-Mamba, a hierarchical U-Net architecture incorporating selective state space

mechanisms, achieving 79.79% mean IoU on 10 European flood events from the KuroSiwo benchmark while surpassing CNN baselines and outperforming RSMamba by 7.37 percentage points. Test performance exceeding validation confirms robust cross-event generalization to unseen disasters. Controlled experiments establish that deep learning predictions achieve sufficient accuracy for operational depth estimation, with CS-Mamba flood masks showing ±2% agreement with reference annotations across four global DEMs despite conservative extent delineation. This agreement enables integrated pipelines without manual annotation, while systematic DEM comparison identifies Copernicus and MERIT as optimal choices. The complete framework delivers three-class flood masks and pixel-wise depth maps at operational resolution, bridging the traditional gap between extent mapping and quantitative assessment for emergency response.

11:15am - 11:30am

Temporal variation-guided self-supervised PolSAR despeckling network

Shaowei Shi¹, Jie Li^1,2, Liupeng Lin³, Qiangqiang Yuan¹, Huanfeng Shen³

¹School of Geodesy and Geomatics, Wuhan University, Wuhan, China; ²Hubei Luojia Laboratory, Wuhan, China; ³School of Resource and Environmental Sciences, Wuhan University, Wuhan, China

This contribution introduces TGSD-Net, a temporal variation-guided self-supervised network designed to improve despeckling of polarimetric SAR (PolSAR) imagery without the need for clean reference data. The method leverages consecutive multi-temporal observations to create pseudo training pairs and incorporates a lightweight temporal change detection prior, allowing the network to exploit temporal redundancy while remaining robust to land-cover variations. TGSD-Net further integrates auxiliary polarimetric decomposition features and a spatiotemporal information fusion module to enhance structural and scattering representations. The approach is tailored for multi-temporal SAR scenarios, where speckle, temporal variation, and heterogeneous land-cover types pose significant challenges. Experiments on real PolSAR datasets show that TGSD-Net achieves strong noise suppression while preserving edges, textures, and physical scattering properties. The results demonstrate the potential of self-supervised temporal learning to advance PolSAR image restoration and support downstream remote sensing applications.

11:30am - 11:45am

A Novel Approach for Data Fusion of SAR (EOS-4) and Optical Multispectral (Sentinel-2) Data

Nidhi Chaubey, Srivally M.V., Sumit Pandey, Asha Rani B., Neeraj Mishra

Advance Data Processing Research Institute, Department of Space, India

Current Remote Sensing applications demand multi-source, multi-sensor data fusion. Multi-source, multi-sensor data fusion provides useful information integrated for quick and better interpretation, understanding and effective decision-making. Data fusion of Synthetic Aperture Radar (SAR) data of Earth Observation Satellite-04 (EOS-04) and Optical Multispectral (MX) data of Sentinel-2 are current topic of interest in this paper. SAR and Optical MX which includes active and passive remote sensing technologies belong to different mechanisms of wave interaction due to widely separated and non-overlapping regions of the electromagnetic spectrum. In this paper, a novel approach to the re-implementation of Wavelet, Brovey, Fast Intensity Hue Saturation (FIHS), Frequency filtering, and Pure pixel data fusion methods is presented. The presented novel approach emphasises modulation-based fusion technique with proper normalization and scaling of both the input datasets. Fusion results of presented fusion methods are evaluated visually as well as quantitatively with quality metrics. The quality metrics demonstrate the ability of the presented novel approach to fuse optical spectral information into SAR data effectively to generate improved high-resolution SAR-coloured fused products.

10:30am - 12:00pm

ThS15: Data-Centric Learning for Geospatial Data
Location: 714A

10:30am - 10:45am

The Potential of Copernicus Satellites for Disaster Response: Retrieving Building Damage from Sentinel-1 and Sentinel-2

Olivier Dietrich¹, Merlin Alfredsson¹, Emilia Arens², Nando Metzger¹, Torben Peters¹, Linus Scheibenreif¹, Jan Dirk Wegner², Konrad Schindler¹

¹ETH Zurich, Switzerland; ²University of Zurich, Switzerland

Natural disasters demand rapid damage assessment to guide humanitarian response. Here, we investigate whether medium-resolution Earth observation images from the Copernicus program can support building damage assessment, complementing very-high resolution imagery with often limited availability. We introduce xBD-S12, a dataset of 10,315 pre- and post-disaster image pairs from both Sentinel-1 and Sentinel-2, spatially and temporally aligned with the established xBD benchmark. In a series of experiments, we demonstrate that building damage can be detected and mapped rather well in many disaster scenarios, despite the moderate 10m ground sampling distance. We also find that, for damage mapping at that resolution, architectural sophistication does not seem to bring much advantage: more complex model architectures tend to struggle with generalization to unseen disasters, and geospatial foundation models bring little practical benefit. Our results suggest that Copernicus images are a viable data source for rapid, wide-area damage assessment and could play an important role alongside VHR imagery. We release the xBD-S12 dataset, code, and trained models to support further research.

10:45am - 11:00am

From Text to Map: AI-Based Graphic Translation of Information

Francesca Biolo, Franco Guzzetti, Isabella C.R. Balestreri

Politecnico di Milano, Department of Architecture, Built Environment and Construction Engineering, 20133 Milan, Italy

In recent years, technological advancements, particularly in artificial intelligence (AI), are changing various fields and spurring new research. This study focuses on the use of AI in cartography and historical studies. It is part of the PRIN project "Crafted in Stone / Recorded on Paper," which aims to document the heritage of small Italian municipalities by creating an open-access database. The research discovered significant documents in Gandino, Italy, including a large-scale map and a 139-page textual register from the mid-eighteenth century. These documents come from land surveyors who measured municipal boundaries and properties using physical landscape markers. The original surveying method, although lost, shares similarities with modern land descriptions.

The study seeks to generate new maps from these textual registers using AI capabilities, aiming to replicate a historical mapping effort from the 1700s. Initial tests with an AI model involved reading the register, computing measurements, and creating coordinate tables. The results showed promise despite some inaccuracies. The goal is to develop an interdisciplinary method that graphically reconstructs information from written documents, enhancing access for historical and territorial analysis. The research will also explore further AI models and larger case studies to achieve this aim.

11:00am - 11:15am

From Pixels to Semantics: Can a Single Instruction-Tuned VLM Unify Geospatial Building Analysis?

Guneet Mutreja¹, Harisankar Harikumar², Chaikal Amrullah¹, Ksenia Bittner¹

¹Remote Sensing Technology Institute (IMF), German Aerospace Center (DLR); ²Karlsruhe Institute of Technology

The analysis of buildings from aerial imagery is a fundamental task for urban planning and disaster response, yet it traditionally requires a suite of specialized models for tasks like segmentation, detection, and semantic querying. The advent of generalist Vision-Language Models (VLMs) offers a new paradigm, but their adaptation to the specific, high-resolution remote sensing domain remains a significant challenge. This paper proposes and investigates a novel methodology for adapting a general-purpose VLM,

Google’s PALIGEMMA2, to function as a unified geospatial building analyzer. The core of this contribution is a data-centric pipeline that converts single-modality annotations (building polygons) into a rich, multi-task instruction-tuning dataset (16,500 samples) spanning segmentation, detection, Visual Question Answering (VQA), and captioning. A rigorous study is conducted to answer three critical questions: (1) Can a single instruction-tuned VLM outperform specialized models in a multi-task setting? (2) What are the synergistic benefits of multi-task learning? (3) How data-efficient is this adaptation process? The results demonstrate that the unified model significantly outperforms the zero-shot PaliGemma2 baseline and strong single-task fine-tuned variants on three out of four tasks, while remaining competitive on the fourth. A strong synergistic effect is found: multi-task training on both visual localization and semantic tasks improves performance on individual localization tasks. Furthermore, the analysis shows that high performance can be achieved with a surprisingly small instruction dataset. This work provides a complete methodology for efficiently adapting VLMs to multi-task geospatial analysis, suggesting a new path towards generalist models in remote sensing.

11:15am - 11:30am

Geolocation-aware pretraining strategies for globally applicable remote sensing foundation models

Mojgan Madadikhaljan, Jonathan Prexl, Michael Schmitt

University of the Bundeswehr Munich, Germany

Foundation models have achieved remarkable success across various domains due to their ability to learn generalizable representations from large-scale, unlabeled datasets. In the geospatial domain, several foundation models have been developed to leverage the abundance of unlabeled remote sensing data and support Earth observation tasks across diverse regions and sensor types. However, the geolocation-dependent characteristics of remote sensing data introduce unique challenges in adapting these models to region-focused applications. By conducting a comprehensive empirical analysis across diverse geographical regions and tasks, we explore whether incorporating regional information during pretraining or fine-tuning improves performance on region-specific downstream tasks. We show that regional representation learning, as well as regional adaptation of features extracted from a globally trained foundation model, is beneficial when the region-specific performance of the downstream tasks is of interest. To this end, we also propose a regional adaptation to the globally trained foundation models to balance global diversity with regional representation learning for improved performance.

11:30am - 11:45am

An assessment of data-centric methods for label noise identification in remote sensing data sets

Felix Kröber^1,2, Genc Hoxha², Ribana Roscher²

¹Forschungszentrum Juelich GmbH, Germany; ²University of Bonn, Germany

Label noise in the sense of incorrect labels is present in many real-world data sets and is known to severely limit the generalizability of deep learning models. In the field of remote sensing, however, automated treatment of label noise in data sets has received little attention to date. In particular, there is a lack of systematic analysis of the performance of data-centric methods that not only cope with label noise but also explicitly identify and isolate noisy labels. In this paper, we examine three such methods and evaluate their behavior under different label noise assumptions. To do this, we inject different types of label noise with noise levels ranging from 10 to 70% into two benchmark data sets, followed by an analysis of how well the selected methods filter the label noise and how this affects task performances. With our analyses, we clearly prove the value of data-centric methods for both parts – label noise identification and task performance improvements. Our analyses provide insights into which method is the best choice depending on the setting and objective. Finally, we show in which areas there is still a need for research in the transfer of data-centric label noise methods to remote sensing data. As such, our work is a step forward in bridging the methodological establishment of data-centric label noise methods and their usage in practical settings in the remote sensing domain.

11:45am - 12:00pm

Automatic Extraction and Multi-Class Instance Segmentation of Rural Road Networks from Orthoimagery using YOLOv11 and SAHI Sliced Inference for Cadastral Update

Marsia Sanità¹, Lindo Nepi², Eva Savina Malinverni¹, Adriano Mancini², Roberto Pierdicca¹, Artur Warchoł³, Monika Balawejder⁴

¹Dept. of Civil, Building and Architecture, Marche Polytechnic University, 60131 Ancona, Italy; ²Department of Information Engineering (DII), Marche Polytechnic University, 60131 Ancona, Italy; ³Kielce University of Technology – Kielce, Poland; ⁴PANS State University of Applied Sciences in Jaroslaw, Poland

Extracting road networks from high-resolution imagery remains a significant challenge in geomatics, particularly in fragmented rural landscapes. The big difficulty is the spectral similarities between unpaved tracks and agricultural backgrounds that can lead to classification errors. This study proposes an automated geospatial pipeline based on the YOLOv11 architecture. Specifically, the approach is made on the optimization of the multi-class road detection in the rural areas of Kosina and Markowa, two villages in Poland. To reduce the computational effort, due to large-scale 9000x9000 px orthophotos and to improve the detection of small-scale features, Slicing Aided Hyper Inference (SAHI) strategy was integrated. High-resolution imagery has been decomposed into optimized tiles, ensuring feature continuity across boundaries and preventing GPU memory overhead. The instance segmentation model was trained on a custom-annotated dataset, with seven labels (categories) such as internal paved roads, rural tracks, and railway infrastructures. Therefore, a high level of robustness has been achieved reaching a mean Average Precision value (mAP@0.5) of 0.90. A confusion matrix reveals quantitatively that the pipeline effectively distinguishes between complex classes and low omission rates. As a result, the generated outputs are converted into interoperable GeoJSON format ensuring their integration into GIS environments. In conclusion, the experimental result demonstrates that the framework is valuable for emergency response logistics and urban planning. It offers a scalable and near real-time solution for updating national topographic databases.

10:30am - 12:00pm

WG III/8F: Remote Sensing for Agricultural and Natural Ecosystems
Location: 714B

10:30am - 10:45am

Evaluating the Transferability of Machine-Learning Models for Pre-Emergence Bark Beetle Detection Using Multispectral and Hyperspectral UAV Data Across Europe

Wang Geng¹, Aurora Bozzini², Luiz Cosimo¹, Salma Bijou⁴, Raquel Oliveira³, Juha Soumalainen³, Erika Vennervirta³, Niko Koivumäki³, Roope Näsi³, Lucie Kupková⁴, Massimo Faccoli², Eija Honkavaara³, Langning Huo¹

¹Department of Forest Resource Management, Swedish University of Agricultural Sciences, 90 183, Umeå, Sweden; ²Department of Agronomy Food Natural Resources Animals and Environment, University of Padua, 35020, Legnaro (Padova), Italy; ³Department of Remote Sensing and Photogrammetry, Finnish Geospatial Research Institute FGI, 00521 Helsinki, Finland; ⁴Department of Applied Geoinformatics and Cartography, Faculty of Science, Charles University, Albertov 6, Prague 2, Czech Republic

Outbreaks of the European spruce bark beetle (Ips typographus) have intensified across Central and Northern Europe due to droughts, storms, and other extreme climatic events. Resulting Norway spruce mortality has reduced growing stock and weakened forest carbon uptake, creating an urgent need for rapid, operational tools for early detection. Pre-emergence detection, i.e. identifying infested trees before brood emergence, is particularly valuable, yet field surveys remain too slow and costly at large scales. UAV-based optical remote sensing offers high-resolution, flexible, and timely mapping at the single-tree level, allowing detailed observation of spectral changes soon after attack.

Despite many recent UAV studies, the reliability and transferability of pre-emergence detection remain unclear. Differences in sensor types (multispectral vs. hyperspectral), band configurations—especially in the red-edge and green-shoulder regions—and analytical approaches have produced inconsistent results. Many models are developed within single sites and often lack standardized accuracy metrics or cross-site validation, limiting insights into robustness under varying ecological and climatic conditions.

To address this, we compiled six UAV datasets from four major outbreak regions—southern Sweden, southern Finland, the southeast Alps in Italy, and Czechia—covering multispectral and hyperspectral campaigns at the single-tree level. Using these harmonized data, we compare machine-learning models for classifying tree health based on spectral features and vegetation indices. A central focus is transferability. We test models across regions using cross-regional, joint, and leave-one-region-out schemes to quantify generalization across contrasting climates, outbreak phases, and stand structures. The results reveal consistently informative spectral regions and modelling strategies, offering practical guidance for operational early-warning systems.

10:45am - 11:00am

Country-wide, high-resolution monitoring of forest browning with Sentinel-2

Samantha Biegel^1,2, David Brüggemann⁴, Francesco Grossi³, Michele Volpi⁴, Konrad Schindler^1,2, Benjamin D. Stocker^5,6

¹Photogrammetry and Remote Sensing, ETH Zürich; ²ETH AI Center, ETH Zürich; ³Forest and Soil Ecology, Swiss Federal Institute for Forest, Snow and Landscape Research WSL; ⁴Swiss Data Science Center, ETH Zürich and EPFL; ⁵Institute of Geography, University of Bern; ⁶Oeschger Centre for Climate Change Research, University of Bern

Natural and anthropogenic disturbances are impacting the health of forests worldwide. Monitoring forest disturbances at scale is important to inform conservation efforts. Here, we present a scalable approach for country-wide mapping of forest greenness anomalies at the 10 m resolution of Sentinel-2. Using relevant ecological and topographical context and an established representation of the vegetation cycle, we learn a predictive quantile model of the normalised differential vegetation index (NDVI) derived from Sentinel-2 data. The resulting expected seasonal cycles are used to detect NDVI anomalies across Switzerland between April 2017 and August 2025. Goodness-of-fit evaluations show that the conditional model explains 65% of the observed variations in the median seasonal cycle. The model benefits most from the local context information during the green-up period. The approach produces coherent spatial anomaly patterns and enables country-wide quantification of forest browning. Case studies with independent reference data from known events illustrate that the model reliably detects different types of disturbances.

11:00am - 11:15am

Evaluating the Potential of yearly Sentinel-1 Composites for Bark Beetle Infestation Detection

Sebastian Spreitzer¹, Georg Wohlfahrt², Martin Rutzinger¹

¹Department of Geography, University of Innsbruck, Austria; ²Department of Ecology, University of Innsbruck, Austria

The exponential spread of the bark beetle (Ips typographus L.) outbreaks across Europe in recent years has led to heightened interest in remote sensing-based detection. This increase is closely linked with ongoing climate change, which has led to rising temperatures, prolonged dry periods, and increasing frequency and intensity of both biotic and abiotic disturbances. These conditions created a favourable environment for bark beetle proliferation, resulting in larger and more widespread infestations. Effective detection and management of these outbreaks is crucial for forest officals, necessitating the implementation of monitoring systems that complement traditional ground-based efforts. At present, remote sensing approaches for bark beetle detection mainly rely on optical data to identify changes in spectral reflectance of vegetation. In this study, we utilised annual Sentinel-1 synthetic aperture radar (SAR) composites from 2021 to 2023 for infestation detection. A Random Forest classification model was applied to distinguish between healthy and infested forest areas. Additionally, vegetation indices were incorporated to evaluate and compare the results. A reference dataset was used to validate model performance. Our results show that the Sentinel-1 based approach achieved lower accuracies (max. overall accuracy: 0.78), compared to Sentinel-2 (max. overall accuracy: 0.87). Despite this, the Sentinel-1 data proved valuable as a tool for bark beetle infestations detection, especially in scenarios where optical data may be unavailable or limited. These results underscore the importance of integrating SAR data into remote sensing workflows to improve the detection of bark beetle outbreaks.

11:15am - 11:30am

Integrating green-shoulder indices from hyperspectral drone imagery and sap flow monitoring to assess water dynamics in healthy and bark beetle-infested trees

Luiz Henrique Elias Cosimo¹, Jose Gutierrez Lopez², Eva Lindberg¹, Zuosinan Chen³, Langning Huo¹

¹Department of Forest Resource Management, Swedish University of Agricultural Sciences; ²Department of Forest Ecology and Management, Swedish University of Agricultural Sciences; ³Department of Water, Energy and Environmental Engineering, University of Oulu

Forest ecosystems are increasingly threatened by biotic and abiotic disturbances that are intensifying under a changing climate. Accurate detection of tree stress is essential for effective forest management, as stress strongly increases vulnerability to damaging agents such as pests, pathogens, and fire. Tree water functioning is a key indicator of physiological status, yet traditional field-based methods for monitoring water transport – such as sap flow measurements – require costly instrumentation and can only be applied to a limited number of trees.

Hyperspectral remote sensing offers a powerful means to upscale forest health monitoring, but its effectiveness depends on robust spectral indicators that reliably reflect physiological change. Green-Shoulder Indices (GSI), which leverage reflectance features in the 490–560 nm region linked to carotenoid dynamics, have been previously used to monitor tree health. Because carotenoids are closely tied to photosynthetic regulation, stress responses, and canopy vitality, GSI have emerged as promising indicators of health decline. Notably, they have shown strong performance in detecting Norway spruce trees in the early stages of bark beetle infestation.

This study investigates how GSI can be further strengthened as indicators of forest hydraulic functioning by integrating hyperspectral drone imagery with continuous sap flow monitoring. By linking canopy spectral responses to internal water transport dynamics, we aim to advance GSI as operational tools for large-scale forest health surveillance and disturbance detection.

11:30am - 11:45am

A Green Shoulder Index to estimate carotenoid content verified by the radiative transfer model FluSAIL and real-world data

Langning Huo

Swedish University of Agricultural Sciences, Department of Forest Resource Management, 90183 Umea, Sweden.

Carotenoids regulate photoprotection and respond early to stress, but their retrieval from canopy reflectance is often unstable because green-band signals are confounded by canopy structure, illumination/view geometry, and covariance with chlorophyll. This study proposes and evaluates the sensitivity of green-shoulder indices (derived from 490–550 nm bands) to carotenoid content in vegetation. We use FluSAIL simulations to generate canopy reflectance under wide-ranging biochemical and structural conditions and benchmark multiple green-region indices (490–560 nm, including PRI-type formulations) for their sensitivity and stability to carotenoids. We then transfer the best-performing index–carotenoid relationship to independent real-world datasets with pigment measurements at both leaf and canopy scales (ANGERS, LOTUS, CABO) to test generalization beyond the simulation domain. Results showed that a curvature-based green-shoulder index provided the most consistent carotenoid sensitivity, with the strongest and most stable VI–Car relationships across varying chlorophyll–carotenoid coupling, LAI, and sun–sensor conditions. Validation on measured spectra confirms that green-shoulder indices can predict carotenoid content with high accuracy and improved transferability compared with conventional green indices.

11:45am - 12:00pm

High-dimensional Detection of Landscape Dynamics 2.0: a Framework for Mapping Non-stand replacing Forest Disturbance using Sentinel-2 Time Series

Donato Morresi¹, Hyeyoung Maeng², Langning Huo¹

¹Swedish University of Agricultural Sciences, Department of Forest Resource Management, Skogsmarksgränd 17 901 83 Umeå, Sweden; ²Durham University, Department of Mathematical Sciences, Upper Mountjoy Campus, Stockton Road, Durham DH1 3LE, United Kingdom

Non-stand replacing (NSR) disturbances—low- to moderate-severity events causing single-tree mortality or canopy thinning—are driven by agents such as drought, insects, pathogens, low-intensity fire, wind, and snow. Their variable duration, frequency, and extent challenge detection using medium-resolution optical imagery because changes are spectrally subtle and spatially complex. We developed a framework to detect NSR disturbances in boreal forests on a sub-annual basis using Sentinel-2 (S2) time series. Key methods include the spectral normalisation of monthly cloud-free composites via weighted multidimensional medians (medoid and geometric median), as well as improvements to the sensitivity and robustness of the HILANDYN algorithm. Observation weights are based on spectral distance measures (Euclidean distance and Spectral Angle Mapper), normalised using an adaptive sigmoid function. Normalisation reduced seasonality patterns by 41.4%, leaving only 13.7% of the tested time series with a significant seasonal pattern. Validated on more than 10,000 points, the best F1 and F2 scores were 0.75 and 0.72, respectively, when using seven S2 variables. These metrics increased to 0.80 and 0.81, respectively, when including detections in the subsequent vegetative season. The geometric median outperformed the medoid, and the optimal spectral indices varied by agent, e.g., NBR for canopy removal, red-edge indices for wind and snow damage. While the framework effectively maps natural and anthropogenic NSR events, reducing detection lag at high latitudes remains a priority.

10:30am - 12:00pm

WG II/3H: 3D Scene Reconstruction for Modeling & Mapping
Location: 715B

10:30am - 10:45am

Accurate Point Measurement in 3DGS - A New Alternative to Traditional Stereoscopic-View Based Measurements

Deyan Deng¹, Rongjun Qin^1,2

¹Department of Civil, Environmental and Geodetic Engineering, The Ohio State University, Columbus, USA; ²Department of Electrical and Computer Engineering, The Ohio State University, Columbus, USA

3D Gaussian Splatting (3DGS) has revolutionized real-time rendering with state-of-the-art novel view synthesis, but its applicability to accurate geometric measurement remains limited. Compared with multi-view stereo (MVS)-based point clouds or mesh models, 3DGS provides superior visual quality and completeness, while existing measurement approaches still rely on stereoscopic workstations or direct measurements on incomplete and inaccurate reconstructed geometry.

As a novel view synthesizer, 3DGS reproduces source views and smoothly interpolates intermediate viewpoints, enabling users to intuitively identify congruent points across multiple views. By triangulating these correspondences, accurate 3D point measurements can be obtained. Inspired by traditional stereoscopic measurement, the proposed approach removes the need for stereo workstations and biological stereoscopic capability, while naturally supporting multi-view measurements for improved accuracy.

We implement a web-based application to demonstrate this proof of concept using UAV-based aerial datasets. Experimental results show that the proposed method achieves measurement accuracy comparable to or better than traditional stereoscopic measurement approaches while operating entirely on non-stereo workstations. In particular, the proposed method consistently outperforms direct mesh-based measurements, achieving RMSEs of 1–2 cm on well-defined points. On challenging thin structures, the proposed method reduces RMSE from 0.062 m to 0.037 m, and successfully measures sharp corners where mesh-based methods fail entirely.

The source code and documentation are open-source and available at: https://github.com/GDAOSU/3dgs_measurement_tool.

10:45am - 11:00am

Gaussian Texturing: Surface-Anchored 3D Gaussian Splatting for Metric-Accurate Heritage Preservatio

Ming Huang, Lei Wang

Beijing University of Civil Engineering and Architecture,

Traditional 3D Gaussian Splatting (3DGS) methods initialize Gaussian primitives from Structure-from-Motion point clouds, resulting in loosely distributed representations that lack geometric constraints and metric accuracy. This limitation severely restricts their application in architectural heritage preservation, where millimeter-level precision and practical editability are essential requirements.

This paper introduces Gaussian Texturing, a novel framework that fundamentally transforms how Gaussians relate to geometry by directly binding 3D Gaussian primitives to precisely measured mesh surfaces—essentially "texturing" surfaces with Gaussians. Our approach comprises three key innovations: (1) a constrained optimization framework that maintains tight Gaussian-surface coupling throughout training, preventing geometric drift while preserving photorealistic rendering quality; (2) engineering-oriented editing tools enabling geometry-based material replacement, region editing, and mesh-driven deformation; and (3) seamless integration with professional heritage preservation workflows.

Experimental validation on MipNeRF360 benchmarks and custom architectural datasets demonstrates that our method achieves millimeter-level geometric precision while maintaining competitive rendering metrics. Unlike traditional "bind-after-training" approaches, our direct surface binding paradigm eliminates intermediate reconstruction steps, ensuring accuracy from source data. Real-world applications in heritage documentation and architectural design confirm the method's practical value, successfully bridging the gap between photorealistic visualization and engineering-grade geometric accuracy for professional applications.

11:00am - 11:15am

Structured-Li-GS: Structured 3D Gaussians Splatting with LiDAR Incorporation and Spatial Constraints

Huaiyuan Weng, Huibin Li, Chul Min Yeum

University of Waterloo, Canada

In this study, we develop a Structured framework for Gaussian Splatting (3DGS) with LiDAR integration (Structured-Li-GS). It is a lightweight Gaussian Splatting pipeline that leverages LiDAR–inertial–visual SLAM. Structured-Li-GS achieves high-quality 3D reconstructions with fewer Gaussians by training on accurate, dense, colorized point clouds. Gaussian primitives are anchored using sub-sampled point clouds, and their ellipsoidal parameters are initialized from local surface geometry. Our training strategy integrates a comprehensive set of loss terms, including photometric, flattening, offset, depth, and normal losses—guided by the dense point cloud, enabling accurate reconstruction without Gaussian densification. This approach produces up-to-scale, high-fidelity results with a moderate model size. For experimental validation, we develop a custom hardware-synchronized LiDAR–camera handheld scanner. Experiments on both benchmark datasets and our real-world in-house dataset demonstrate that Structured-Li-GS surpasses state-of-the-art methods while using fewer Gaussians.

11:15am - 11:30am

Evaluating 3DGS for True Orthophoto Generation: Comparative Study with Photogrammetric Processes

Jangwoo Cheon¹, Impyeong Lee²

¹Innopam, Korea, Republic of (South Korea); ²University of Seoul, Korea, Republic of (South Korea)

True Digital Orthophoto Maps (TDOMs) are essential for urban analysis and map updating, traditionally generated through photogrammetric workflows involving aerial triangulation, DSM construction, and orthorectification. Recently, 3D Gaussian Splatting (3DGS) has emerged as an alternative approach using differentiable volumetric rendering. While both methods depend on acquisition geometry, they follow fundamentally different reconstruction processes, potentially producing distinct representational characteristics. Systematic comparisons under controlled conditions remain limited.

This study generates photogrammetric and 3DGS-based TDOMs from four UAV datasets acquired over the same area with varying resolution (2.51–5.8 cm GSD), image count, and oblique view proportion (0–75%). All datasets were preprocessed through common SfM to obtain identical inputs. We evaluate differences through inter-method agreement (PSNR, SSIM, LPIPS), detail preservation (gradient magnitude, high-frequency energy), and spatial distribution patterns (boundary–interior separation).

Results show 3DGS systematically smooths fine-scale texture with gradient ratios of 0.58–0.89 and high-frequency energy reductions of 2.5–55× relative to photogrammetry. Oblique view proportion emerges as the dominant divergence factor: oblique-dominant datasets show lowest agreement (PSNR 15.15) despite larger image counts, while nadir-only datasets achieve higher similarity (PSNR 26.73). Difference maps reveal 2–3 times higher discrepancies along boundaries than interiors. Visually cleaner 3DGS boundaries are byproducts of overall smoothing rather than superior reconstruction. These findings establish that the two methods are complementary—photogrammetry preserving texture fidelity and 3DGS providing structural regularity—with acquisition geometry critically influencing performance characteristics.

11:30am - 11:45am

Supercharging Thermal Gaussian Splatting with depth estimation

Manoj Biswanath¹, Chenxin Cai², Hannah Schieber³, Daniel Roth³, Benjamin Busam¹

¹Photogrammetry and Remote Sensing, Munich Center for Machine Learning (MCML), Technical University of Munich, Munich, Germany; ²Technical University of Munich, Munich, Germany; ³Human-Centered Computing and Extended Reality Lab, TUM University Hospital, Clinic for Orthopedics and Sports Orthopedics, Munich Institute of Robotics and Machine Intelligence (MIRMI), Technical University of Munich, Munich, Germany

Efficient and robust 3D scene representation is crucial in fields such as robotics, autonomous driving, and augmented reality. While RGBimagesprovidevaluable content for 3D reconstruction, other modalities like thermal or depth can enable additional information on the 3D environment. Lately, Novel View Synthesis (NVS) methods like Gaussian Splatting (GS) have started using multiple modalities to further boost their performance. But fusing or combining those multi-modal data can make the process slower and bring in additional challenges. Therefore, our project aims to use single modality based on thermal infrared domain, by removing the reliance on visible light, as much as possible. We propose a method Thermal-to-Depth Gaussian (TDg), that uses only thermal images and depth estimation in its architecture to derive the radiance fields. Mainstream methods relying heavily on RGB images, perform poorly in visually degraded environments, such as low-light conditions, fog, smoke, or extreme weather. Contrary to this, infrared cameras can detect objects’ inherent thermal radiation and provide a robust perception, suitable regardless of lighting and weather conditions. But despite their promise, thermal images are inherently characterized by low contrast, sparse texture, and non-uniform brightness distribution. So current approaches still rely heavily on paired RGB images for supervision or joint optimization, failing to establish a truly independent and purely thermal-based Gaussian representation system. Therefore, the core innovation of our work is to prepare a self contained Thermal GS framework that uses only thermal image inputs. We design a thermal-guided depth estimation module, Thermal-to-Depth (TDg), providing explicit and reliable constraints for geometric optimization.

10:30am - 12:00pm

WG IV/4: Data Management for Spatial Scenarios
Location: 716A

10:30am - 10:45am

Construction and Integration of Image Control Point, Interpretation Sample, and Spectral Information Databases for Megacity Management

Sitong Li, Xinhui Wang

Shanghai Surveying and Mapping Institute

With the rapid advancement of satellite, aerial, and UAV platforms, the daily volume of remote sensing data collected over megacities has grown exponentially. However, only a limited portion of this data can be transformed into usable products in time. Current production workflows remain lengthy and poorly automated, which fails to meet the increasing demand for high-precision and high-timeliness remote sensing products in city management, environmental monitoring, and emergency response.

To address this gap, this study proposes the construction of an standardized, efficient and reusable foundational database system consisting of three key components: image control point database, interpretation sample database, and spectral information database.

The image control point database establishes a unified geometric reference for multi-source data; The interpretation sample database provides large-scale, high-quality labeled data for deep learning-based image analysis; and the spectral database offers standardized spectral features for accurate classification and parameter inversion.

Together, the three databases form a collaborative mechanism that links geometric accuracy, semantic understanding, and spectral consistency, thereby building a complete chain from analysis-ready data (ARD) production to rapid information extraction. Using Shanghai as a case study, this paper presents the design, construction, and collaborate applications of the three databases, demonstrating their effectiveness in supporting refined and sustainable megacity governance.

10:45am - 11:00am

Fireguard: A Real-Time Wildfire Monitoring and Risk Assessment System Using Unmanned Aerial Systems and Multi-Sensor Fusion

Hannah Kemper, Gerhard Kemper

GGS GmbH Speyer, Germany

Disaster Risk Management benefits from innovative techniques including AI and Multi Sensor Fusion. The Fireguard Approach uses such technologies to improve the Wildfire Management works in Saxony, Eastern Germany by supporting standing efforts in Early Warning, Disaster Response and Monitoring. Unmanned Aerial Systems (UAS) play a vital role in providing real-time information via a 5G network to a central information management system that delivers geospatial information to response teams. This study highlights the potential of combining UAS, AI, geospatial solutions and existing data for real-time wildfire monitoring and risk assessment systems. The preliminary study successfully shows the potential of the provided solution to enhance Wildfire early detection, response and monitoring to address immediate and long-term needs of response teams.

11:00am - 11:15am

A Multi-Agent Geospatial Model for Semantic and Spatial Querying

Amirhossein Nourbakhshrezaei, Saeed Abbasi, Mojgan Jadidi

Department of Civil Engineering, Lassonde School of Engineering, York Univeristy, Canada

This paper presents a multi-agent geospatial application that enables users to interact with spatial data through natural language, called MapEcho Copilot. The system integrates large language model (LLM) reasoning, semantic embedding search, and spatial analytics within a unified architecture. A vector embedding database is constructed to index diverse open-source geospatial datasets. Upon receiving a user query such as “show all tennis courts in downtown of Toronto” or “find habitats of grizzly bears in Canada” the system performs semantic retrieval to identify relevant datasets, followed by geospatial filtering and reasoning through specialized agents via an interactive and friendly interface. The multi-agent framework coordinates between semantic understanding, data retrieval, and spatial computation layers to deliver map-based responses in real time. The Results demonstrate the system’s ability to process both semantic and geospatial queries with high accuracy and interpretability, providing an intuitive bridge between natural language and spatial intelligence.

11:15am - 11:30am

Point Cloud Data Management for Cross-Domain Applications

Kwasi Nyarko Poku-Agyemang, Balthasar Teuscher, Martin Werner

Technical University Munich, Germany

Point clouds have proven over the years to be a suitable spatial representation of scenes and objects at varying scales and levels of complexity, making them widely used across several scientific domains and applications. Advancements in sensor technology, computer vision, and data science have produced high‑resolution point clouds and advanced analytical approaches, leading to broader adoption for spatial information extraction to support decision making. However, traditional point cloud management systems for organizing and distributing data throughout the point cloud lifecycle often create significant duplication at each stage. This causes data fragmentation as multiple copies and versions are scattered across different processing steps, workgroups, and storage locations, further limiting cross‑domain applications.

In this paper, we propose a unified point cloud data management (PCDM) approach that supports the principles of findability, accessibility, interoperability, and reusability (FAIR) across domains at scale. The proposed approach aims to support diverse point cloud retrieval for cross-domain analysis by leveraging a single, reusable PCDM system built on a shared data model. Our approach improves on existing frameworks and provides a foundation for point cloud data management and data spaces.

11:30am - 11:45am

Mathematical Modeling of Confidence Ellipses and Computational Validation of their Implementation in the LFTools Plugin: A Case Study Using GWDBrazil

Leandro Franca, Tiago Silvano, Simone Sato, Savia Gavazza

Federal University of Pernambuco (UFPE), Brazil

This contribution presents a rigorous mathematical and computational examination of confidence ellipses applied to bivariate spatial distributions, with a specific focus on their implementation in the open-source LFTools plugin for QGIS. Confidence ellipses are widely used in geography, environmental sciences, public health, criminology, and spatial statistics to summarize central tendency, dispersion, and directional trends of point-based datasets. Although conceptually well established, their practical reliability depends on correct numerical implementation and statistical consistency—an aspect rarely evaluated in detail.

The study first revisits the formal mathematical foundations of confidence ellipses, including covariance-matrix geometry, eigen-decomposition, and Chi-Square-based scaling for different confidence levels. It then analyses the computational workflow adopted in LFTools and validates its correctness using 100,000 simulated Gaussian random points, demonstrating near-perfect adherence (<0.05% deviation) to theoretical confidence intervals.

To assess performance on real-world data, the method is applied to the Groundwater Well Database for Brazil (GWDBrazil), comprising more than 350,000 groundwater wells. Confidence ellipses at the national and regional levels reveal strong anisotropy, clustered patterns, and non-Gaussian spatial structures, confirming both the robustness of the tool and the complexity inherent to real geospatial phenomena.

Results indicate that the LFTools implementation is mathematically sound, statistically reliable, and suitable for scientific applications. The study highlights the relevance of reproducible open-source tools and outlines future directions involving spatial–temporal extensions, non-parametric approaches, and multi-scale territorial analysis, applicable in Brazil and worldwide.

11:45am - 12:00pm

Consumer's risk in zero-defect sampling inspection of surveying and mapping products

Fujun Luo^1,2, Haitao Zhao^1,2, Jinghu Liu^1,2, Xiaodi Wang^1,2, Chunxi Chen^1,2

¹National Quality Inspection and Testing Center for Surveying and Mapping Products, China, People's Republic of; ²Technology Innovation Center for Remote Sensing Intelligent Verification, Ministry of Natural Resources, Beijing, China

Through theoretical analysis and empirical research, this study thoroughly examines the theoretical foundations and practical applications of zero-defect sampling inspection schemes, revealing significant differences between inspecting large lots as a whole versus splitting them into sub-lots in terms of consumer's risk control. The findings indicate that although the zero-defect sampling scheme (Ac=0) adopted in the GB/T 24356-2023 standard shifts quality control from "post-production spot checks" toward "in-process prevention", it exhibits notable deficiencies in controlling consumer's risk, resulting in an unacceptably high level of risk for consumers. Empirical analysis demonstrates that, for large lots with relatively poor quality, e.g., when the product's defect rate is 10%, the inspection plan (100, 10, 0) still carries a 33.3% probability of erroneously accepting the lot, which significantly exceeds the risk level typically acceptable to consumers and thus imposes excessive quality risk on them. Furthermore, the study reveals that inspecting small lots or subdividing large lots benefits producers, highlighting an imbalance in the current standard's risk allocation mechanism. These insights provide more reliable theoretical support and practical guidance for quality management of surveying and mapping products.

12:00pm - 1:30pm

Closing Ceremony
Location: Exhibition Hall "G"

Awards Ceremony:

The President's Honorary Citation
ISPRS Certificate of Recognition of Service
Congress Best Reviewer Award
ISPRS Best Poster Award
The ISPRS CATCON Award