Conference Agenda
Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).
|
Agenda Overview |
| Session | ||
WG II/3A: 3D Scene Reconstruction for Modeling & Mapping
Session Topics: 3D Scene Reconstruction for Modeling & Mapping (WG II/3)
| ||
| External Resource: http://www.commission2.isprs.org/wg3 | ||
| Presentations | ||
8:30am - 8:45am
GT-LOD3: LOD3 Semantic 3D Building Reconstruction Benchmark Dataset 1Lyles School of Civil and Construction Engineering, Purdue University, West Lafayette, IN, USA; 2CV4DT, Department of Engineering, University of Cambridge, Cambridge, United Kingdom; 3Faculty of Civil Engineering, Hochschule München University of Applied Sciences, Munich, Germany; 4Remote Sensing Technology Institute, German Aerospace Center (DLR), Oberpfaffenhofen, Germany; 5Department of Civil Engineering, The University of Akron, Akron, OH, USA; 6Institute of Visual Computing, Graz University of Technology, Graz, Austria; 7University of Michigan Transportation Research Institute, University of Michigan, Ann Arbor, MI, USA; 8Institute of Photogrammetry and GeoInformation, Leibniz University Hannover, Hannover, Germany; 9Faculty of Geoinformatics, Hochschule München University of Applied Sciences, Munich, Germany; 10Institute of Photogrammetry and Remote Sensing, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany This contribution introduces GT-LOD3, a new benchmark dataset designed to advance semantic Level of Detail 3 (LOD3) building reconstruction from UAS-based photogrammetric point clouds. Existing benchmarks primarily focus on mesh- or point-level semantic labelling, façade segmentation, or LOD2-level modelling, but high-quality, geometry-accurate LOD3 ground truth paired with real-world photogrammetric observations are still limited. GT-LOD3 fills this gap by offering paired UAS point clouds and manually modeled LOD3 reference data in CityGML format, enabling research on window-level facade reasoning, geometric regularization, and instance-level shape recovery. The benchmark currently consists of two subsets featuring different architectural styles and environmental conditions: (1) a urban block in Gold Coast (Lakewood, Ohio, USA), and (2) the Technical University of Munich (TUM) campus. The accompanying LOD3 reference models contain explicit window geometry, enabling detailed evaluation of both detection performance and polygon-level geometric accuracy. We further provide a baseline reconstruction pipeline that combines point-cloud semantic segmentation, facade-aligned 2D projection, window region extraction, and geometric back-projection into CityGML. An evaluation protocol is presented including pixel-level metrics (IoU, precision, recall, F1) and instance-level detection metrics based on optimal assignment via the Hungarian algorithm. 8:45am - 9:00am
LoD2-Former: Multi-Modal Transformer-Based 3D Building Wireframe Reconstruction 1Remote Sensing Technology Institute, German Aerospace Center (DLR), Wessling, Germany; 2Sm@rts Laboratory, Digital Research Center of Sfax, Tunisia Building wireframe reconstruction from LiDAR faces challenges due to sparse and incomplete point cloud data. We present LoD2-Former, a multi-modal Transformer architecture that fuses aerial LiDAR and optical imagery for end-to-end 3D roof wireframe reconstruction. Unlike existing point-cloud-only methods, our dual-backbone approach with bidirectional cross-modal attention leverages complementary geometric and visual information. Experiments on two datasets show consistent improvements in edge detection metrics, with edge F1-scores increasing from 0.874 to 0.899 on Tallinn and 0.968 to 0.974 on Roof-Intuitive, while substantially boosting corner recall (0.630 to 0.729) in complete-data settings. We also contribute a curated multi-modal subset of Building3D with aligned LiDAR and aerial imagery to facilitate future research. 9:00am - 9:15am
Point2WSS: Reconstructing LoD2 Buildings from Aerial LiDAR Data using Multimodal Learning and Weighted Straight Skeleton 1DEMR, ONERA, Université Paris Saclay, F-91123 Palaiseau, France; 2Univ Gustave Eiffel, ENSG, IGN, LASTIG, F-77420 Champs-sur-Marne, France In this paper, a method exploiting aerial LiDAR point clouds to build realistic building meshes suitable for electromagnetic simulation is proposed. One of the main challenges lies in reconstructing regularized building meshes with low polygonal density. Optimization-based methods, commonly used for building reconstruction from point clouds, are highly data-driven, making the quality of results dependent on the quality of input data. Aerial LiDAR scans can be incomplete or sparse, for instance due to occlusion. A novel LoD2 buildings reconstruction method based on deep learning is proposed, assuming that deep learning methods are more robust to incomplete or sparse data than optimization-based methods. A parametric building model is introduced, based on the Weighted Straight Skeleton algorithm, which generates realistic roofs from a building footprint and an associated set of slopes, and subsequently extrudes the roof to the specified building height. This parametric approach guarantees that a given set of parameters (height, footprint and slopes) produces a regularized building mesh with low polygonal density. A multimodal model, named Point2WSS, was trained to recover the variable number of building's continuous parameters from its corresponding point cloud. This approach enables the generation of realistic building meshes suitable for electromagnetic simulation, if the predicted parameters accurately approximate real-world values. 9:15am - 9:30am
Wide-area Scene Reconstruction with polyhedral Buildings featuring recognized Regularities Fraunhofer IOSB, Germany The modeling of buildings suffers from a dichotomy between generic and specific representations: the lack of domain knowledge in flexible models that can represent many shapes, and the restricted geometry of pre-specified parametric building primitives. To fill this gap, we propose using general boundary representations enriched with automatically recognized and enforced geometric constraints derived from human-made regularities. The proposed reasoning process relies on the statistics of the planar point groups extracted from airborne-captured point clouds. Hence, a chosen significance level is the only process parameter. To enforce the creation of sound solids, we apply manifold constraints for the generation of the boundary representations. The feasibility and usability of the approach are demonstrated by evaluating an airborne-captured laser scan containing approximately 7,600 buildings over an area of 50 km^2 featuring both inner-city and rural landscapes. 9:30am - 9:45am
The P3 Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization 1Université Côte d’Azur, INRIA – Sophia-Antipolis, France; 2LuxCarta Technology, Mouans-Sartoux, France We present P3, a large-scale multimodal dataset for building vectorization, including aerial LiDAR point clouds, aerial images, and vectorized 2D building outlines, collected across three continents. P3 contains over 10 billion LiDAR points with decimeter-level accuracy and RGB images at a ground sampling distance of 25 centimeters. While many existing datasets focus on the image modality, P3 offers a complementary perspective by incorporating dense 3D information. We demonstrate that LiDAR point clouds serve as a robust modality for predicting building polygons, both in hybrid and end-to-end learning frameworks. Moreover, fusing LiDAR and imagery further improves accuracy and geometric quality of predicted polygons. The P3 dataset is publicly available, along with code and pretrained weights of three state-of-the-art models for building polygon prediction at https://github.com/raphaelsulzer/PixelsPointsPolygons . 9:45am - 10:00am
Building height estimation from stereo satellite images using contour vector registration School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China Accurate building height estimation plays a crucial role in large-scale 3D urban reconstruction. However, conventional stereo matching approaches often suffer from mismatches around building edges, leading to unreliable height retrieval in dense urban areas. To address this issue, this paper presents a novel method for building height estimation based on contour vector registration integrated with the vertical line locus technique. The proposed framework first automatically matches building contour vectors extracted from stereo high-resolution satellite images. Then, for each paired contour, a range of candidate heights is searched using a rational function model to project the reference contour from the image space to object space and then reproject it onto the conjugate image. The elevation that maximizes the overlap ratio between projected and paired contours is identified as the optimal roof elevation. Building height is subsequently derived by subtracting the ground elevation from the estimated roof elevation. Experiments conducted on SuperView-1 (SV-1) satellite stereo images over Jiuyuan District, Baotou, Inner Mongolia, China, demonstrate the effectiveness of the proposed method. The resulting building height estimates achieve a root mean square error of 0.84 m compared to manual measurements, showing strong agreement (r = 0.9993). The proposed contour-based stereo registration approach provides a robust and efficient solution for building height extraction from high-resolution satellite data, supporting precise urban 3D modeling and large-scale spatial analysis. | ||

