Weakly-Aligned Cross-Modal Learning Framework for Subsurface Defect Segmentation on Building Facades Using Unmanned Aerial Vehicles

24 Pages Posted: 28 May 2024

See all articles by Sudao HE

Sudao HE

The Hong Kong University of Science and Technology

Gang Zhao

The Hong Kong University of Science and Technology

Jun Chen

The Hong Kong University of Science and Technology

Shenghan Zhang

The Hong Kong University of Science and Technology

Dhanda Mishra

affiliation not provided to SSRN

Matthew MF Yuen

affiliation not provided to SSRN

Abstract

This study introduces a Weakly-aligned Cross-modal Learning (WCL) framework for subsurface defect segmentation using UAVs. The proposed WCL framework comprises two main components: the Multimodal Feature Description Network (MFDN) and the Prompt-aided Cross-modal Graph Learning (PCGL) algorithm. Initially, the undistorted RGB and infrared images are processed by MFDN to extract local feature descriptors for multi-modal alignment due to UAV motion. Subsequently, the PCGL algorithm is developed to identify visually critical areas by implementing graph partitioning on a prompt-aided Wasserstein graph. Then, the critical visual areas are transferred to the well-aligned infrared image and a Wasserstein adjacency graph is constructed based on masked superpixel segmentation. Moreover, an edge-based method is developed for pinpointing the location and contour of defects by detecting abnormal vertices on the WAG. The practicality and efficiency of the proposed methodology are validated through controlled laboratory experiments on concrete samples and field applications on tiled facades.

Keywords: Infrared thermography, unmanned aerial vehicle, building facade inspection, subsurface defect segmentation, multimodal image alignment, prompt-aided multimodal graph learning.

Suggested Citation

HE, Sudao and Zhao, Gang and Chen, Jun and Zhang, Shenghan and Mishra, Dhanda and Yuen, Matthew MF, Weakly-Aligned Cross-Modal Learning Framework for Subsurface Defect Segmentation on Building Facades Using Unmanned Aerial Vehicles. Available at SSRN: https://ssrn.com/abstract=4845688 or http://dx.doi.org/10.2139/ssrn.4845688

Sudao HE

The Hong Kong University of Science and Technology ( email )

Gang Zhao

The Hong Kong University of Science and Technology ( email )

Jun Chen

The Hong Kong University of Science and Technology ( email )

Shenghan Zhang (Contact Author)

The Hong Kong University of Science and Technology ( email )

Dhanda Mishra

affiliation not provided to SSRN ( email )

Matthew MF Yuen

affiliation not provided to SSRN ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
46
Abstract Views
219
PlumX Metrics