UASTHN: Uncertainty-Aware Deep Homography Estimation for UAV Satellite-Thermal Geo-localization

IEEE International Conference on Robotics & Automation (ICRA) 2025

Best Poster Award Finalist – Thermal Infrared in Robotics Workshop, ICRA 2025

New York University

TL;DR

UASTHN introduces CropTTA, a simple crop-based test-time augmentation that provides model-agnostic data uncertainty estimation for deep homography estimation in satellite-thermal geo-localization, achieving 7m geo-localization error with a 97% success rate.

Video

Abstract

Geo-localization is an essential component of Unmanned Aerial Vehicle (UAV) navigation systems to ensure precise absolute self-localization in outdoor environments. To address the challenges of GPS signal interruptions or low illumination, Thermal Geo-localization (TG) employs aerial thermal imagery to align with reference satellite maps to accurately determine the UAV's location. However, existing TG methods lack uncertainty measurement in their outputs, compromising system robustness in the presence of textureless or corrupted thermal images, self-similar or outdated satellite maps, geometric noises, or thermal images exceeding satellite maps. To overcome these limitations, this paper presents UASTHN, a novel approach for Uncertainty Estimation (UE) in Deep Homography Estimation (DHE) tasks for TG applications. Specifically, we introduce a novel Crop-based Test-Time Augmentation (CropTTA) strategy, which leverages the homography consensus of cropped image views to measure data uncertainty effectively. This approach complements Deep Ensembles (DE), offering comparable performance with improved efficiency and seamless integration with any DHE model. Extensive experiments across multiple DHE models demonstrate the effectiveness and efficiency of CropTTA in TG applications. Analysis of detected failure cases underscores the improved reliability of CropTTA under challenging conditions. Finally, we demonstrate the capability of combining CropTTA and DE for a comprehensive assessment of both data and model uncertainty.

Data Uncertainty in Thermal Geo-localization

UASTHN captures six categories of high data-uncertainty samples leading to geo-localization failure, where predicted displacements significantly deviate from the ground truth. Thermal images are overlaid on predicted displacements on the satellite imagery for visualization.

UASTHN teaser: six categories of data uncertainty
Categories from left to right: (a) Textureless features, (b) Image corruption, (c) Geometric noise, (d) Self-similar maps, (e) Exceeding regions, (f) Outdated maps.

Method Overview

UASTHN combines CropTTA for data uncertainty with Deep Ensembles for model uncertainty. CropTTA augments thermal images by cropping with specific offsets. The homography network FH with an uncertainty estimation module calculates aggregated displacements and data uncertainty. High-uncertainty samples are rejected. Optionally, Deep Ensembles estimate model uncertainty, which can be combined with CropTTA for comprehensive assessment.

UASTHN Framework
UASTHN Framework. CropTTA augments thermal images, and network FH with a UE module calculates aggregated displacements and data uncertainty. High-uncertainty samples are rejected. Optionally, DE estimates model uncertainty for comprehensive UE.

CropTTA Sampling Methods

Crop offset
(a) Crop offsets
Random sampling
(b) Random sampling
Grid sampling
(c) Grid sampling

Visualization of crop offsets and sampling methods. Colored boxes represent cropping regions with different offsets.

Quantitative Results

Comparison of uncertainty estimation methods across DHE baselines at WS = 1536. All baselines trained with real and synthesized thermal data.

DHE Method UE Method Uncertainty DC = 128m DC = 256m DC = 512m
MACECESR MACECESR MACECESR
DHN
73.6073.58100% 171.93171.02100% 342.41341.11100%
DEmodel 61.9161.9097.3% 162.77162.7596.4% 346.26346.2193.6%
DMdata 66.7766.7699.3% 164.19164.1799.7% 335.81335.7791.7%
CropTTAdata 64.1862.7898.6% 162.04161.8295.1% 337.90337.8491.5%
CropTTA+DEdata+model 64.0962.7096.6% 161.84161.6296.1% 336.41336.4793.6%
IHN
7.277.24100% 16.7816.42100% 16.4215.90100%
DEmodel 6.076.0697.9% 12.3112.1397.6% 13.7313.3794.5%
DMdata 7.026.99100% 11.8111.40100% 11.4811.1494.1%
CropTTAdata 7.467.4797.4% 11.9110.8097.5% 9.278.0695.0%
CropTTA+DEdata+model 7.257.2695.7% 11.5710.4497.1% 10.679.4193.8%
STHN (two-stage)
7.516.66100% 14.9914.34100% 12.7012.12100%
DEmodel 9.458.7298.1% 9.989.0995.3% 8.297.5897.3%
DMdata 9.758.68100% 13.6412.91100% 11.3510.64100%
CropTTAdata 8.267.7598.5% 7.857.3195.8% 7.937.2597.5%
CropTTA+DEdata+model 8.167.6598.1% 7.506.9794.5% 7.837.1597.0%

Blue bold = best result. Underlined = second best.

Inference Time

Method Early Stop w/o UE CropTTA DE CropTTA+DE
IHN35.264.6114.6164.2
IHN54.663.192.1
STHN63.987.0130.2186.0
STHN78.281.9118.6

Inference time (ms) with 5 samples on NVIDIA RTX 2080Ti. Early stopping reduces overhead while maintaining accuracy.

Detected Failure Cases

Thermal images overlap with satellite images, showing ground truth and predicted displacements. Thermal images are overlaid on predicted displacements on the satellite imagery for visualization.

BibTeX


    @INPROCEEDINGS{11128423,
    author={Xiao, Jiuhong and Loianno, Giuseppe},
    booktitle={2025 IEEE International Conference on Robotics and Automation (ICRA)}, 
    title={UASTHN: Uncertainty-Aware Deep Homography Estimation for UAV Satellite-Thermal Geo-Localization}, 
    year={2025},
    volume={},
    number={},
    pages={14066-14072},
    keywords={Location awareness;Uncertainty;Satellites;Measurement uncertainty;Estimation;Autonomous aerial vehicles;Thermal noise;Robustness;Noise measurement;Robotics and automation},
    doi={10.1109/ICRA55743.2025.11128423}}