Abstract
Accurate lens distortion correction is important for calibration, registration, image stitching, and 3D reconstruction, especially in low-data device-specific settings where disposable or specialised cameras cannot provide large calibration datasets. We address distortion correction for cameras with highly irregular or non-stationary distortion fields, where fixed polynomial models and generic learning-based rectification methods can struggle. We propose a framework based on Deep Gaussian Processes (DGPs) to model the non-linear mapping required for undistortion. The key motivation is that conventional single-layer GPs with stationary kernels must use one global notion of smoothness, whereas DGPs can represent spatially varying behaviour through composed latent mappings while preserving per-pixel predictive uncertainty. This uncertainty can be used to identify or downweight unreliable corrected regions in downstream tasks. We evaluate the method on three real camera datasets with increasing distortion complexity. The full structured acquisitions contain 512 horizontal and 512 vertical line images per camera. These are not thousands of natural calibration images, but they yield up to 29,532, 11,311, and 31,686 detected intersection correspondences for the RPI, Theta, and Pillcam datasets, respectively. This distinction is important for cameras where acquiring many independent images is impractical. The results are assessed using qualitative rectification, uncertainty maps, normalised collinearity errors, and total training time. Polynomial calibration remains strongest for the regular radial RPI distortion, while DGP and DGP2 models show lower normalised collinearity-error distributions than the standard GP and lightweight MLP baselines on the more distorted Theta and Pillcam datasets. For the full datasets, total DGP/DGP2 training times ranged from 2383.50 s to 10092.50 s, reflecting the additional computational cost of probabilistic non-stationary modelling.
IPC Classification
Keywords
€ 4.00