Title: Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors

Nov 26, 2024

Ziang Xu, Bin Li, Yang Hu, Chenyu Zhang, James East, Sharib Ali, Jens Rittscher

Abstract: Accurate 3D mapping in endoscopy enables quantitative, holistic lesion characterization within the gastrointestinal (GI) tract, requiring reliable depth and pose estimation. However, endoscopy systems are monocular, and existing methods relying on synthetic datasets or complex models often lack generalizability in challenging endoscopic conditions. We propose a robust self-supervised monocular depth and pose estimation framework that incorporates a Generative Latent Bank and a Variational Autoencoder (VAE). The Generative Latent Bank leverages extensive depth scenes from natural images to condition the depth network, enhancing realism and robustness of depth predictions through latent feature priors. For pose estimation, we reformulate it within a VAE framework, treating pose transitions as latent variables to regularize scale, stabilize z-axis prominence, and improve x-y sensitivity. This dual refinement pipeline enables accurate depth and pose predictions, effectively addressing the GI tract's complex textures and lighting. Extensive evaluations on SimCol and EndoSLAM datasets confirm our framework's superior performance over published self-supervised methods in endoscopic depth and pose estimation.
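The abstract does not say how the pose VAE is parameterized. As a rough illustration of what "treating pose transitions as latent variables" can look like in a generic VAE, the sketch below samples a pose latent with the standard reparameterization trick and computes the KL regularizer against a standard normal prior. The 6-value latent layout, the diagonal Gaussian posterior, the prior, and every function name and number here are assumptions made for illustration, not details from the paper.

```python
import math
import random


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps per dimension, with eps ~ N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(exp(log_var))) || N(0, I)), summed over dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))


# Hypothetical encoder output for one pair of consecutive frames.
# Assumed layout: (tx, ty, tz, rx, ry, rz); the values are made up.
mu = [0.01, -0.02, 0.30, 0.0, 0.005, -0.001]
log_var = [-4.0] * 6

rng = random.Random(0)
pose_sample = reparameterize(mu, log_var, rng)
kl = kl_to_standard_normal(mu, log_var)
```

In a training loop built along these lines, `pose_sample` would feed the view-synthesis (photometric) loss and `kl` would be added as a weighted regularizer; how the paper actually combines its terms is not stated in the abstract.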
