Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Samuel Ehrenstein

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Mar 26, 2024

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang, Inbar Fried, Stephen M. Pizer, Marc Niethammer, Roni Sengupta

Figure 1 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Figure 2 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Figure 3 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Figure 4 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Abstract:Monocular depth estimation in endoscopy videos can enable assistive and robotic surgery to obtain better coverage of the organ and detection of various health issues. Despite promising progress on mainstream, natural image depth estimation, techniques perform poorly on endoscopy images due to a lack of strong geometric features and challenging illumination effects. In this paper, we utilize the photometric cues, i.e., the light emitted from an endoscope and reflected by the surface, to improve monocular depth estimation. We first create two novel loss functions with supervised and self-supervised variants that utilize a per-pixel shading representation. We then propose a novel depth refinement network (PPSNet) that leverages the same per-pixel shading representation. Finally, we introduce teacher-student transfer learning to produce better depth maps from both synthetic data with supervision and clinical data with self-supervision. We achieve state-of-the-art results on the C3VD dataset while estimating high-quality depth maps from clinical data. Our code, pre-trained models, and supplementary materials can be found on our project page: https://ppsnet.github.io/

* 26 pages, 7 tables, 7 figures

Via

Access Paper or Ask Questions

ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction

Nov 19, 2021

Yubo Zhang, Jan-Michael Frahm, Samuel Ehrenstein, Sarah K. McGill, Julian G. Rosenman, Shuxian Wang, Stephen M. Pizer

Figure 1 for ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction

Figure 2 for ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction

Figure 3 for ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction

Figure 4 for ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction

Abstract:One of the key elements of reconstructing a 3D mesh from a monocular video is generating every frame's depth map. However, in the application of colonoscopy video reconstruction, producing good-quality depth estimation is challenging. Neural networks can be easily fooled by photometric distractions or fail to capture the complex shape of the colon surface, predicting defective shapes that result in broken meshes. Aiming to fundamentally improve the depth estimation quality for colonoscopy 3D reconstruction, in this work we have designed a set of training losses to deal with the special challenges of colonoscopy data. For better training, a set of geometric consistency objectives was developed, using both depth and surface normal information. Also, the classic photometric loss was extended with feature matching to compensate for illumination noise. With the training losses powerful enough, our self-supervised framework named ColDE is able to produce better depth maps of colonoscopy data as compared to the previous work utilizing prior depth knowledge. Used in reconstruction, our network is able to reconstruct good-quality colon meshes in real-time without any post-processing, making it the first to be clinically applicable.

* 13 pages, 5 figures

Via

Access Paper or Ask Questions