Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Castells-Rufas

BronchoPose: an analysis of data and model configuration for vision-based bronchoscopy pose estimation

Apr 25, 2022

Juan Borrego-Carazo, Carles Sánchez, David Castells-Rufas, Jordi Carrabina, Débora Gil

Figure 1 for BronchoPose: an analysis of data and model configuration for vision-based bronchoscopy pose estimation

Figure 2 for BronchoPose: an analysis of data and model configuration for vision-based bronchoscopy pose estimation

Figure 3 for BronchoPose: an analysis of data and model configuration for vision-based bronchoscopy pose estimation

Figure 4 for BronchoPose: an analysis of data and model configuration for vision-based bronchoscopy pose estimation

Abstract:Vision-based bronchoscopy (VB) models require the registration of the virtual lung model with the frames from the video bronchoscopy to provide effective guidance during the biopsy. The registration can be achieved by either tracking the position and orientation of the bronchoscopy camera or by calibrating its deviation from the pose (position and orientation) simulated in the virtual lung model. Recent advances in neural networks and temporal image processing have provided new opportunities for guided bronchoscopy. However, such progress has been hindered by the lack of comparative experimental conditions. In the present paper, we share a novel synthetic dataset allowing for a fair comparison of methods. Moreover, this paper investigates several neural network architectures for the learning of temporal information at different levels of subject personalization. In order to improve orientation measurement, we also present a standardized comparison framework and a novel metric for camera orientation learning. Results on the dataset show that the proposed metric and architectures, as well as the standardized conditions, provide notable improvements to current state-of-the-art camera pose estimation in video bronchoscopy.

Via

Access Paper or Ask Questions

OpenCL-based FPGA accelerator for disparity map generation with stereoscopic event cameras

Mar 08, 2019

David Castells-Rufas, Jordi Carrabina

Figure 1 for OpenCL-based FPGA accelerator for disparity map generation with stereoscopic event cameras

Figure 2 for OpenCL-based FPGA accelerator for disparity map generation with stereoscopic event cameras

Figure 3 for OpenCL-based FPGA accelerator for disparity map generation with stereoscopic event cameras

Figure 4 for OpenCL-based FPGA accelerator for disparity map generation with stereoscopic event cameras

Abstract:Although event-based cameras are already commercially available. Vision algorithms based on them are still not common. As a consequence, there are few Hardware Accelerators for them. In this work we present some experiments to create FPGA accelerators for a well-known vision algorithm using event-based cameras. We present a stereo matching algorithm to create a stream of disparity events disparity map and implement several accelerators using the Intel FPGA OpenCL tool-chain. The results show that multiple designs can be easily tested and that a performance speedup of more than 8x can be achieved with simple code transformations.

* Presented at HIP3ES, 2019

Via

Access Paper or Ask Questions

A High-Performance HOG Extractor on FPGA

Jan 12, 2018

Vinh Ngo, Arnau Casadevall, Marc Codina, David Castells-Rufas, Jordi Carrabina

Figure 1 for A High-Performance HOG Extractor on FPGA

Figure 2 for A High-Performance HOG Extractor on FPGA

Figure 3 for A High-Performance HOG Extractor on FPGA

Figure 4 for A High-Performance HOG Extractor on FPGA

Abstract:Pedestrian detection is one of the key problems in emerging self-driving car industry. And HOG algorithm has proven to provide good accuracy for pedestrian detection. There are plenty of research works have been done in accelerating HOG algorithm on FPGA because of its low-power and high-throughput characteristics. In this paper, we present a high-performance HOG architecture for pedestrian detection on a low-cost FPGA platform. It achieves a maximum throughput of 526 FPS with 640x480 input images, which is 3.25 times faster than the state of the art design. The accelerator is integrated with SVM-based prediction in realizing a pedestrian detection system. And the power consumption of the whole system is comparable with the best existing implementations.

* Presented at HIP3ES, 2018

Via

Access Paper or Ask Questions