Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haixin Shi

Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

May 10, 2024

Haixin Shi, Yinlin Hu, Daniel Koguciuk, Juan-Ting Lin, Mathieu Salzmann, David Ferstl

Figure 1 for Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Figure 2 for Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Figure 3 for Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Figure 4 for Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Abstract:We propose an approach for reconstructing free-moving object from a monocular RGB video. Most existing methods either assume scene prior, hand pose prior, object category pose prior, or rely on local optimization with multiple sequence segments. We propose a method that allows free interaction with the object in front of a moving camera without relying on any prior, and optimizes the sequence globally without any segments. We progressively optimize the object shape and pose simultaneously based on an implicit neural representation. A key aspect of our method is a virtual camera system that reduces the search space of the optimization significantly. We evaluate our method on the standard HO3D dataset and a collection of egocentric RGB sequences captured with a head-mounted device. We demonstrate that our approach outperforms most methods significantly, and is on par with recent techniques that assume prior information.

Via

Access Paper or Ask Questions

Two-level Data Augmentation for Calibrated Multi-view Detection

Oct 19, 2022

Martin Engilberge, Haixin Shi, Zhiye Wang, Pascal Fua

Figure 1 for Two-level Data Augmentation for Calibrated Multi-view Detection

Figure 2 for Two-level Data Augmentation for Calibrated Multi-view Detection

Figure 3 for Two-level Data Augmentation for Calibrated Multi-view Detection

Figure 4 for Two-level Data Augmentation for Calibrated Multi-view Detection

Abstract:Data augmentation has proven its usefulness to improve model generalization and performance. While it is commonly applied in computer vision application when it comes to multi-view systems, it is rarely used. Indeed geometric data augmentation can break the alignment among views. This is problematic since multi-view data tend to be scarce and it is expensive to annotate. In this work we propose to solve this issue by introducing a new multi-view data augmentation pipeline that preserves alignment among views. Additionally to traditional augmentation of the input image we also propose a second level of augmentation applied directly at the scene level. When combined with our simple multi-view detection model, our two-level augmentation pipeline outperforms all existing baselines by a significant margin on the two main multi-view multi-person detection datasets WILDTRACK and MultiviewX.

* Accepted at WACV 2023

Via

Access Paper or Ask Questions