SoonYong Cho

DOR3D-Net: Dense Ordinal Regression Network for 3D Hand Pose Estimation

Mar 20, 2024

Yamin Mao, Zhihua Liu, Weiming Li, SoonYong Cho, Qiang Wang, Xiaoshuai Hao

Abstract: Depth-based 3D hand pose estimation is an important but challenging research task in the human-machine interaction community. Recently, dense regression methods have attracted increasing attention in 3D hand pose estimation because they achieve high accuracy at a low computational cost by densely regressing hand joint offset maps. However, large-scale regression offset values are often affected by noise and outliers, leading to a significant drop in accuracy. To tackle this, we re-formulate 3D hand pose estimation as a dense ordinal regression problem and propose a novel Dense Ordinal Regression 3D Pose Network (DOR3D-Net). Specifically, we first decompose offset value regression into sub-tasks of binary classification with ordinal constraints. Each binary classifier then predicts the probability of a binary spatial relationship relative to a joint, which is easier to train and yields a much lower level of noise. The hand joint positions are estimated by aggregating the ordinal regression results at local positions with a weighted sum. Furthermore, both a joint regression loss and an ordinal regression loss are used to train DOR3D-Net in an end-to-end manner. Extensive experiments on public datasets (ICVL, MSRA, NYU and HANDS2017) show that our design provides significant improvements over state-of-the-art (SOTA) methods.
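The decomposition described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the evenly spaced thresholds, the expected-value decoding rule, and the function names `ordinal_encode`, `ordinal_decode`, and `aggregate_joint` are all assumptions made for illustration. It shows one common way to turn a scalar offset into ordered binary targets, recover an offset from the classifier probabilities, and combine per-position votes with a weighted sum.

```python
from typing import List, Sequence, Tuple


def ordinal_encode(offset: float, lo: float, hi: float, k: int) -> List[int]:
    """Encode an offset in [lo, hi] as k ordered binary targets.

    Assumption: thresholds sit at the centers of k equal-width bins.
    Bit j is 1 when the offset exceeds the j-th threshold, so a valid
    code is a run of 1s followed by a run of 0s (the ordinal constraint).
    """
    step = (hi - lo) / k
    return [1 if offset > lo + (j + 0.5) * step else 0 for j in range(k)]


def ordinal_decode(probs: Sequence[float], lo: float, hi: float) -> float:
    """Recover an offset from per-threshold probabilities.

    Summing the probabilities gives the expected number of thresholds
    exceeded; scaling by the bin width maps that back to an offset.
    """
    step = (hi - lo) / len(probs)
    return lo + step * sum(probs)


def aggregate_joint(
    positions: Sequence[Tuple[float, ...]],
    offsets: Sequence[Tuple[float, ...]],
    weights: Sequence[float],
) -> Tuple[float, ...]:
    """Weighted sum of per-position votes (position + decoded offset)."""
    total = sum(weights)
    dims = len(positions[0])
    return tuple(
        sum(w * (p[d] + o[d]) for p, o, w in zip(positions, offsets, weights)) / total
        for d in range(dims)
    )


if __name__ == "__main__":
    # Toy values only; they do not come from the paper.
    code = ordinal_encode(0.5, lo=-1.0, hi=1.0, k=4)
    print(code)
    print(ordinal_decode(code, lo=-1.0, hi=1.0))
    print(aggregate_joint([(0, 0, 0), (2, 0, 0)], [(1, 0, 0), (-1, 0, 0)], [1.0, 3.0]))
```

In this sketch each binary decision only has to answer whether the offset lies above a threshold, which may be why such sub-tasks are less sensitive to outliers than direct regression of a large offset value.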

DVI-SLAM: A Dual Visual Inertial SLAM Network

Sep 25, 2023

Xiongfeng Peng, Zhihua Liu, Weiming Li, Ping Tan, SoonYong Cho, Qiang Wang

Abstract: Recent deep learning based visual simultaneous localization and mapping (SLAM) methods have made significant progress. However, how to make full use of visual information and how to better integrate an inertial measurement unit (IMU) into visual SLAM still merit further research. This paper proposes a novel deep SLAM network with dual visual factors. The basic idea is to integrate both a photometric factor and a re-projection factor into an end-to-end differentiable structure through a multi-factor data association module. We show that the proposed network dynamically learns and adjusts the confidence maps of both visual factors and that it can be further extended to include IMU factors as well. Extensive experiments validate that our proposed method significantly outperforms state-of-the-art methods on several public datasets, including TartanAir, EuRoC and ETH3D-SLAM. Specifically, when dynamically fusing the three factors together, the absolute trajectory error on the EuRoC dataset is reduced by 45.3% and 36.2% for the monocular and stereo configurations, respectively.

* 7 pages, 3 figures
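For readers unfamiliar with the metric quoted in the abstract, the following is a minimal sketch of how an absolute trajectory error (ATE) and a relative reduction are commonly computed. It assumes the estimated and ground-truth trajectories are already time-synchronized and aligned to the same frame (the alignment step is omitted here), and the function names `ate_rmse` and `relative_reduction` are hypothetical. It is not the paper's evaluation code.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def ate_rmse(estimated: Sequence[Point], ground_truth: Sequence[Point]) -> float:
    """Root-mean-square translational error between matched poses.

    Assumption: both trajectories have the same length and are already
    aligned, so pose i in one corresponds to pose i in the other.
    """
    if not estimated or len(estimated) != len(ground_truth):
        raise ValueError("trajectories must be non-empty and of equal length")
    squared = [
        sum((e - g) ** 2 for e, g in zip(est, gt))
        for est, gt in zip(estimated, ground_truth)
    ]
    return math.sqrt(sum(squared) / len(squared))


def relative_reduction(baseline: float, improved: float) -> float:
    """Percentage by which `improved` is lower than `baseline`."""
    return 100.0 * (baseline - improved) / baseline


if __name__ == "__main__":
    # Toy trajectories only; these are not results from the paper.
    est = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    gt = [(0.0, 0.0, 0.0), (1.0, 3.0, 4.0)]
    print(ate_rmse(est, gt))
    print(relative_reduction(2.0, 1.0))
```

A reported reduction such as 45.3% would correspond to `relative_reduction(baseline_ate, new_ate)` under this definition, assuming the paper measures the reduction relative to the baseline error.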
