Andrey Vakunov

Real-time Pupil Tracking from Monocular Video for Digital Puppetry

Jun 19, 2020

Artsiom Ablavatski, Andrey Vakunov, Ivan Grishchenko, Karthik Raveendran, Matsvei Zhdanovich

Abstract: We present a simple, real-time approach for pupil tracking from live video on mobile devices. Our method extends a state-of-the-art face mesh detector with two new components: a tiny neural network that predicts positions of the pupils in 2D, and a displacement-based estimation of the pupil blend shape coefficients. Our technique can be used to accurately control the pupil movements of a virtual puppet, and lends liveliness and energy to it. The proposed approach runs at over 50 FPS on modern phones, so it can be used in any real-time puppeteering pipeline.
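
The abstract does not spell out how a 2D pupil position becomes blend shape coefficients. As a rough illustration only, the sketch below maps the pupil's displacement from the eye centre to four coefficients in [0, 1]. The names `Point`, `pupil_blend_shapes`, `max_ratio`, and the four coefficient keys are hypothetical, and the mapping assumes an upright, roughly frontal eye in image coordinates (y grows downward); it is not the authors' formulation.

```python
from dataclasses import dataclass


@dataclass
class Point:
    x: float
    y: float


def clamp01(value: float) -> float:
    """Clamp a value to the range [0, 1]."""
    return max(0.0, min(1.0, value))


def pupil_blend_shapes(pupil: Point, inner_corner: Point, outer_corner: Point,
                       max_ratio: float = 0.25) -> dict:
    """Map pupil displacement from the eye centre to four coefficients.

    Hypothetical mapping: a displacement of `max_ratio` eye widths
    saturates the corresponding coefficient at 1.0.
    """
    center_x = (inner_corner.x + outer_corner.x) / 2.0
    center_y = (inner_corner.y + outer_corner.y) / 2.0
    eye_width = ((outer_corner.x - inner_corner.x) ** 2 +
                 (outer_corner.y - inner_corner.y) ** 2) ** 0.5
    if eye_width == 0.0:
        raise ValueError("eye corners must be distinct")

    # Normalise by eye width so the result is independent of image scale.
    dx = (pupil.x - center_x) / (eye_width * max_ratio)
    dy = (pupil.y - center_y) / (eye_width * max_ratio)

    return {
        "look_right": clamp01(dx),   # +x in image space
        "look_left": clamp01(-dx),
        "look_down": clamp01(dy),    # image y grows downward
        "look_up": clamp01(-dy),
    }
```

Normalising by the eye width keeps the coefficients stable as the face moves closer to or further from the camera; a real pipeline would also need to account for head rotation.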

MediaPipe Hands: On-device Real-time Hand Tracking

Jun 18, 2020

Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling Chang, Matthias Grundmann

Abstract: We present a real-time on-device hand tracking pipeline that predicts a hand skeleton from a single RGB camera for AR/VR applications. The pipeline consists of two models: 1) a palm detector, 2) a hand landmark model. It is implemented via MediaPipe, a framework for building cross-platform ML solutions. The proposed model and pipeline architecture demonstrate real-time inference speed on mobile GPUs and high prediction quality. MediaPipe Hands is open sourced at https://mediapipe.dev.

* 5 pages, 7 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, Seattle, WA, USA, 2020
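
One common way to arrange a detector-plus-landmark pipeline is to run the detector only when no hand is being tracked and otherwise derive the next region of interest from the previous frame's landmarks. The sketch below illustrates that control flow under stated assumptions: `TwoStageTracker`, `box_from_landmarks`, `min_confidence`, and the two model callables are hypothetical stand-ins, not the MediaPipe API, and the abstract itself does not confirm this scheduling.

```python
from typing import Any, Callable, List, Optional, Tuple

Box = Tuple[float, float, float, float]  # x_min, y_min, x_max, y_max
Landmarks = List[Tuple[float, float]]


def box_from_landmarks(landmarks: Landmarks, margin: float = 0.1) -> Box:
    """Bounding box around the landmarks, expanded by a relative margin."""
    xs = [p[0] for p in landmarks]
    ys = [p[1] for p in landmarks]
    width = max(xs) - min(xs)
    height = max(ys) - min(ys)
    return (min(xs) - margin * width, min(ys) - margin * height,
            max(xs) + margin * width, max(ys) + margin * height)


class TwoStageTracker:
    """Runs the detector only when there is no region of interest to reuse."""

    def __init__(self,
                 detect_palm: Callable[[Any], Optional[Box]],
                 predict_landmarks: Callable[[Any, Box], Tuple[Landmarks, float]],
                 min_confidence: float = 0.5) -> None:
        self.detect_palm = detect_palm              # stage 1 (hypothetical model)
        self.predict_landmarks = predict_landmarks  # stage 2 (hypothetical model)
        self.min_confidence = min_confidence
        self.roi: Optional[Box] = None
        self.detector_calls = 0

    def process(self, frame: Any) -> Optional[Landmarks]:
        if self.roi is None:
            # No hand tracked yet (or tracking was lost): run the detector.
            self.detector_calls += 1
            self.roi = self.detect_palm(frame)
            if self.roi is None:
                return None

        landmarks, confidence = self.predict_landmarks(frame, self.roi)
        if confidence < self.min_confidence:
            # Tracking lost: fall back to detection on the next frame.
            self.roi = None
            return None

        # Reuse the landmarks to define the crop for the next frame.
        self.roi = box_from_landmarks(landmarks)
        return landmarks
```

With this arrangement the detector cost is paid only on the first frame and after tracking failures, which is the usual motivation for splitting the work into two models.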

Real-time Hair Segmentation and Recoloring on Mobile GPUs

Jul 15, 2019

Andrei Tkachenka, Gregory Karpiak, Andrey Vakunov, Yury Kartynnik, Artsiom Ablavatski, Valentin Bazarevsky, Siargey Pisarchyk

Abstract: We present a novel approach for neural network-based hair segmentation from a single camera input, designed specifically for real-time mobile applications. Our relatively small neural network produces a high-quality hair segmentation mask that is well suited for AR effects, e.g. virtual hair recoloring. The proposed model achieves real-time inference speed on mobile GPUs (30-100+ FPS, depending on the device) with high accuracy. We also propose a very realistic hair recoloring scheme. Our method has been deployed in a major AR application and is used by millions of users.

* 4 pages, 5 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, Long Beach, CA, USA, 2019
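
To show how a soft segmentation mask can drive a recoloring effect, here is a minimal per-pixel sketch. It tints the target colour by the pixel's luminance and blends it in by the mask value. This is a generic luminance-preserving blend chosen for illustration, not the recoloring scheme the paper proposes; `recolor_pixel` and its parameters are hypothetical.

```python
from typing import Tuple

RGB = Tuple[int, int, int]


def recolor_pixel(pixel: RGB, mask: float, target: RGB) -> RGB:
    """Blend a luminance-scaled target colour into one pixel.

    pixel:  original (r, g, b), each channel in 0..255
    mask:   hair probability for this pixel, in 0..1
    target: desired hair colour (r, g, b), each channel in 0..255
    """
    r, g, b = pixel
    # Rec. 601 luma weights; keeps highlights and shadows of the original hair.
    luminance = (0.299 * r + 0.587 * g + 0.114 * b) / 255.0
    tinted = tuple(channel * luminance for channel in target)
    # Soft mask values give a smooth transition at the hair boundary.
    return tuple(round((1.0 - mask) * p + mask * t)
                 for p, t in zip(pixel, tinted))
```

A mask value of 0 leaves the pixel unchanged, so non-hair regions are untouched; in practice this would run per pixel on the GPU rather than in Python.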

BlazeFace: Sub-millisecond Neural Face Detection on Mobile GPUs

Jul 14, 2019

Valentin Bazarevsky, Yury Kartynnik, Andrey Vakunov, Karthik Raveendran, Matthias Grundmann

Abstract: We present BlazeFace, a lightweight and well-performing face detector tailored for mobile GPU inference. It runs at a speed of 200-1000+ FPS on flagship devices. This super-realtime performance enables it to be applied to any augmented reality pipeline that requires an accurate facial region of interest as an input for task-specific models, such as 2D/3D facial keypoint or geometry estimation, facial features or expression classification, and face region segmentation. Our contributions include a lightweight feature extraction network inspired by, but distinct from, MobileNetV1/V2, a GPU-friendly anchor scheme modified from Single Shot MultiBox Detector (SSD), and an improved tie resolution strategy alternative to non-maximum suppression.

* 4 pages, 3 figures; CVPR Workshop on Computer Vision for Augmented and Virtual Reality, Long Beach, CA, USA, 2019
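
The abstract names a tie resolution strategy that replaces non-maximum suppression but does not describe it. One plausible form, sketched here as an assumption, is to blend overlapping detections into a score-weighted mean box instead of keeping only the top-scoring one. The function names, the `iou_threshold` default, and the choice to keep the top score are all hypothetical.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # x_min, y_min, x_max, y_max
Detection = Tuple[Box, float]            # (box, score), score assumed > 0


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = ((a[2] - a[0]) * (a[3] - a[1]) +
             (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0.0 else 0.0


def blend_detections(detections: List[Detection],
                     iou_threshold: float = 0.3) -> List[Detection]:
    """Merge overlapping detections into score-weighted mean boxes."""
    remaining = sorted(detections, key=lambda d: d[1], reverse=True)
    results: List[Detection] = []
    while remaining:
        top = remaining.pop(0)
        cluster, rest = [top], []
        for det in remaining:
            # Detections overlapping the current top box join its cluster.
            (cluster if iou(top[0], det[0]) >= iou_threshold else rest).append(det)
        remaining = rest

        total = sum(score for _, score in cluster)
        blended = tuple(sum(box[i] * score for box, score in cluster) / total
                        for i in range(4))
        results.append((blended, top[1]))
    return results
```

Compared with plain suppression, averaging the cluster tends to reduce frame-to-frame jitter in video, because small changes in which candidate scores highest no longer cause the output box to jump.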
