Stephen Balaban

Deep learning and face recognition: the state of the art

Feb 10, 2019

Stephen Balaban

Abstract: Deep Neural Networks (DNNs) have established themselves as a dominant technique in machine learning. DNNs have been top performers on a wide variety of tasks including image classification, speech recognition, and face recognition. Convolutional neural networks (CNNs) have been used in nearly all of the top-performing methods on the Labeled Faces in the Wild (LFW) dataset. In this talk and accompanying paper, I attempt to provide a review and summary of the deep learning techniques used in the state of the art. In addition, I highlight the need for both larger and more challenging public datasets to benchmark these systems. The high accuracy (99.63% for FaceNet at the time of publishing) and utilization of outside data (hundreds of millions of images in the case of Google's FaceNet) suggest that current face verification benchmarks such as LFW may be neither challenging enough nor large enough for current techniques. A variety of organizations with mobile photo sharing applications would be capable of releasing a very large-scale and highly diverse dataset of facial images captured on mobile devices. Such an "ImageNet for Face Recognition" would likely receive a warm welcome from researchers and practitioners alike.

* Published May 15, 2015 in Proc. SPIE 9457, Biometric and Surveillance Technology for Human and Activity Identification XII, 94570B; Ioannis A. Kakadiaris, Ajay Kumar, Walter J. Scheirer, Editors

RenderNet: A deep convolutional network for differentiable rendering from 3D shapes

Jun 19, 2018

Thu Nguyen-Phuoc, Chuan Li, Stephen Balaban, Yong-Liang Yang

Abstract: The traditional computer graphics rendering pipeline is designed to procedurally generate quality 2D images from 3D shapes with high performance. The non-differentiability caused by discrete operations such as visibility computation makes it hard to explicitly correlate rendering parameters with the resulting image, posing a significant challenge for inverse rendering tasks. Recent work on differentiable rendering achieves differentiability either by designing surrogate gradients for non-differentiable operations or via an approximate but differentiable renderer. These methods, however, are still limited when it comes to handling occlusion, and are restricted to particular rendering effects. We present RenderNet, a differentiable rendering convolutional network with a novel projection unit that can render 2D images from 3D shapes. Spatial occlusion and shading calculation are automatically encoded in the network. Our experiments show that RenderNet can successfully learn to implement different shaders, and can be used in inverse rendering tasks to estimate shape, pose, lighting and texture from a single image.

* 14 pages, 9 figures
