David Xianfeng Gu

Geometric Understanding of Deep Learning

May 31, 2018

Na Lei, Zhongxuan Luo, Shing-Tung Yau, David Xianfeng Gu

Abstract: Deep learning is the mainstream technique for many machine learning tasks, including image recognition, machine translation, and speech recognition. It has outperformed conventional methods in various fields and achieved great success. Unfortunately, how it works remains poorly understood, and laying down a theoretical foundation for deep learning is of central importance. In this work, we give a geometric view of deep learning: we show that the fundamental principle behind its success is the manifold structure in data, namely, natural high-dimensional data concentrates close to a low-dimensional manifold, and deep learning learns the manifold and the probability distribution on it. We further introduce two concepts: the rectified linear complexity of a deep neural network, which measures its learning capability, and the rectified linear complexity of an embedding manifold, which describes how difficult the manifold is to learn. We then show that for any deep neural network with a fixed architecture, there exists a manifold that cannot be learned by the network. Finally, we propose applying optimal mass transportation theory to control the probability distribution in the latent space.
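
The abstract does not define rectified linear complexity. As a loose illustration only, assuming it relates to how many linear pieces a ReLU network can realize, the sketch below counts the activation regions of a one-hidden-layer ReLU network along a one-dimensional input. The network, the function names, and the grid sampling are all hypothetical and are not taken from the paper. Each hidden unit switches on or off at most once along a line, so n hidden units give at most n + 1 regions.

```python
import random


def relu_pattern(x, weights, biases):
    """On/off pattern of each hidden ReLU unit at scalar input x (hypothetical toy network)."""
    return tuple(w * x + b > 0 for w, b in zip(weights, biases))


def count_linear_pieces(weights, biases, lo=-5.0, hi=5.0, samples=10001):
    """Count runs of identical activation patterns on a uniform grid over [lo, hi].

    Each run is a region where the network is a single affine function, so the
    count is an upper bound on the number of distinct linear pieces in [lo, hi].
    """
    pieces = 0
    previous = None
    for i in range(samples):
        x = lo + (hi - lo) * i / (samples - 1)
        pattern = relu_pattern(x, weights, biases)
        if pattern != previous:
            pieces += 1
            previous = pattern
    return pieces


# A random toy network with 8 hidden units (assumed sizes, for illustration).
rng = random.Random(0)
n_hidden = 8
weights = [rng.uniform(-1, 1) for _ in range(n_hidden)]
biases = [rng.uniform(-1, 1) for _ in range(n_hidden)]
pieces = count_linear_pieces(weights, biases)
print(pieces)
```

The count depends on the random weights and on the sampled interval, but it can never exceed `n_hidden + 1`. This is the sense in which a fixed architecture caps the number of pieces available to it.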

A Geometric View of Optimal Transportation and Generative Model

Dec 19, 2017

Na Lei, Kehua Su, Li Cui, Shing-Tung Yau, David Xianfeng Gu

Abstract: In this work, we show the intrinsic relations between optimal transportation and convex geometry, especially the variational approach to solving the Alexandrov problem: constructing a convex polytope with prescribed face normals and volumes. This leads to a geometric interpretation of generative models and to a novel framework for them. Using the optimal transportation view of the GAN model, we show that the discriminator computes the Kantorovich potential and the generator calculates the transportation map. For a large class of transportation costs, the Kantorovich potential gives the optimal transportation map through a closed-form formula. Therefore, it is sufficient to optimize the discriminator alone. This shows that the adversarial competition can be avoided and the computational architecture can be simplified. Preliminary experimental results show that the geometric method outperforms WGAN for approximating probability measures with multiple clusters in low-dimensional space.
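
The abstract does not say which costs or formula it has in mind. As a minimal sketch, assume the quadratic cost c(x, y) = |x - y|^2 / 2, for which the standard relation is T(x) = x - φ'(x), where φ is the Kantorovich potential. The example below uses a textbook one-dimensional case, transport between two Gaussians, where the potential is known analytically. All function names and parameters are hypothetical and are not the paper's implementation.

```python
def brenier_potential(x, m1, s1, m2, s2):
    """Convex potential u whose derivative is the optimal map between
    N(m1, s1^2) and N(m2, s2^2) under quadratic cost (standard 1D result)."""
    return m2 * x + 0.5 * (s2 / s1) * (x - m1) ** 2


def kantorovich_potential(x, m1, s1, m2, s2):
    """Kantorovich potential phi(x) = x^2 / 2 - u(x) for the quadratic cost."""
    return 0.5 * x * x - brenier_potential(x, m1, s1, m2, s2)


def transport_map(x, m1, s1, m2, s2, h=1e-5):
    """Recover T(x) = x - phi'(x), using a central difference for phi'."""
    dphi = (
        kantorovich_potential(x + h, m1, s1, m2, s2)
        - kantorovich_potential(x - h, m1, s1, m2, s2)
    ) / (2.0 * h)
    return x - dphi


# Source N(0, 1), target N(3, 2^2): the map should be T(x) = 3 + 2 x.
for x in (-1.0, 0.0, 1.0):
    print(x, transport_map(x, 0.0, 1.0, 3.0, 2.0))
```

Here the map is read off from the potential alone, with no separate generator to train. This is only a toy instance of the idea that knowing the potential can be enough.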
