Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

James Ting-Ho Lo

Adaptively Solving the Local-Minimum Problem for Deep Neural Networks

Dec 25, 2020

Huachuan Wang, James Ting-Ho Lo

Figure 1 for Adaptively Solving the Local-Minimum Problem for Deep Neural Networks

Abstract:This paper aims to overcome a fundamental problem in the theory and application of deep neural networks (DNNs). We propose a method to solve the local minimum problem in training DNNs directly. Our method is based on the cross-entropy loss criterion's convexification by transforming the cross-entropy loss into a risk averting error (RAE) criterion. To alleviate numerical difficulties, a normalized RAE (NRAE) is employed. The convexity region of the cross-entropy loss expands as its risk sensitivity index (RSI) increases. Making the best use of the convexity region, our method starts training with an extensive RSI, gradually reduces it, and switches to the RAE as soon as the RAE is numerically feasible. After training converges, the resultant deep learning machine is expected to be inside the attraction basin of a global minimum of the cross-entropy loss. Numerical results are provided to show the effectiveness of the proposed method.

* arXiv admin note: substantial text overlap with arXiv:1506.02690, arXiv:1510.03826

Via

Access Paper or Ask Questions

Low-Order Model of Biological Neural Networks

Dec 12, 2020

Huachuan Wang, James Ting-Ho Lo

Figure 1 for Low-Order Model of Biological Neural Networks

Figure 2 for Low-Order Model of Biological Neural Networks

Figure 3 for Low-Order Model of Biological Neural Networks

Figure 4 for Low-Order Model of Biological Neural Networks

Abstract:A biologically plausible low-order model (LOM) of biological neural networks is a recurrent hierarchical network of dendritic nodes/trees, spiking/nonspiking neurons, unsupervised/ supervised covariance/accumulative learning mechanisms, feedback connections, and a scheme for maximal generalization. These component models are motivated and necessitated by making LOM learn and retrieve easily without differentiation, optimization, or iteration, and cluster, detect and recognize multiple/hierarchical corrupted, distorted, and occluded temporal and spatial patterns.

Via

Access Paper or Ask Questions