Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wei Bu

Fokker-Planck to Callan-Symanzik: evolution of weight matrices under training

Jan 16, 2025

Wei Bu, Uri Kol, Ziming Liu

Abstract:The dynamical evolution of a neural network during training has been an incredibly fascinating subject of study. First principal derivation of generic evolution of variables in statistical physics systems has proved useful when used to describe training dynamics conceptually, which in practice means numerically solving equations such as Fokker-Planck equation. Simulating entire networks inevitably runs into the curse of dimensionality. In this paper, we utilize Fokker-Planck to simulate the probability density evolution of individual weight matrices in the bottleneck layers of a simple 2-bottleneck-layered auto-encoder and compare the theoretical evolutions against the empirical ones by examining the output data distributions. We also derive physically relevant partial differential equations such as Callan-Symanzik and Kardar-Parisi-Zhang equations from the dynamical equation we have.

* 8 pages, 9 figures

Via

Access Paper or Ask Questions

Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

Aug 18, 2016

Youbao Tang, Xiangqian Wu, Wei Bu

Figure 1 for Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

Figure 2 for Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

Figure 3 for Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

Figure 4 for Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

Abstract:This paper proposes a novel saliency detection method by developing a deeply-supervised recurrent convolutional neural network (DSRCNN), which performs a full image-to-image saliency prediction. For saliency detection, the local, global, and contextual information of salient objects is important to obtain a high quality salient map. To achieve this goal, the DSRCNN is designed based on VGGNet-16. Firstly, the recurrent connections are incorporated into each convolutional layer, which can make the model more powerful for learning the contextual information. Secondly, side-output layers are added to conduct the deeply-supervised operation, which can make the model learn more discriminative and robust features by effecting the intermediate layers. Finally, all of the side-outputs are fused to integrate the local and global information to get the final saliency detection results. Therefore, the DSRCNN combines the advantages of recurrent convolutional neural networks and deeply-supervised nets. The DSRCNN model is tested on five benchmark datasets, and experimental results demonstrate that the proposed method significantly outperforms the state-of-the-art saliency detection approaches on all test datasets.

* 5 pages, 5 figures, accepted by ACMMM 2016

Via

Access Paper or Ask Questions