Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haoyuan Mu

DP-Net: Dynamic Programming Guided Deep Neural Network Compression

Mar 21, 2020

Dingcheng Yang, Wenjian Yu, Ao Zhou, Haoyuan Mu, Gary Yao, Xiaoyi Wang

Figure 1 for DP-Net: Dynamic Programming Guided Deep Neural Network Compression

Figure 2 for DP-Net: Dynamic Programming Guided Deep Neural Network Compression

Figure 3 for DP-Net: Dynamic Programming Guided Deep Neural Network Compression

Figure 4 for DP-Net: Dynamic Programming Guided Deep Neural Network Compression

Abstract:In this work, we propose an effective scheme (called DP-Net) for compressing the deep neural networks (DNNs). It includes a novel dynamic programming (DP) based algorithm to obtain the optimal solution of weight quantization and an optimization process to train a clustering-friendly DNN. Experiments showed that the DP-Net allows larger compression than the state-of-the-art counterparts while preserving accuracy. The largest 77X compression ratio on Wide ResNet is achieved by combining DP-Net with other compression techniques. Furthermore, the DP-Net is extended for compressing a robust DNN model with negligible accuracy loss. At last, a custom accelerator is designed on FPGA to speed up the inference computation with DP-Net.

* 7pages, 4 figures

Via

Access Paper or Ask Questions

Single Path One-Shot Neural Architecture Search with Uniform Sampling

Apr 06, 2019

Zichao Guo, Xiangyu Zhang, Haoyuan Mu, Wen Heng, Zechun Liu, Yichen Wei, Jian Sun

Figure 1 for Single Path One-Shot Neural Architecture Search with Uniform Sampling

Figure 2 for Single Path One-Shot Neural Architecture Search with Uniform Sampling

Figure 3 for Single Path One-Shot Neural Architecture Search with Uniform Sampling

Figure 4 for Single Path One-Shot Neural Architecture Search with Uniform Sampling

Abstract:One-shot method is a powerful Neural Architecture Search (NAS) framework, but its training is non-trivial and it is difficult to achieve competitive results on large scale datasets like ImageNet. In this work, we propose a Single Path One-Shot model to address its main challenge in the training. Our central idea is to construct a simplified supernet, Single Path Supernet, which is trained by an uniform path sampling method. All underlying architectures (and their weights) get trained fully and equally. Once we have a trained supernet, we apply an evolutionary algorithm to efficiently search the best-performing architectures without any fine tuning. Comprehensive experiments verify that our approach is flexible and effective. It is easy to train and fast to search. It effortlessly supports complex search spaces (e.g., building blocks, channel, mixed-precision quantization) and different search constraints (e.g., FLOPs, latency). It is thus convenient to use for various needs. It achieves start-of-the-art performance on the large dataset ImageNet.

Via

Access Paper or Ask Questions

Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

Apr 03, 2019

Xuecai Hu, Haoyuan Mu, Xiangyu Zhang, Zilei Wang, Tieniu Tan, Jian Sun

Figure 1 for Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

Figure 2 for Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

Figure 3 for Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

Figure 4 for Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

Abstract:Recent research on super-resolution has achieved great success due to the development of deep convolutional neural networks (DCNNs). However, super-resolution of arbitrary scale factor has been ignored for a long time. Most previous researchers regard super-resolution of different scale factors as independent tasks. They train a specific model for each scale factor which is inefficient in computing, and prior work only take the super-resolution of several integer scale factors into consideration. In this work, we propose a novel method called Meta-SR to firstly solve super-resolution of arbitrary scale factor (including non-integer scale factors) with a single model. In our Meta-SR, the Meta-Upscale Module is proposed to replace the traditional upscale module. For arbitrary scale factor, the Meta-Upscale Module dynamically predicts the weights of the upscale filters by taking the scale factor as input and use these weights to generate the HR image of arbitrary size. For any low-resolution image, our Meta-SR can continuously zoom in it with arbitrary scale factor by only using a single model. We evaluated the proposed method through extensive experiments on widely used benchmark datasets on single image super-resolution. The experimental results show the superiority of our Meta-Upscale.

* 10 pages, 4 figures

Via

Access Paper or Ask Questions

MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

Apr 03, 2019

Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Tim Kwang-Ting Cheng, Jian Sun

Figure 1 for MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

Figure 2 for MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

Figure 3 for MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

Figure 4 for MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

Abstract:In this paper, we propose a novel meta learning approach for automatic channel pruning of very deep neural networks. We first train a PruningNet, a kind of meta network, which is able to generate weight parameters for any pruned structure given the target network. We use a simple stochastic structure sampling method for training the PruningNet. Then, we apply an evolutionary procedure to search for good-performing pruned networks. The search is highly efficient because the weights are directly generated by the trained PruningNet and we do not need any finetuning. With a single PruningNet trained for the target network, we can search for various Pruned Networks under different constraints with little human participation. We have demonstrated competitive performances on MobileNet V1/V2 networks, up to 9.0/9.9 higher ImageNet accuracy than V1/V2. Compared to the previous state-of-the-art AutoML-based pruning methods, like AMC and NetAdapt, we achieve higher or comparable accuracy under various conditions.

Via

Access Paper or Ask Questions