Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Gradient-based Bi-level Optimization for Deep Learning: A Survey

Aug 04, 2022

Can, Chen, Xi Chen, Chen Ma, Zixuan Liu, Xue Liu

Figure 1 for Gradient-based Bi-level Optimization for Deep Learning: A Survey

Figure 2 for Gradient-based Bi-level Optimization for Deep Learning: A Survey

Figure 3 for Gradient-based Bi-level Optimization for Deep Learning: A Survey

Figure 4 for Gradient-based Bi-level Optimization for Deep Learning: A Survey

Share this with someone who'll enjoy it:

Abstract:Bi-level optimization, especially the gradient-based category, has been widely used in the deep learning community including hyperparameter optimization and meta knowledge extraction. Bi-level optimization embeds one problem within another and the gradient-based category solves the outer level task by computing the hypergradient, which is much more efficient than classical methods such as the evolutionary algorithm. In this survey, we first give a formal definition of the gradient-based bi-level optimization. Secondly, we illustrate how to formulate a research problem as a bi-level optimization problem, which is of great practical use for beginners. More specifically, there are two formulations: the single-task formulation to optimize hyperparameters such as regularization parameters and the distilled data, and the multi-task formulation to extract meta knowledge such as the model initialization. With a bi-level formulation, we then discuss four bi-level optimization solvers to update the outer variable including explicit gradient update, proxy update, implicit function update, and closed-form update. Last but not least, we conclude the survey by pointing out the great potential of gradient-based bi-level optimization on science problems (AI4Science).

* AI4Science; Bi-level Optimization; Hyperparameter Optimization; Meta Learning; Implicit Function

View paper on

Share this with someone who'll enjoy it:

Title:Gradient-based Bi-level Optimization for Deep Learning: A Survey

Paper and Code