Dharmesh Tailor

Learning to Defer to a Population: A Meta-Learning Approach

Mar 05, 2024

Dharmesh Tailor, Aditya Patra, Rajeev Verma, Putra Manggala, Eric Nalisnick

Abstract: The learning to defer (L2D) framework allows autonomous systems to be safe and robust by allocating difficult decisions to a human expert. All existing work on L2D assumes that each expert is well-identified, and if any expert were to change, the system should be re-trained. In this work, we alleviate this constraint, formulating an L2D system that can cope with never-before-seen experts at test-time. We accomplish this by using meta-learning, considering both optimization- and model-based variants. Given a small context set to characterize the currently available expert, our framework can quickly adapt its deferral policy. For the model-based approach, we employ an attention mechanism that is able to look for points in the context set that are similar to a given test point, leading to an even more precise assessment of the expert's abilities. In the experiments, we validate our methods on image recognition, traffic sign detection, and skin lesion diagnosis benchmarks.

* Accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

The Memory Perturbation Equation: Understanding Model's Sensitivity to Data

Oct 30, 2023

Peter Nickl, Lu Xu, Dharmesh Tailor, Thomas Möllenhoff, Mohammad Emtiyaz Khan

Abstract: Understanding a model's sensitivity to its training data is crucial but can also be challenging and costly, especially during training. To simplify such issues, we present the Memory-Perturbation Equation (MPE), which relates a model's sensitivity to perturbations in its training data. Derived using Bayesian principles, the MPE unifies existing sensitivity measures, generalizes them to a wide variety of models and algorithms, and unravels useful properties regarding sensitivities. Our empirical results show that sensitivity estimates obtained during training can be used to faithfully predict generalization on unseen test data. The proposed equation is expected to be useful for future research on robust and adaptive learning.

* 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

Exploiting Inferential Structure in Neural Processes

Jun 27, 2023

Dharmesh Tailor, Mohammad Emtiyaz Khan, Eric Nalisnick

Abstract: Neural Processes (NPs) are appealing due to their ability to perform fast adaptation based on a context set. This set is encoded by a latent variable, which is often assumed to follow a simple distribution. However, in real-world settings, the context set may be drawn from richer distributions having multiple modes, heavy tails, etc. In this work, we provide a framework that allows NPs' latent variable to be given a rich prior defined by a graphical model. These distributional assumptions directly translate into an appropriate aggregation strategy for the context set. Moreover, we describe a message-passing procedure that still allows for end-to-end optimization with stochastic gradients. We demonstrate the generality of our framework by using mixture and Student-t assumptions that yield improvements in function modelling and test-time robustness.

* Uncertainty in Artificial Intelligence (UAI) 2023

Learning the optimal state-feedback via supervised imitation learning

Jan 07, 2019

Dharmesh Tailor, Dario Izzo

Abstract: Imitation learning is a control design paradigm that seeks to learn a control policy reproducing demonstrations from experts. By replacing the expert's demonstrations with optimal behaviours, the same paradigm leads to the design of control policies closely approximating the optimal state-feedback. This approach requires training a machine learning algorithm (in our case deep neural networks) directly on state-control pairs originating from optimal trajectories. We have shown in previous work that, when restricted to relatively low-dimensional state and control spaces, this approach is very successful in several deterministic, non-linear problems in continuous-time. In this work, we refine our previous studies using as a test case a simple quadcopter model with quadratic and time-optimal objective functions. We describe in detail the best learning pipeline we have developed, which is able to approximate the state-feedback map via deep neural networks to a very high accuracy. We introduce the use of the softplus activation function in the hidden units, showing how it results in a smoother control profile whilst retaining the benefits of ReLUs. We show how to evaluate the optimality of the trained state-feedback, and find that already with two layers the objective function reached and its optimal value differ by less than one percent. We also consider an additional metric linked to the system's asymptotic behaviour: the time taken to converge to the policy's fixed point. With respect to these metrics, we show that improvements in the mean average error do not necessarily correspond to significant improvements.

On the stability analysis of optimal state feedbacks as represented by deep neural models

Dec 07, 2018

Dario Izzo, Dharmesh Tailor, Thomas Vasileiou

Abstract: Research has shown how the optimal feedback control of several nonlinear systems of interest in aerospace applications can be represented by deep neural architectures and trained using techniques including imitation learning, reinforcement learning and evolutionary algorithms. Such deep architectures are here also referred to as Guidance and Control Networks, or G&CNETs. It is difficult to provide theoretical proofs on the control stability of such neural control architectures in general, and G&CNETs in particular, to perturbations, time delays or model uncertainties, or to compute stability margins and trace them back to the network training process or to its architecture. In most cases the analysis of the trained network is performed via Monte Carlo experiments and practitioners forgo any formal guarantee. This lack of validation naturally leads to scepticism, especially in cases where safety and validation are of paramount importance, as, for example, in the automotive or space industry. In an attempt to narrow the gap between deep learning research and control theory, we propose a new methodology based on differential algebra and automated differentiation to obtain formal guarantees on the behaviour of neural based control systems.

Machine learning and evolutionary techniques in interplanetary trajectory design

Sep 28, 2018

Dario Izzo, Christopher Sprague, Dharmesh Tailor

Abstract: After providing a brief historical overview on the synergies between artificial intelligence research, in the areas of evolutionary computations and machine learning, and the optimal design of interplanetary trajectories, we propose and study the use of deep artificial neural networks to represent, on-board, the optimal guidance profile of an interplanetary mission. The results, limited to the chosen test case of an Earth-Mars orbital transfer, extend the findings made previously for landing scenarios and quadcopter dynamics, opening a new research area in interplanetary trajectory planning.

* Submitted as a book chapter for a Springer book on "Optimization in Space Engineering"
