Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Joshua Lockhart

Learn to explain yourself, when you can: Equipping Concept Bottleneck Models with the ability to abstain on their concept predictions

Nov 21, 2022

Joshua Lockhart, Daniele Magazzeni, Manuela Veloso

Abstract:The Concept Bottleneck Models (CBMs) of Koh et al. [2020] provide a means to ensure that a neural network based classifier bases its predictions solely on human understandable concepts. The concept labels, or rationales as we refer to them, are learned by the concept labeling component of the CBM. Another component learns to predict the target classification label from these predicted concept labels. Unfortunately, these models are heavily reliant on human provided concept labels for each datapoint. To enable CBMs to behave robustly when these labels are not readily available, we show how to equip them with the ability to abstain from predicting concepts when the concept labeling component is uncertain. In other words, our model learns to provide rationales for its predictions, but only whenever it is sure the rationale is correct.

Via

Access Paper or Ask Questions

Towards learning to explain with concept bottleneck models: mitigating information leakage

Nov 07, 2022

Joshua Lockhart, Nicolas Marchesotti, Daniele Magazzeni, Manuela Veloso

Abstract:Concept bottleneck models perform classification by first predicting which of a list of human provided concepts are true about a datapoint. Then a downstream model uses these predicted concept labels to predict the target label. The predicted concepts act as a rationale for the target prediction. Model trust issues emerge in this paradigm when soft concept labels are used: it has previously been observed that extra information about the data distribution leaks into the concept predictions. In this work we show how Monte-Carlo Dropout can be used to attain soft concept predictions that do not contain leaked information.

* Presented at ICLR 2022 Workshop on Socially Responsible Machine Learning

Via

Access Paper or Ask Questions

Feature Importance for Time Series Data: Improving KernelSHAP

Oct 05, 2022

Mattia Villani, Joshua Lockhart, Daniele Magazzeni

Figure 1 for Feature Importance for Time Series Data: Improving KernelSHAP

Figure 2 for Feature Importance for Time Series Data: Improving KernelSHAP

Abstract:Feature importance techniques have enjoyed widespread attention in the explainable AI literature as a means of determining how trained machine learning models make their predictions. We consider Shapley value based approaches to feature importance, applied in the context of time series data. We present closed form solutions for the SHAP values of a number of time series models, including VARMAX. We also show how KernelSHAP can be applied to time series tasks, and how the feature importances that come from this technique can be combined to perform "event detection". Finally, we explore the use of Time Consistent Shapley values for feature importance.

* Will appear at ICAIF Workshop on Explainable Artificial Intelligence in Finance, November 2, 2022

Via

Access Paper or Ask Questions

Reductive MDPs: A Perspective Beyond Temporal Horizons

May 15, 2022

Thomas Spooner, Rui Silva, Joshua Lockhart, Jason Long, Vacslav Glukhov

Figure 1 for Reductive MDPs: A Perspective Beyond Temporal Horizons

Figure 2 for Reductive MDPs: A Perspective Beyond Temporal Horizons

Figure 3 for Reductive MDPs: A Perspective Beyond Temporal Horizons

Figure 4 for Reductive MDPs: A Perspective Beyond Temporal Horizons

Abstract:Solving general Markov decision processes (MDPs) is a computationally hard problem. Solving finite-horizon MDPs, on the other hand, is highly tractable with well known polynomial-time algorithms. What drives this extreme disparity, and do problems exist that lie between these diametrically opposed complexities? In this paper we identify and analyse a sub-class of stochastic shortest path problems (SSPs) for general state-action spaces whose dynamics satisfy a particular drift condition. This construction generalises the traditional, temporal notion of a horizon via decreasing reachability: a property called reductivity. It is shown that optimal policies can be recovered in polynomial-time for reductive SSPs -- via an extension of backwards induction -- with an efficient analogue in reductive MDPs. The practical considerations of the proposed approach are discussed, and numerical verification provided on a canonical optimal liquidation problem.

* 15 pages, 10 figures, 1 algorithm

Via

Access Paper or Ask Questions

Asynchronous Collaborative Learning Across Data Silos

Mar 23, 2022

Tiffany Tuor, Joshua Lockhart, Daniele Magazzeni

Figure 1 for Asynchronous Collaborative Learning Across Data Silos

Figure 2 for Asynchronous Collaborative Learning Across Data Silos

Figure 3 for Asynchronous Collaborative Learning Across Data Silos

Figure 4 for Asynchronous Collaborative Learning Across Data Silos

Abstract:Machine learning algorithms can perform well when trained on large datasets. While large organisations often have considerable data assets, it can be difficult for these assets to be unified in a manner that makes training possible. Data is very often 'siloed' in different parts of the organisation, with little to no access between silos. This fragmentation of data assets is especially prevalent in heavily regulated industries like financial services or healthcare. In this paper we propose a framework to enable asynchronous collaborative training of machine learning models across data silos. This allows data science teams to collaboratively train a machine learning model, without sharing data with one another. Our proposed approach enhances conventional federated learning techniques to make them suitable for this asynchronous training in this intra-organisation, cross-silo setting. We validate our proposed approach via extensive experiments.

* Will appear in conference proceedings of ACM International Conference on AI in Finance (ICAIF '21)

Via

Access Paper or Ask Questions

SURF: Improving classifiers in production by learning from busy and noisy end users

Oct 12, 2020

Joshua Lockhart, Samuel Assefa, Ayham Alajdad, Andrew Alexander, Tucker Balch, Manuela Veloso

Figure 1 for SURF: Improving classifiers in production by learning from busy and noisy end users

Figure 2 for SURF: Improving classifiers in production by learning from busy and noisy end users

Abstract:Supervised learning classifiers inevitably make mistakes in production, perhaps mis-labeling an email, or flagging an otherwise routine transaction as fraudulent. It is vital that the end users of such a system are provided with a means of relabeling data points that they deem to have been mislabeled. The classifier can then be retrained on the relabeled data points in the hope of performance improvement. To reduce noise in this feedback data, well known algorithms from the crowdsourcing literature can be employed. However, the feedback setting provides a new challenge: how do we know what to do in the case of user non-response? If a user provides us with no feedback on a label then it can be dangerous to assume they implicitly agree: a user can be busy, lazy, or no longer a user of the system! We show that conventional crowdsourcing algorithms struggle in this user feedback setting, and present a new algorithm, SURF, that can cope with this non-response ambiguity.

* Will appear in ACM International Conference on AI in Finance (ICAIF '20), October 15-16, 2020, New York, NY, USA

Via

Access Paper or Ask Questions

Some people aren't worth listening to: periodically retraining classifiers with feedback from a team of end users

Apr 27, 2020

Joshua Lockhart, Samuel Assefa, Tucker Balch, Manuela Veloso

Figure 1 for Some people aren't worth listening to: periodically retraining classifiers with feedback from a team of end users

Figure 2 for Some people aren't worth listening to: periodically retraining classifiers with feedback from a team of end users

Figure 3 for Some people aren't worth listening to: periodically retraining classifiers with feedback from a team of end users

Figure 4 for Some people aren't worth listening to: periodically retraining classifiers with feedback from a team of end users

Abstract:Document classification is ubiquitous in a business setting, but often the end users of a classifier are engaged in an ongoing feedback-retrain loop with the team that maintain it. We consider this feedback-retrain loop from a multi-agent point of view, considering the end users as autonomous agents that provide feedback on the labelled data provided by the classifier. This allows us to examine the effect on the classifier's performance of unreliable end users who provide incorrect feedback. We demonstrate a classifier that can learn which users tend to be unreliable, filtering their feedback out of the loop, thus improving performance in subsequent iterations.

* Presented at the 2019 ICML Workshop on AI in Finance: Applications and Infrastructure for Multi-Agent Learning. Long Beach, CA

Via

Access Paper or Ask Questions