Junqi Jiang

Interpreting Language Reward Models via Contrastive Explanations

Nov 25, 2024
Via arXiv

Heterogeneous Graph Neural Networks with Post-hoc Explanations for Multi-modal and Explainable Land Use Inference

Jun 19, 2024
Via arXiv

Contestable AI needs Computational Argumentation

May 17, 2024
Via arXiv

Interval Abstractions for Robust Counterfactual Explanations

Apr 21, 2024
Via arXiv

Robust Counterfactual Explanations in Machine Learning: A Survey

Feb 02, 2024
Via arXiv

Recourse under Model Multiplicity via Argumentative Ensembling (Technical Report)

Jan 03, 2024
Via arXiv

Provably Robust and Plausible Counterfactual Explanations for Neural Networks via Robust Optimisation

Sep 22, 2023
Via arXiv

Formalising the Robustness of Counterfactual Explanations for Neural Networks

Aug 31, 2022
Via arXiv