Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Eslam Zaher

Counterfactual Explanations on Robust Perceptual Geodesics

Jan 26, 2026

Eslam Zaher, Maciej Trzaskowski, Quan Nguyen, Fred Roosta

Abstract:Latent-space optimization methods for counterfactual explanations - framed as minimal semantic perturbations that change model predictions - inherit the ambiguity of Wachter et al.'s objective: the choice of distance metric dictates whether perturbations are meaningful or adversarial. Existing approaches adopt flat or misaligned geometries, leading to off-manifold artifacts, semantic drift, or adversarial collapse. We introduce Perceptual Counterfactual Geodesics (PCG), a method that constructs counterfactuals by tracing geodesics under a perceptually Riemannian metric induced from robust vision features. This geometry aligns with human perception and penalizes brittle directions, enabling smooth, on-manifold, semantically valid transitions. Experiments on three vision datasets show that PCG outperforms baselines and reveals failure modes hidden under standard metrics.

* Accepted at ICLR 2026

Via

Access Paper or Ask Questions

Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

May 16, 2024

Eslam Zaher, Maciej Trzaskowski, Quan Nguyen, Fred Roosta

Figure 1 for Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Figure 2 for Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Figure 3 for Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Figure 4 for Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Abstract:In this paper, we dive into the reliability concerns of Integrated Gradients (IG), a prevalent feature attribution method for black-box deep learning models. We particularly address two predominant challenges associated with IG: the generation of noisy feature visualizations for vision models and the vulnerability to adversarial attributional attacks. Our approach involves an adaptation of path-based feature attribution, aligning the path of attribution more closely to the intrinsic geometry of the data manifold. Our experiments utilise deep generative models applied to several real-world image datasets. They demonstrate that IG along the geodesics conforms to the curved geometry of the Riemannian data manifold, generating more perceptually intuitive explanations and, subsequently, substantially increasing robustness to targeted attributional attacks.

* Accepted at ICML 2024

Via

Access Paper or Ask Questions