Picture for Zeynep Akata

Zeynep Akata

TimeSAE: Sparse Decoding for Faithful Explanations of Black-Box Time Series Models

Add code
Jan 14, 2026
Viaarxiv icon

Beyond the final layer: Attentive multilayer fusion for vision transformers

Add code
Jan 14, 2026
Viaarxiv icon

Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs

Add code
Oct 01, 2025
Figure 1 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Figure 2 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Figure 3 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Figure 4 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Viaarxiv icon

Stitch: Training-Free Position Control in Multimodal Diffusion Transformers

Add code
Sep 30, 2025
Viaarxiv icon

Road Obstacle Video Segmentation

Add code
Sep 16, 2025
Viaarxiv icon

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

Add code
Aug 13, 2025
Viaarxiv icon

SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions

Add code
Jul 31, 2025
Figure 1 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Figure 2 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Figure 3 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Figure 4 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Viaarxiv icon

Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study

Add code
Jul 28, 2025
Figure 1 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Figure 2 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Figure 3 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Figure 4 for Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Viaarxiv icon

Align-then-Unlearn: Embedding Alignment for LLM Unlearning

Add code
Jun 16, 2025
Viaarxiv icon

Time Series Representations for Classification Lie Hidden in Pretrained Vision Transformers

Add code
Jun 10, 2025
Viaarxiv icon