Ilija Bogunovic

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

Feb 05, 2026
Via arXiv

Robust Bayesian Optimisation with Unbounded Corruptions

Nov 19, 2025
Via arXiv

Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data

Aug 17, 2025
Via arXiv

Robust Multi-Objective Controlled Decoding of Large Language Models

Mar 11, 2025
Via arXiv

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs

Mar 07, 2025
Via arXiv

Mean-Field Bayesian Optimisation

Feb 17, 2025
Via arXiv

Almost Surely Safe Alignment of Large Language Models at Inference-Time

Feb 03, 2025
Via arXiv

No-Regret Linear Bandits under Gap-Adjusted Misspecification

Jan 09, 2025
Via arXiv

Sample-efficient Bayesian Optimisation Using Known Invariances

Oct 22, 2024
Via arXiv

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

Jul 26, 2024
Via arXiv