Picture for Nikos Vlassis

Nikos Vlassis

CAPO: Counterfactual Credit Assignment in Sequential Cooperative Teams

Add code
Apr 20, 2026
Viaarxiv icon

Stepwise Credit Assignment for GRPO on Flow-Matching Models

Add code
Mar 30, 2026
Viaarxiv icon

Voice Evaluation of Reasoning Ability: Diagnosing the Modality-Induced Performance Gap

Add code
Sep 30, 2025
Viaarxiv icon

Distributional Off-Policy Evaluation for Slate Recommendations

Add code
Aug 27, 2023
Viaarxiv icon

FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback

Add code
Jul 20, 2023
Figure 1 for FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Figure 2 for FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Figure 3 for FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Figure 4 for FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Viaarxiv icon

Local Policy Improvement for Recommender Systems

Add code
Dec 22, 2022
Figure 1 for Local Policy Improvement for Recommender Systems
Figure 2 for Local Policy Improvement for Recommender Systems
Figure 3 for Local Policy Improvement for Recommender Systems
Figure 4 for Local Policy Improvement for Recommender Systems
Viaarxiv icon

Control Variates for Slate Off-Policy Evaluation

Add code
Jun 15, 2021
Figure 1 for Control Variates for Slate Off-Policy Evaluation
Figure 2 for Control Variates for Slate Off-Policy Evaluation
Figure 3 for Control Variates for Slate Off-Policy Evaluation
Figure 4 for Control Variates for Slate Off-Policy Evaluation
Viaarxiv icon

Off-Policy Evaluation of Slate Policies under Bayes Risk

Add code
Jan 05, 2021
Figure 1 for Off-Policy Evaluation of Slate Policies under Bayes Risk
Figure 2 for Off-Policy Evaluation of Slate Policies under Bayes Risk
Figure 3 for Off-Policy Evaluation of Slate Policies under Bayes Risk
Figure 4 for Off-Policy Evaluation of Slate Policies under Bayes Risk
Viaarxiv icon

More Efficient Off-Policy Evaluation through Regularized Targeted Learning

Add code
Dec 13, 2019
Viaarxiv icon

Posterior Sampling for Large Scale Reinforcement Learning

Add code
Oct 22, 2018
Figure 1 for Posterior Sampling for Large Scale Reinforcement Learning
Figure 2 for Posterior Sampling for Large Scale Reinforcement Learning
Figure 3 for Posterior Sampling for Large Scale Reinforcement Learning
Viaarxiv icon