Picture for Youssef Mroueh

Youssef Mroueh

IBM Research, USA

Reinforcement Learning with Verifiable Rewards: GRPO's Effective Loss, Dynamics, and Success Amplification

Add code
Mar 09, 2025
Viaarxiv icon

Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection

Add code
Feb 20, 2025
Viaarxiv icon

Theoretical Analysis of KL-regularized RLHF with Multiple Reference Models

Add code
Feb 03, 2025
Figure 1 for Theoretical Analysis of KL-regularized RLHF with Multiple Reference Models
Viaarxiv icon

Large Language Models can be Strong Self-Detoxifiers

Add code
Oct 04, 2024
Viaarxiv icon

Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry

Add code
Jul 16, 2024
Figure 1 for Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
Figure 2 for Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
Figure 3 for Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
Figure 4 for Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry
Viaarxiv icon

Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking

Add code
Jun 10, 2024
Viaarxiv icon

Distributional Preference Alignment of LLMs via Optimal Transport

Add code
Jun 09, 2024
Figure 1 for Distributional Preference Alignment of LLMs via Optimal Transport
Figure 2 for Distributional Preference Alignment of LLMs via Optimal Transport
Figure 3 for Distributional Preference Alignment of LLMs via Optimal Transport
Figure 4 for Distributional Preference Alignment of LLMs via Optimal Transport
Viaarxiv icon

Information Theoretic Guarantees For Policy Alignment In Large Language Models

Add code
Jun 09, 2024
Figure 1 for Information Theoretic Guarantees For Policy Alignment In Large Language Models
Figure 2 for Information Theoretic Guarantees For Policy Alignment In Large Language Models
Viaarxiv icon

Risk Assessment and Statistical Significance in the Age of Foundation Models

Add code
Oct 11, 2023
Figure 1 for Risk Assessment and Statistical Significance in the Age of Foundation Models
Figure 2 for Risk Assessment and Statistical Significance in the Age of Foundation Models
Figure 3 for Risk Assessment and Statistical Significance in the Age of Foundation Models
Figure 4 for Risk Assessment and Statistical Significance in the Age of Foundation Models
Viaarxiv icon

Auditing and Generating Synthetic Data with Controllable Trust Trade-offs

Add code
May 02, 2023
Figure 1 for Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
Figure 2 for Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
Figure 3 for Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
Figure 4 for Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
Viaarxiv icon