Andrew Perrault

Using RLHF to align speech enhancement approaches to mean-opinion quality scores

Oct 17, 2024

ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback

Jun 25, 2024

Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

May 29, 2024

Reinforcement Learning for Fine-tuning Text-to-speech Diffusion Models

May 23, 2024

The Distributional Reward Critic Architecture for Perturbed-Reward Reinforcement Learning

Jan 11, 2024

Coevolutionary Algorithm for Building Robust Decision Trees under Minimax Regret

Dec 14, 2023

Reflections from the Workshop on AI-Assisted Decision Making for Conservation

Jul 17, 2023

Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize

May 26, 2023

Normality-Guided Distributional Reinforcement Learning for Continuous Control

Aug 28, 2022

Learning (Local) Surrogate Loss Functions for Predict-Then-Optimize Problems

Mar 30, 2022