Picture for Alfredo Garcia

Alfredo Garcia

Local Linear Convergence of Infeasible Optimization with Orthogonal Constraints

Add code
Dec 07, 2024
Viaarxiv icon

Distributed Networked Multi-task Learning

Add code
Oct 04, 2024
Viaarxiv icon

FedGlu: A personalized federated learning-based glucose forecasting algorithm for improved performance in glycemic excursion regions

Add code
Aug 25, 2024
Viaarxiv icon

Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback

Add code
Jun 11, 2024
Viaarxiv icon

Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment

Add code
May 29, 2024
Figure 1 for Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Figure 2 for Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Figure 3 for Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Figure 4 for Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Viaarxiv icon

Global Convergence of Decentralized Retraction-Free Optimization on the Stiefel Manifold

Add code
May 19, 2024
Viaarxiv icon

Regularized Q-Learning with Linear Function Approximation

Add code
Jan 26, 2024
Figure 1 for Regularized Q-Learning with Linear Function Approximation
Figure 2 for Regularized Q-Learning with Linear Function Approximation
Figure 3 for Regularized Q-Learning with Linear Function Approximation
Viaarxiv icon

Resolving uncertainty on the fly: Modeling adaptive driving behavior as active inference

Add code
Nov 10, 2023
Viaarxiv icon

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

Add code
Oct 10, 2023
Viaarxiv icon

A Bayesian Approach to Robust Inverse Reinforcement Learning

Add code
Sep 15, 2023
Viaarxiv icon