Picture for Wesley A. Suttle

Wesley A. Suttle

Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

Add code
Nov 01, 2024
Viaarxiv icon

AIME: AI System Optimization via Multiple LLM Evaluators

Add code
Oct 04, 2024
Viaarxiv icon

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

Add code
Jun 16, 2024
Viaarxiv icon

PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

Add code
Apr 20, 2024
Viaarxiv icon

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic

Add code
Mar 18, 2024
Viaarxiv icon

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

Add code
Mar 06, 2024
Viaarxiv icon

Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks

Add code
Feb 09, 2024
Viaarxiv icon

Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation

Add code
Jun 09, 2023
Viaarxiv icon

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

Add code
Feb 01, 2023
Viaarxiv icon

Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search

Add code
Jan 21, 2022
Figure 1 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Figure 2 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Figure 3 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Figure 4 for Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search
Viaarxiv icon