Picture for Gal Dalal

Gal Dalal

Gradient Boosting Reinforcement Learning

Add code
Jul 11, 2024
Viaarxiv icon

PlaMo: Plan and Move in Rich 3D Physical Environments

Add code
Jun 26, 2024
Viaarxiv icon

Tree Search-Based Policy Optimization under Stochastic Execution Delay

Add code
Apr 08, 2024
Viaarxiv icon

Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization

Add code
Feb 15, 2024
Figure 1 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Figure 2 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Figure 3 for Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Viaarxiv icon

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

Add code
Jan 30, 2023
Viaarxiv icon

SoftTreeMax: Policy Gradient with Tree Search

Add code
Sep 28, 2022
Figure 1 for SoftTreeMax: Policy Gradient with Tree Search
Figure 2 for SoftTreeMax: Policy Gradient with Tree Search
Figure 3 for SoftTreeMax: Policy Gradient with Tree Search
Viaarxiv icon

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Add code
Jul 05, 2022
Figure 1 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 2 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 3 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Figure 4 for Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
Viaarxiv icon

Reinforcement Learning with a Terminator

Add code
May 30, 2022
Figure 1 for Reinforcement Learning with a Terminator
Figure 2 for Reinforcement Learning with a Terminator
Figure 3 for Reinforcement Learning with a Terminator
Figure 4 for Reinforcement Learning with a Terminator
Viaarxiv icon

Planning and Learning with Adaptive Lookahead

Add code
Jan 28, 2022
Viaarxiv icon

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

Add code
Oct 13, 2021
Figure 1 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 2 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 3 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 4 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Viaarxiv icon