Picture for Assaf Hallak

Assaf Hallak

Towards Large Language Models with Self-Consistent Natural Language Explanations

Add code
Jun 09, 2025
Viaarxiv icon

"You just can't go around killing people" Explaining Agent Behavior to a Human Terminator

Add code
Apr 06, 2025
Viaarxiv icon

RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression

Add code
Jan 21, 2025
Viaarxiv icon

PlaMo: Plan and Move in Rich 3D Physical Environments

Add code
Jun 26, 2024
Viaarxiv icon

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

Add code
Jan 30, 2023
Figure 1 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 2 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 3 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Figure 4 for SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Viaarxiv icon

SoftTreeMax: Policy Gradient with Tree Search

Add code
Sep 28, 2022
Figure 1 for SoftTreeMax: Policy Gradient with Tree Search
Figure 2 for SoftTreeMax: Policy Gradient with Tree Search
Figure 3 for SoftTreeMax: Policy Gradient with Tree Search
Viaarxiv icon

Reinforcement Learning with a Terminator

Add code
May 30, 2022
Figure 1 for Reinforcement Learning with a Terminator
Figure 2 for Reinforcement Learning with a Terminator
Figure 3 for Reinforcement Learning with a Terminator
Figure 4 for Reinforcement Learning with a Terminator
Viaarxiv icon

Planning and Learning with Adaptive Lookahead

Add code
Jan 28, 2022
Figure 1 for Planning and Learning with Adaptive Lookahead
Figure 2 for Planning and Learning with Adaptive Lookahead
Figure 3 for Planning and Learning with Adaptive Lookahead
Figure 4 for Planning and Learning with Adaptive Lookahead
Viaarxiv icon

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

Add code
Oct 13, 2021
Figure 1 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 2 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 3 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Figure 4 for On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Viaarxiv icon

Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction

Add code
Jul 04, 2021
Figure 1 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Figure 2 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Figure 3 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Figure 4 for Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Viaarxiv icon