Picture for Uri Gadot

Uri Gadot

Policy Gradient with Tree Search: Avoiding Local Optimas through Lookahead

Add code
Jun 08, 2025
Viaarxiv icon

Policy Optimized Text-to-Image Pipeline Design

Add code
May 27, 2025
Viaarxiv icon

$\text{M}^{\text{3}}$: A Modular World Model over Streams of Tokens

Add code
Feb 20, 2025
Viaarxiv icon

RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression

Add code
Jan 21, 2025
Viaarxiv icon

Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization

Add code
Sep 03, 2023
Viaarxiv icon

Robust Reinforcement Learning via Adversarial Kernel Approximation

Add code
Jun 09, 2023
Viaarxiv icon