Nitsan Soffair

Markov flow policy -- deep MC

May 01, 2024 (via arXiv)

Conservative DDPG -- Pessimistic RL without Ensemble

Mar 08, 2024 (via arXiv)

SQT -- std $Q$-target

Feb 12, 2024 (via arXiv)

MinMaxMin $Q$-learning

Feb 12, 2024 (via arXiv)

Optimizing Agent Collaboration through Heuristic Multi-Agent Planning

Jan 04, 2023 (via arXiv)