Norio Kosaka

Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions

Oct 15, 2024
Via arXiv

Designing an offline reinforcement learning objective from scratch

Jan 30, 2023
Via arXiv

PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference

Mar 01, 2020
Via arXiv