Picture for Takumi Tanabe

Takumi Tanabe

Stepwise Alignment for Constrained Language Model Policy Optimization

Add code
Apr 17, 2024
Viaarxiv icon

Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification

Add code
Nov 07, 2022
Viaarxiv icon

Level Generation for Angry Birds with Sequential VAE and Latent Variable Evolution

Add code
Apr 13, 2021
Figure 1 for Level Generation for Angry Birds with Sequential VAE and Latent Variable Evolution
Figure 2 for Level Generation for Angry Birds with Sequential VAE and Latent Variable Evolution
Figure 3 for Level Generation for Angry Birds with Sequential VAE and Latent Variable Evolution
Figure 4 for Level Generation for Angry Birds with Sequential VAE and Latent Variable Evolution
Viaarxiv icon