Rei Sato

Stepwise Alignment for Constrained Language Model Policy Optimization

Apr 17, 2024
Via arXiv

Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning

Jan 31, 2023
Via arXiv

Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification

Nov 07, 2022
Via arXiv

AdvantageNAS: Efficient Neural Architecture Search with Credit Assignment

Dec 11, 2020
Via arXiv