Nikolaus H. R. Howe

Defining and Characterizing Reward Hacking

Sep 27, 2022
Via arXiv

Myriad: a real-world testbed to bridge trajectory optimization and deep learning

Feb 22, 2022
Via arXiv