Bo-Kai Huang

Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots

Dec 06, 2022
Via arXiv