Picture for Dongge Han

Dongge Han

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Add code
Apr 22, 2024
Viaarxiv icon

Multiagent Model-based Credit Assignment for Continuous Control

Add code
Dec 27, 2021
Figure 1 for Multiagent Model-based Credit Assignment for Continuous Control
Figure 2 for Multiagent Model-based Credit Assignment for Continuous Control
Figure 3 for Multiagent Model-based Credit Assignment for Continuous Control
Figure 4 for Multiagent Model-based Credit Assignment for Continuous Control
Viaarxiv icon

MDP Abstraction with Successor Features

Add code
Oct 18, 2021
Figure 1 for MDP Abstraction with Successor Features
Figure 2 for MDP Abstraction with Successor Features
Figure 3 for MDP Abstraction with Successor Features
Figure 4 for MDP Abstraction with Successor Features
Viaarxiv icon

Replication-Robust Payoff-Allocation with Applications in Machine Learning Marketplaces

Add code
Jun 25, 2020
Figure 1 for Replication-Robust Payoff-Allocation with Applications in Machine Learning Marketplaces
Figure 2 for Replication-Robust Payoff-Allocation with Applications in Machine Learning Marketplaces
Figure 3 for Replication-Robust Payoff-Allocation with Applications in Machine Learning Marketplaces
Figure 4 for Replication-Robust Payoff-Allocation with Applications in Machine Learning Marketplaces
Viaarxiv icon

Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination

Add code
Oct 21, 2019
Figure 1 for Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination
Figure 2 for Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination
Viaarxiv icon