Jakub Grudzien Kuba

Cliqueformer: Model-Based Optimization with Structured Transformers

Oct 17, 2024

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization

Jan 12, 2024

IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies

Apr 20, 2023

Heterogeneous-Agent Reinforcement Learning

Apr 19, 2023

Discovered Policy Optimisation

Oct 13, 2022

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL

Aug 02, 2022

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

May 30, 2022

Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning

Feb 16, 2022

Mirror Learning: A Unifying Framework of Policy Optimisation

Feb 02, 2022

Multi-Agent Constrained Policy Optimisation

Oct 06, 2021