Thomy Phan

LMU Munich

Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic

Aug 06, 2024
Via arXiv

Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization

Jul 30, 2024
Via arXiv

Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms

Jan 13, 2024
Via arXiv

ClusterComm: Discrete Communication in Decentralized MARL using Internal Representation Clustering

Jan 07, 2024
Via arXiv

Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search

Jan 01, 2024
Via arXiv

Challenges for Reinforcement Learning in Quantum Computing

Dec 18, 2023
Via arXiv

Multi-Agent Quantum Reinforcement Learning using Evolutionary Optimization

Nov 09, 2023
Via arXiv

CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing

Apr 26, 2023
Via arXiv

DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training

Jan 18, 2023
Via arXiv

Capturing Dependencies within Machine Learning via a Formal Process Model

Aug 10, 2022
Via arXiv