Hanhan Zhou

RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Oct 21, 2024
Via arXiv

Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

Mar 22, 2024
Via arXiv

Real-time Network Intrusion Detection via Decision Transformers

Dec 17, 2023
Via arXiv

Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction

Oct 26, 2023
Via arXiv

Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

Aug 28, 2023
Via arXiv

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

Feb 28, 2023
Via arXiv

ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

Feb 11, 2023
Via arXiv

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

Jun 22, 2022
Via arXiv

On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning

Feb 09, 2022
Via arXiv

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

Jan 04, 2022
Via arXiv