Flint Xiaofeng Fan

FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHF

Dec 20, 2024
Via arXiv

Quality Diversity Imitation Learning

Oct 08, 2024
Via arXiv

An LLM-based Recommender System Environment

Jun 01, 2024
Via arXiv

CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

Mar 29, 2024
Via arXiv

Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence

Jan 07, 2024
Via arXiv

Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning

Jun 28, 2023
Via arXiv

FedHQL: Federated Heterogeneous Q-Learning

Jan 26, 2023
Via arXiv

Federated Neural Bandit

May 28, 2022
Via arXiv

Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

Oct 26, 2021
Via arXiv