Mingfei Sun

The University of Manchester

Gradient Regularized Natural Gradients

Jan 26, 2026

Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning

Jan 26, 2026

Softly Constrained Denoisers for Diffusion Models

Dec 20, 2025

From Grunts to Grammar: Emergent Language from Cooperative Foraging

May 19, 2025

Reinforcement Learning (RL) Meets Urban Climate Modeling: Investigating the Efficacy and Impacts of RL-Based HVAC Control

May 11, 2025

Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning

Feb 08, 2025

$TAR^2$: Temporal-Agent Reward Redistribution for Optimal Policy Preservation in Multi-Agent Reinforcement Learning

Feb 07, 2025

Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning

Dec 19, 2024

LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models

Oct 15, 2024

DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models

Sep 17, 2024