Jiahui Li

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

Mar 14, 2025
Via arXiv

AoI-Sensitive Data Forwarding with Distributed Beamforming in UAV-Assisted IoT

Feb 13, 2025
Via arXiv

UAV-assisted Joint Mobile Edge Computing and Data Collection via Matching-enabled Deep Reinforcement Learning

Feb 11, 2025
Via arXiv

Aerial Reliable Collaborative Communications for Terrestrial Mobile Users via Evolutionary Multi-Objective Deep Reinforcement Learning

Feb 09, 2025
Via arXiv

Low-altitude Friendly-Jamming for Satellite-Maritime Communications via Generative AI-enabled Deep Reinforcement Learning

Jan 26, 2025
Via arXiv

Task Delay and Energy Consumption Minimization for Low-altitude MEC via Evolutionary Multi-objective Deep Reinforcement Learning

Jan 11, 2025
Via arXiv

UAV Swarm-enabled Collaborative Post-disaster Communications in Low Altitude Economy via a Two-stage Optimization Approach

Jan 10, 2025
Via arXiv

Learning Causal Transition Matrix for Instance-dependent Label Noise

Dec 18, 2024
Via arXiv

ROMAS: A Role-Based Multi-Agent System for Database monitoring and Planning

Dec 18, 2024
Via arXiv

iPrOp: Interactive Prompt Optimization for Large Language Models with a Human in the Loop

Dec 17, 2024
Via arXiv