Picture for Junshan Zhang

Junshan Zhang

Sherman

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

Add code
Feb 23, 2026
Viaarxiv icon

Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking

Add code
Feb 02, 2026
Viaarxiv icon

VITA: Vision-to-Action Flow Matching Policy

Add code
Jul 17, 2025
Viaarxiv icon

Ego-centric Learning of Communicative World Models for Autonomous Driving

Add code
Jun 09, 2025
Figure 1 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Figure 2 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Figure 3 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Figure 4 for Ego-centric Learning of Communicative World Models for Autonomous Driving
Viaarxiv icon

IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning

Add code
May 15, 2025
Viaarxiv icon

AugFL: Augmenting Federated Learning with Pretrained Models

Add code
Mar 04, 2025
Viaarxiv icon

Heterogeneous Decision Making in Mixed Traffic: Uncertainty-aware Planning and Bounded Rationality

Add code
Feb 25, 2025
Viaarxiv icon

Towards Unraveling and Improving Generalization in World Models

Add code
Dec 31, 2024
Viaarxiv icon

EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models

Add code
Dec 13, 2024
Figure 1 for EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models
Figure 2 for EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models
Figure 3 for EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models
Figure 4 for EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models
Viaarxiv icon

OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning

Add code
May 29, 2024
Figure 1 for OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
Figure 2 for OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
Figure 3 for OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
Figure 4 for OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
Viaarxiv icon