Picture for Qian Zhang

Qian Zhang

University of California Riverside

A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models

Add code
Mar 11, 2025
Viaarxiv icon

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Add code
Mar 11, 2025
Viaarxiv icon

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Add code
Mar 10, 2025
Viaarxiv icon

Customized SAM 2 for Referring Remote Sensing Image Segmentation

Add code
Mar 10, 2025
Viaarxiv icon

GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving

Add code
Mar 07, 2025
Viaarxiv icon

LLM-EvRep: Learning an LLM-Compatible Event Representation Using a Self-Supervised Framework

Add code
Feb 20, 2025
Viaarxiv icon

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Add code
Feb 18, 2025
Viaarxiv icon

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Add code
Feb 18, 2025
Viaarxiv icon

MTDP: Modulated Transformer Diffusion Policy Model

Add code
Feb 13, 2025
Viaarxiv icon

MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation

Add code
Jan 23, 2025
Figure 1 for MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Figure 2 for MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Figure 3 for MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Figure 4 for MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Viaarxiv icon