Picture for Guang Li

Guang Li

VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Add code
Feb 24, 2026
Viaarxiv icon

MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving

Add code
Feb 23, 2026
Viaarxiv icon

DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving

Add code
Feb 16, 2026
Viaarxiv icon

L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts

Add code
Jan 29, 2026
Viaarxiv icon

Difficulty-guided Sampling: Bridging the Target Gap between Dataset Distillation and Downstream Tasks

Add code
Jan 15, 2026
Viaarxiv icon

SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning

Add code
Jan 10, 2026
Viaarxiv icon

Foreground-Aware Dataset Distillation via Dynamic Patch Selection

Add code
Jan 06, 2026
Viaarxiv icon

Chat with UAV -- Human-UAV Interaction Based on Large Language Models

Add code
Dec 09, 2025
Viaarxiv icon

Otter: Mitigating Background Distractions of Wide-Angle Few-Shot Action Recognition with Enhanced RWKV

Add code
Nov 11, 2025
Viaarxiv icon

Privacy-Aware Continual Self-Supervised Learning on Multi-Window Chest Computed Tomography for Domain-Shift Robustness

Add code
Oct 31, 2025
Viaarxiv icon