Picture for Jiaming Zhang

Jiaming Zhang

ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety

Add code
Apr 21, 2026
Viaarxiv icon

Benign Overfitting in Adversarial Training for Vision Transformers

Add code
Apr 21, 2026
Viaarxiv icon

RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization

Add code
Mar 29, 2026
Viaarxiv icon

Not an Obstacle for Dog, but a Hazard for Human: A Co-Ego Navigation System for Guide Dog Robots

Add code
Mar 20, 2026
Viaarxiv icon

Taming the Long Tail: Efficient Item-wise Sharpness-Aware Minimization for LLM-based Recommender Systems

Add code
Mar 13, 2026
Viaarxiv icon

DriveXQA: Cross-modal Visual Question Answering for Adverse Driving Scene Understanding

Add code
Mar 11, 2026
Viaarxiv icon

More than the Sum: Panorama-Language Models for Adverse Omni-Scenes

Add code
Mar 10, 2026
Viaarxiv icon

Extend Your Horizon: A Device-Agnostic Surgical Tool Tracking Framework with Multi-View Optimization for Augmented Reality

Add code
Mar 09, 2026
Viaarxiv icon

SGR3 Model: Scene Graph Retrieval-Reasoning Model in 3D

Add code
Mar 04, 2026
Viaarxiv icon

ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction

Add code
Feb 24, 2026
Viaarxiv icon