Picture for Shuang Zeng

Shuang Zeng

MindDriver: Introducing Progressive Multimodal Reasoning for Autonomous Driving

Add code
Feb 25, 2026
Viaarxiv icon

Geometry-as-context: Modulating Explicit 3D in Scene-consistent Video Generation to Geometry Context

Add code
Feb 25, 2026
Viaarxiv icon

ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning

Add code
Feb 11, 2026
Viaarxiv icon

MerNav: A Highly Generalizable Memory-Execute-Review Framework for Zero-Shot Object Goal Navigation

Add code
Feb 05, 2026
Viaarxiv icon

Bridging Information Asymmetry: A Hierarchical Framework for Deterministic Blind Face Restoration

Add code
Jan 27, 2026
Viaarxiv icon

AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs

Add code
Nov 18, 2025
Figure 1 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Figure 2 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Figure 3 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Figure 4 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Viaarxiv icon

UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data

Add code
Sep 26, 2025
Viaarxiv icon

JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation

Add code
Sep 26, 2025
Figure 1 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 2 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 3 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Figure 4 for JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Viaarxiv icon

FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving

Add code
May 23, 2025
Viaarxiv icon

Novel Extraction of Discriminative Fine-Grained Feature to Improve Retinal Vessel Segmentation

Add code
May 06, 2025
Viaarxiv icon