Bencheng Liao

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

Dec 24, 2025
Via arXiv

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Dec 09, 2025
Via arXiv

DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving

Dec 08, 2025
Via arXiv

Seed1.5-VL Technical Report

May 11, 2025
Via arXiv

MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling

Mar 17, 2025
Via arXiv

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Mar 11, 2025
Via arXiv

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Feb 18, 2025
Via arXiv

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Feb 18, 2025
Via arXiv

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Nov 22, 2024
Via arXiv

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Oct 29, 2024
Via arXiv