Picture for Xiaogang Wang

Xiaogang Wang

Beihang University, Simon Fraser University

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Add code
Dec 12, 2024
Viaarxiv icon

TCFormer: Visual Recognition via Token Clustering Transformer

Add code
Jul 16, 2024
Viaarxiv icon

Parametric Primitive Analysis of CAD Sketches with Vision Transformer

Add code
Jun 29, 2024
Viaarxiv icon

FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance

Add code
Jun 04, 2024
Viaarxiv icon

Phased Consistency Model

Add code
May 28, 2024
Viaarxiv icon

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Add code
Dec 25, 2023
Figure 1 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Figure 2 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Figure 3 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Figure 4 for DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Viaarxiv icon

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Add code
Dec 20, 2023
Viaarxiv icon

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

Add code
Dec 14, 2023
Figure 1 for Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Figure 2 for Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Figure 3 for Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Figure 4 for Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Viaarxiv icon

Digital Life Project: Autonomous 3D Characters with Social Intelligence

Add code
Dec 07, 2023
Viaarxiv icon

CoNe: Contrast Your Neighbours for Supervised Image Classification

Add code
Aug 21, 2023
Viaarxiv icon