Picture for Xiuwei Xu

Xiuwei Xu

EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models

Add code
Mar 19, 2025
Viaarxiv icon

MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation

Add code
Mar 17, 2025
Viaarxiv icon

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Add code
Mar 13, 2025
Viaarxiv icon

Q-VLM: Post-training Quantization for Large Vision-Language Models

Add code
Oct 10, 2024
Figure 1 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Figure 2 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Figure 3 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Figure 4 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Viaarxiv icon

SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation

Add code
Oct 10, 2024
Figure 1 for SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Figure 2 for SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Figure 3 for SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Figure 4 for SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
Viaarxiv icon

EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Add code
Aug 21, 2024
Figure 1 for EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Figure 2 for EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Figure 3 for EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Figure 4 for EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Viaarxiv icon

Embodied Instruction Following in Unknown Environments

Add code
Jun 17, 2024
Figure 1 for Embodied Instruction Following in Unknown Environments
Figure 2 for Embodied Instruction Following in Unknown Environments
Figure 3 for Embodied Instruction Following in Unknown Environments
Figure 4 for Embodied Instruction Following in Unknown Environments
Viaarxiv icon

Memory-based Adapters for Online 3D Scene Perception

Add code
Mar 11, 2024
Viaarxiv icon

MCUFormer: Deploying Vision Tranformers on Microcontrollers with Limited Memory

Add code
Oct 27, 2023
Figure 1 for MCUFormer: Deploying Vision Tranformers on Microcontrollers with Limited Memory
Figure 2 for MCUFormer: Deploying Vision Tranformers on Microcontrollers with Limited Memory
Figure 3 for MCUFormer: Deploying Vision Tranformers on Microcontrollers with Limited Memory
Figure 4 for MCUFormer: Deploying Vision Tranformers on Microcontrollers with Limited Memory
Viaarxiv icon

Anyview: Generalizable Indoor 3D Object Detection with Variable Frames

Add code
Oct 09, 2023
Viaarxiv icon