Gao Huang

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Nov 04, 2024
Via arXiv

How Far is Video Generation from World Model: A Physical Law Perspective

Nov 04, 2024
Via arXiv

Exploring contextual modeling with linear complexity for point cloud segmentation

Oct 28, 2024
Via arXiv

LLM-based Optimization of Compound AI Systems: A Survey

Oct 21, 2024
Via arXiv

Differential Transformer

Oct 07, 2024
Via arXiv

Dynamic Diffusion Transformer

Oct 04, 2024
Via arXiv

Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation

Sep 24, 2024
Via arXiv

OStr-DARTS: Differentiable Neural Architecture Search based on Operation Strength

Sep 22, 2024
Via arXiv

Multiple-Exit Tuning: Towards Inference-Efficient Adaptation for Vision Transformer

Sep 21, 2024
Via arXiv

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Aug 31, 2024
Via arXiv