Picture for Yizeng Han

Yizeng Han

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Add code
Nov 04, 2024
Figure 1 for DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Figure 2 for DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Figure 3 for DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Figure 4 for DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Viaarxiv icon

Exploring contextual modeling with linear complexity for point cloud segmentation

Add code
Oct 28, 2024
Viaarxiv icon

Dynamic Diffusion Transformer

Add code
Oct 04, 2024
Viaarxiv icon

Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation

Add code
Sep 24, 2024
Figure 1 for Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation
Figure 2 for Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation
Figure 3 for Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation
Figure 4 for Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation
Viaarxiv icon

OStr-DARTS: Differentiable Neural Architecture Search based on Operation Strength

Add code
Sep 22, 2024
Viaarxiv icon

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Add code
Aug 11, 2024
Figure 1 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 2 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 3 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 4 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Viaarxiv icon

UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation

Add code
Jul 29, 2024
Viaarxiv icon

DyFADet: Dynamic Feature Aggregation for Temporal Action Detection

Add code
Jul 03, 2024
Viaarxiv icon

Demystify Mamba in Vision: A Linear Attention Perspective

Add code
May 26, 2024
Viaarxiv icon

EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training

Add code
May 14, 2024
Viaarxiv icon