Yulin Wang

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Nov 04, 2024
Via arXiv

Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation

Sep 25, 2024
Via arXiv

Basket-Enhanced Heterogenous Hypergraph for Price-Sensitive Next Basket Recommendation

Sep 18, 2024
Via arXiv

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Aug 31, 2024
Via arXiv

UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation

Jul 29, 2024
Via arXiv

Rethinking the Architecture Design for Efficient Generic Event Boundary Detection

Jul 17, 2024
Via arXiv

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Jun 08, 2024
Via arXiv

EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training

May 14, 2024
Via arXiv

Probabilistic Contrastive Learning for Long-Tailed Visual Recognition

Mar 14, 2024
Via arXiv

Fine-grained Recognition with Learnable Semantic Data Augmentation

Sep 01, 2023
Via arXiv