Yulin Wang

Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition

Dec 15, 2024
Via arXiv

ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

Nov 11, 2024
Via arXiv

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Nov 04, 2024
Via arXiv

Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation

Sep 25, 2024
Via arXiv

Basket-Enhanced Heterogenous Hypergraph for Price-Sensitive Next Basket Recommendation

Sep 18, 2024
Via arXiv

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Aug 31, 2024
Via arXiv

UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation

Jul 29, 2024
Via arXiv

Rethinking the Architecture Design for Efficient Generic Event Boundary Detection

Jul 17, 2024
Via arXiv

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Jun 08, 2024
Via arXiv

EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training

May 14, 2024
Via arXiv