Wangbo Zhao

Dynamic Diffusion Transformer

Oct 04, 2024

Prioritize Alignment in Dataset Distillation

Aug 06, 2024

Conditional LoRA Parameter Generation

Aug 02, 2024

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

May 06, 2024

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

Mar 18, 2024

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Nov 25, 2023

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation

Sep 20, 2023

Learning Referring Video Object Segmentation from Weak Annotation

Aug 04, 2023

MMBench: Is Your Multi-modal Model an All-around Player?

Jul 26, 2023

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation

Apr 06, 2022