Picture for Zhiwu Qing

Zhiwu Qing

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

Add code
Jul 22, 2024
Figure 1 for Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
Figure 2 for Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
Figure 3 for Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
Figure 4 for Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
Viaarxiv icon

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

Add code
Dec 25, 2023
Figure 1 for A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Figure 2 for A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Figure 3 for A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Figure 4 for A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Viaarxiv icon

Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation

Add code
Dec 07, 2023
Viaarxiv icon

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion

Add code
Dec 07, 2023
Viaarxiv icon

Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning

Add code
Sep 14, 2023
Viaarxiv icon

HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation

Add code
Aug 24, 2023
Viaarxiv icon

Temporally-Adaptive Models for Efficient Video Understanding

Add code
Aug 10, 2023
Viaarxiv icon

MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition

Add code
Apr 03, 2023
Viaarxiv icon

Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition

Add code
Mar 25, 2023
Viaarxiv icon

HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition

Add code
Jan 09, 2023
Viaarxiv icon