Picture for Zejia Weng

Zejia Weng

GenRec: Unifying Video Generation and Recognition with Diffusion Models

Add code
Aug 27, 2024
Figure 1 for GenRec: Unifying Video Generation and Recognition with Diffusion Models
Figure 2 for GenRec: Unifying Video Generation and Recognition with Diffusion Models
Figure 3 for GenRec: Unifying Video Generation and Recognition with Diffusion Models
Figure 4 for GenRec: Unifying Video Generation and Recognition with Diffusion Models
Viaarxiv icon

AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction

Add code
Jun 10, 2024
Viaarxiv icon

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Add code
Nov 29, 2023
Viaarxiv icon

Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data

Add code
Oct 08, 2023
Viaarxiv icon

BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning

Add code
May 22, 2023
Viaarxiv icon

Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization

Add code
Feb 01, 2023
Viaarxiv icon

Semi-Supervised Vision Transformers

Add code
Nov 22, 2021
Figure 1 for Semi-Supervised Vision Transformers
Figure 2 for Semi-Supervised Vision Transformers
Figure 3 for Semi-Supervised Vision Transformers
Figure 4 for Semi-Supervised Vision Transformers
Viaarxiv icon

A Multimodal Framework for Video Ads Understanding

Add code
Aug 29, 2021
Figure 1 for A Multimodal Framework for Video Ads Understanding
Figure 2 for A Multimodal Framework for Video Ads Understanding
Figure 3 for A Multimodal Framework for Video Ads Understanding
Figure 4 for A Multimodal Framework for Video Ads Understanding
Viaarxiv icon

Cross-domain Contrastive Learning for Unsupervised Domain Adaptation

Add code
Jun 10, 2021
Figure 1 for Cross-domain Contrastive Learning for Unsupervised Domain Adaptation
Figure 2 for Cross-domain Contrastive Learning for Unsupervised Domain Adaptation
Figure 3 for Cross-domain Contrastive Learning for Unsupervised Domain Adaptation
Figure 4 for Cross-domain Contrastive Learning for Unsupervised Domain Adaptation
Viaarxiv icon

VideoLT: Large-scale Long-tailed Video Recognition

Add code
May 06, 2021
Figure 1 for VideoLT: Large-scale Long-tailed Video Recognition
Figure 2 for VideoLT: Large-scale Long-tailed Video Recognition
Figure 3 for VideoLT: Large-scale Long-tailed Video Recognition
Figure 4 for VideoLT: Large-scale Long-tailed Video Recognition
Viaarxiv icon