Picture for Zhiyu Zhao

Zhiyu Zhao

AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

Add code
Jul 05, 2024
Viaarxiv icon

Learning Macroeconomic Policies based on Microfoundations: A Stackelberg Mean Field Game Approach

Add code
Mar 14, 2024
Viaarxiv icon

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Add code
Nov 06, 2023
Viaarxiv icon

MGMAE: Motion Guided Masking for Video Masked Autoencoding

Add code
Aug 21, 2023
Viaarxiv icon

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Add code
Apr 18, 2023
Viaarxiv icon

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Add code
Dec 07, 2022
Viaarxiv icon

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Add code
Nov 17, 2022
Viaarxiv icon