David Fan

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Dec 18, 2024
Via arXiv

NowYouSee Me: Context-Aware Automatic Audio Description

Dec 13, 2024
Via arXiv

GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning

Dec 10, 2024
Via arXiv

Video Token Merging for Long-form Video Understanding

Oct 31, 2024
Via arXiv

Text-Guided Video Masked Autoencoder

Aug 01, 2024
Via arXiv

Motion-Guided Masking for Spatiotemporal Representation Learning

Aug 24, 2023
Via arXiv

MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation

Aug 22, 2023
Via arXiv

A Multi-step Dynamics Modeling Framework For Autonomous Driving In Multiple Environments

May 03, 2023
Via arXiv

Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos

Mar 13, 2023
Via arXiv

PrePARE: Predictive Proprioception for Agile Failure Event Detection in Robotic Exploration of Extreme Terrains

Jul 30, 2022
Via arXiv