Ruyang Liu

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Dec 02, 2024

PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance

Nov 05, 2024

MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval

Aug 20, 2024

RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter

May 29, 2024

ST-LLM: Large Language Models Are Effective Temporal Learners

Mar 30, 2024

Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding

Nov 25, 2023

One For All: Video Conversation is Feasible Without Video Instruction Tuning

Sep 27, 2023

Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring

Jan 26, 2023