Picture for Orr Zohar

Orr Zohar

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Add code
Dec 13, 2024
Viaarxiv icon

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Add code
Jul 08, 2024
Figure 1 for Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
Figure 2 for Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
Figure 3 for Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
Figure 4 for Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
Viaarxiv icon

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Add code
Mar 15, 2024
Viaarxiv icon

Open World Object Detection in the Era of Foundation Models

Add code
Dec 10, 2023
Viaarxiv icon

LOVM: Language-Only Vision Model Selection

Add code
Jun 15, 2023
Viaarxiv icon

PROB: Probabilistic Objectness for Open World Object Detection

Add code
Dec 02, 2022
Viaarxiv icon