Picture for Jinrui Zhang

Jinrui Zhang

Differential Informed Auto-Encoder

Add code
Oct 24, 2024
Viaarxiv icon

MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos

Add code
Sep 06, 2024
Figure 1 for MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos
Figure 2 for MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos
Figure 3 for MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos
Figure 4 for MultiCounter: Multiple Action Agnostic Repetition Counting in Untrimmed Videos
Viaarxiv icon

Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models

Add code
Jul 16, 2024
Viaarxiv icon

Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views

Add code
Sep 21, 2023
Viaarxiv icon

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Add code
Jul 31, 2023
Viaarxiv icon

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning

Add code
Jun 17, 2023
Viaarxiv icon

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

Add code
May 08, 2023
Viaarxiv icon

Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos

Add code
Mar 11, 2023
Viaarxiv icon

Exploiting Context Information for Generic Event Boundary Captioning

Add code
Jul 03, 2022
Figure 1 for Exploiting Context Information for Generic Event Boundary Captioning
Figure 2 for Exploiting Context Information for Generic Event Boundary Captioning
Viaarxiv icon