Jianxin Liang

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

Nov 27, 2024
Via arXiv

Understanding Multimodal Hallucination with Parameter-Free Representation Alignment

Sep 02, 2024
Via arXiv

End-to-End Video Question Answering with Frame Scoring Mechanisms and Adaptive Sampling

Jul 23, 2024
Via arXiv

HawkEye: Training Video-Text LLMs for Grounding Text in Videos

Mar 15, 2024
Via arXiv

LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding

Feb 25, 2024
Via arXiv