Picture for Bernard Ghanem

Bernard Ghanem

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning

Add code
Apr 08, 2025
Viaarxiv icon

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning

Add code
Apr 01, 2025
Viaarxiv icon

BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding

Add code
Mar 27, 2025
Viaarxiv icon

Can Video Diffusion Model Reconstruct 4D Geometry?

Add code
Mar 27, 2025
Viaarxiv icon

Structured-Noise Masked Modeling for Video, Audio and Beyond

Add code
Mar 20, 2025
Viaarxiv icon

TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos

Add code
Mar 09, 2025
Viaarxiv icon

DiffCLIP: Differential Attention Meets CLIP

Add code
Mar 09, 2025
Viaarxiv icon

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection

Add code
Feb 27, 2025
Viaarxiv icon

Shh, don't say that! Domain Certification in LLMs

Add code
Feb 26, 2025
Viaarxiv icon

Optimizing Singular Spectrum for Large Language Model Compression

Add code
Feb 20, 2025
Viaarxiv icon