Picture for Hanan Gani

Hanan Gani

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Add code
Nov 07, 2024
Figure 1 for VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Figure 2 for VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Figure 3 for VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Figure 4 for VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Viaarxiv icon

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment

Add code
Oct 02, 2024
Viaarxiv icon

Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models

Add code
Jul 22, 2024
Viaarxiv icon

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs

Add code
Jun 14, 2024
Viaarxiv icon

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

Add code
Feb 27, 2024
Viaarxiv icon

Multi-Attribute Vision Transformers are Efficient and Robust Learners

Add code
Feb 12, 2024
Viaarxiv icon

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization

Add code
Nov 02, 2023
Viaarxiv icon

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

Add code
Oct 16, 2023
Viaarxiv icon

How to Train Vision Transformer on Small-scale Datasets?

Add code
Oct 13, 2022
Figure 1 for How to Train Vision Transformer on Small-scale Datasets?
Figure 2 for How to Train Vision Transformer on Small-scale Datasets?
Figure 3 for How to Train Vision Transformer on Small-scale Datasets?
Figure 4 for How to Train Vision Transformer on Small-scale Datasets?
Viaarxiv icon

A Supervised Learning Methodology for Real-Time Disguised Face Recognition in the Wild

Add code
Sep 08, 2018
Figure 1 for A Supervised Learning Methodology for Real-Time Disguised Face Recognition in the Wild
Figure 2 for A Supervised Learning Methodology for Real-Time Disguised Face Recognition in the Wild
Figure 3 for A Supervised Learning Methodology for Real-Time Disguised Face Recognition in the Wild
Figure 4 for A Supervised Learning Methodology for Real-Time Disguised Face Recognition in the Wild
Viaarxiv icon