Picture for Mohammed Bennamoun

Mohammed Bennamoun

UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation

Add code
Nov 13, 2024
Viaarxiv icon

Referring Human Pose and Mask Estimation in the Wild

Add code
Oct 27, 2024
Viaarxiv icon

Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels

Add code
Oct 05, 2024
Viaarxiv icon

A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures

Add code
Aug 22, 2024
Figure 1 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 2 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 3 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 4 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Viaarxiv icon

Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions

Add code
Jul 27, 2024
Viaarxiv icon

DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Add code
Jul 06, 2024
Viaarxiv icon

Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey

Add code
Jun 28, 2024
Viaarxiv icon

Supervised Radio Frequency Interference Detection with SNNs

Add code
Jun 10, 2024
Viaarxiv icon

CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

Add code
Jun 07, 2024
Viaarxiv icon

Language Model Guided Interpretable Video Action Reasoning

Add code
Apr 02, 2024
Viaarxiv icon