Object Detection


Object detection is a computer vision task whose goal is to detect and locate objects of interest in an image or video. It involves identifying the position and boundaries of each object and classifying it into a category. It forms a crucial part of visual recognition, alongside image classification and retrieval.
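
As a rough illustration of the description above, a single detection can be represented as a bounding box (position and boundaries) plus a category label. The sketch below is a minimal example under stated assumptions: the `Detection` class, its field names, and the `iou` helper are hypothetical and not taken from any paper listed here. Intersection-over-union is included only because it is a commonly used measure of how well two boxes overlap.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """One detected object (hypothetical structure): where it is and what it is."""
    x_min: float  # left edge of the bounding box, in pixels
    y_min: float  # top edge
    x_max: float  # right edge
    y_max: float  # bottom edge
    label: str    # predicted category, e.g. "person"
    score: float  # confidence in [0, 1]


def iou(a: Detection, b: Detection) -> float:
    """Intersection-over-union of two axis-aligned boxes (0.0 if they do not overlap)."""
    inter_w = max(0.0, min(a.x_max, b.x_max) - max(a.x_min, b.x_min))
    inter_h = max(0.0, min(a.y_max, b.y_max) - max(a.y_min, b.y_min))
    inter = inter_w * inter_h
    area_a = (a.x_max - a.x_min) * (a.y_max - a.y_min)
    area_b = (b.x_max - b.x_min) * (b.y_max - b.y_min)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    predicted = Detection(0, 0, 2, 2, label="cat", score=0.9)
    reference = Detection(1, 1, 3, 3, label="cat", score=1.0)
    print(f"IoU between predicted and reference box: {iou(predicted, reference):.3f}")
```

A detector's output for one image would then be a list of such records, one per object found.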

Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning

Nov 15, 2024
Via arXiv

RETR: Multi-View Radar Detection Transformer for Indoor Perception

Nov 15, 2024
Via arXiv

MOT_FCG++: Enhanced Representation of Motion and Appearance Features

Nov 15, 2024
Via arXiv

Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras

Nov 15, 2024
Via arXiv

LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection

Nov 14, 2024
Via arXiv

Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions

Nov 15, 2024
Via arXiv

Diachronic Document Dataset for Semantic Layout Analysis

Nov 15, 2024
Via arXiv

Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction

Nov 14, 2024
Via arXiv

Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration

Nov 14, 2024
Via arXiv

Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks

Nov 14, 2024
Via arXiv