Arka Sadhu

Agentic Very Long Video Understanding

Jan 26, 2026

Improving Object Detection and Attribute Recognition by Feature Entanglement Reduction

Aug 25, 2021

Video Question Answering with Phrases via Semantic Roles

Apr 08, 2021

Visual Semantic Role Labeling for Video Understanding

Apr 02, 2021

Utilizing Every Image Object for Semi-supervised Phrase Grounding

Nov 05, 2020

Video Object Grounding using Semantic Roles in Language Description

Mar 24, 2020

Zero-Shot Grounding of Objects from Natural Language Queries

Aug 20, 2019