Fine Grained Visual Categorization


TimeBlind: A Spatio-Temporal Compositionality Benchmark for Video LLMs

Add code
Jan 30, 2026
Viaarxiv icon

Beyond Accuracy: Evaluating Grounded Visual Evidence in Thinking with Images

Add code
Jan 14, 2026
Viaarxiv icon

CausalFSFG: Rethinking Few-Shot Fine-Grained Visual Categorization from Causal Perspective

Add code
Dec 25, 2025
Viaarxiv icon

APEX: Academic Poster Editing Agentic Expert

Add code
Jan 08, 2026
Viaarxiv icon

MovieRecapsQA: A Multimodal Open-Ended Video Question-Answering Benchmark

Add code
Jan 05, 2026
Viaarxiv icon

FGDCC: Fine-Grained Deep Cluster Categorization -- A Framework for Intra-Class Variability Problems in Plant Classification

Add code
Dec 23, 2025
Figure 1 for FGDCC: Fine-Grained Deep Cluster Categorization -- A Framework for Intra-Class Variability Problems in Plant Classification
Figure 2 for FGDCC: Fine-Grained Deep Cluster Categorization -- A Framework for Intra-Class Variability Problems in Plant Classification
Figure 3 for FGDCC: Fine-Grained Deep Cluster Categorization -- A Framework for Intra-Class Variability Problems in Plant Classification
Figure 4 for FGDCC: Fine-Grained Deep Cluster Categorization -- A Framework for Intra-Class Variability Problems in Plant Classification
Viaarxiv icon

EchoFoley: Event-Centric Hierarchical Control for Video Grounded Creative Sound Generation

Add code
Dec 31, 2025
Viaarxiv icon

CAVE: Detecting and Explaining Commonsense Anomalies in Visual Environments

Add code
Oct 29, 2025
Viaarxiv icon

ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts

Add code
Oct 30, 2025
Viaarxiv icon

SDS KoPub VDR: A Benchmark Dataset for Visual Document Retrieval in Korean Public Documents

Add code
Nov 07, 2025
Viaarxiv icon