Shenghao Fu

ViSpeak: Visual Instruction Feedback in Streaming Videos

Mar 17, 2025
Via arXiv

A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection

Mar 13, 2025
Via arXiv

LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models

Jan 31, 2025
Via arXiv

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Jan 25, 2025
Via arXiv

Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models

Oct 25, 2024
Via arXiv

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

Jul 16, 2024
Via arXiv

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation

Aug 18, 2023
Via arXiv