Haoji Zhang

Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition

Dec 15, 2024
Via arXiv

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control

Dec 02, 2024
Via arXiv

Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation

Nov 24, 2024
Via arXiv

Hierarchical Memory for Long Video QA

Jun 30, 2024
Via arXiv

Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams

Jun 12, 2024
Via arXiv

PREIM3D: 3D Consistent Precise Image Attribute Editing from a Single Image

Apr 20, 2023
Via arXiv