Picture for Zhixin Zhang

Zhixin Zhang

Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines

Add code
Oct 28, 2024
Figure 1 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 2 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 3 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 4 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Viaarxiv icon

CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification

Add code
Oct 07, 2024
Viaarxiv icon

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Add code
Feb 05, 2024
Viaarxiv icon

Online Vectorized HD Map Construction using Geometry

Add code
Dec 06, 2023
Viaarxiv icon

Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

Add code
Oct 29, 2023
Figure 1 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Figure 2 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Figure 3 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Figure 4 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery
Viaarxiv icon

TransForensics: Image Forgery Localization with Dense Self-Attention

Add code
Aug 09, 2021
Figure 1 for TransForensics: Image Forgery Localization with Dense Self-Attention
Figure 2 for TransForensics: Image Forgery Localization with Dense Self-Attention
Figure 3 for TransForensics: Image Forgery Localization with Dense Self-Attention
Figure 4 for TransForensics: Image Forgery Localization with Dense Self-Attention
Viaarxiv icon

Rotated Feature Network for multi-orientation object detection

Add code
Mar 27, 2019
Figure 1 for Rotated Feature Network for multi-orientation object detection
Figure 2 for Rotated Feature Network for multi-orientation object detection
Figure 3 for Rotated Feature Network for multi-orientation object detection
Figure 4 for Rotated Feature Network for multi-orientation object detection
Viaarxiv icon