Picture for Xiaoqin Zhang

Xiaoqin Zhang

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Viaarxiv icon

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models

Add code
Oct 15, 2024
Viaarxiv icon

Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection

Add code
Jul 17, 2024
Viaarxiv icon

CorrMAE: Pre-training Correspondence Transformers with Masked Autoencoder

Add code
Jun 09, 2024
Viaarxiv icon

One-shot Training for Video Object Segmentation

Add code
May 22, 2024
Viaarxiv icon

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

Add code
May 13, 2024
Viaarxiv icon

Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

Add code
Apr 18, 2024
Viaarxiv icon

Masked AutoDecoder is Effective Multi-Task Vision Generalist

Add code
Mar 14, 2024
Figure 1 for Masked AutoDecoder is Effective Multi-Task Vision Generalist
Figure 2 for Masked AutoDecoder is Effective Multi-Task Vision Generalist
Figure 3 for Masked AutoDecoder is Effective Multi-Task Vision Generalist
Figure 4 for Masked AutoDecoder is Effective Multi-Task Vision Generalist
Viaarxiv icon

Weakly Supervised Monocular 3D Detection with a Single-View Image

Add code
Feb 29, 2024
Viaarxiv icon

CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Add code
Feb 06, 2024
Figure 1 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model
Figure 2 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model
Figure 3 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model
Figure 4 for CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model
Viaarxiv icon