Guangxing Han

TIPS: Text-Image Pretraining with Spatial Awareness

Oct 21, 2024

WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization

May 28, 2024

Mitigating Dialogue Hallucination for Large Multi-modal Models via Adversarial Instruction Tuning

Mar 15, 2024

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

Dec 19, 2023

Supervised Masked Knowledge Distillation for Few-Shot Transformers

Mar 29, 2023

DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection

Mar 16, 2023

TempCLR: Temporal Alignment Representation with Contrastive Learning

Dec 28, 2022

Explicit Image Caption Editing

Jul 20, 2022

Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting

Apr 16, 2022

Few-Shot Object Detection with Fully Cross-Transformer

Mar 28, 2022