Picture for Kelu Yao

Kelu Yao

Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques

Add code
Oct 15, 2024
Figure 1 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques
Figure 2 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques
Figure 3 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques
Figure 4 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques
Viaarxiv icon

OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion

Add code
Aug 22, 2024
Figure 1 for OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion
Figure 2 for OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion
Figure 3 for OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion
Figure 4 for OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion
Viaarxiv icon

Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View

Add code
May 27, 2024
Figure 1 for Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Figure 2 for Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Figure 3 for Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Figure 4 for Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Viaarxiv icon