Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques

Oct 15, 2024

Lijie Tao, Haokui Zhang, Haizhao Jing, Yu Liu, Kelu Yao, Chao Li, Xizhe Xue

Figure 1 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques

Figure 2 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques

Figure 3 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques

Figure 4 for Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques

Share this with someone who'll enjoy it:

Abstract:Recently, the remarkable success of ChatGPT has sparked a renewed wave of interest in artificial intelligence (AI), and the advancements in visual language models (VLMs) have pushed this enthusiasm to new heights. Differring from previous AI approaches that generally formulated different tasks as discriminative models, VLMs frame tasks as generative models and align language with visual information, enabling the handling of more challenging problems. The remote sensing (RS) field, a highly practical domain, has also embraced this new trend and introduced several VLM-based RS methods that have demonstrated promising performance and enormous potential. In this paper, we first review the fundamental theories related to VLM, then summarize the datasets constructed for VLMs in remote sensing and the various tasks they addressed. Finally, we categorize the improvement methods into three main parts according to the core components of VLMs and provide a detailed introduction and comparison of these methods.

View paper on

Share this with someone who'll enjoy it:

Title:Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and Enhancement Techniques

Paper and Code