Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Enyuan Zhao

Copy-Move Forgery Detection and Question Answering for Remote Sensing Image

Dec 03, 2024

Ze Zhang, Enyuan Zhao, Ziyi Wan, Jie Nie, Xinyue Liang, Lei Huang

Figure 1 for Copy-Move Forgery Detection and Question Answering for Remote Sensing Image

Figure 2 for Copy-Move Forgery Detection and Question Answering for Remote Sensing Image

Figure 3 for Copy-Move Forgery Detection and Question Answering for Remote Sensing Image

Figure 4 for Copy-Move Forgery Detection and Question Answering for Remote Sensing Image

Abstract:This paper introduces the task of Remote Sensing Copy-Move Question Answering (RSCMQA). Unlike traditional Remote Sensing Visual Question Answering (RSVQA), RSCMQA focuses on interpreting complex tampering scenarios and inferring relationships between objects. Based on the practical needs of national defense security and land resource monitoring, we have developed an accurate and comprehensive global dataset for remote sensing image copy-move question answering, named RS-CMQA-2.1M. These images were collected from 29 different regions across 14 countries. Additionally, we have refined a balanced dataset, RS-CMQA-B, to address the long-standing issue of long-tail data in the remote sensing field. Furthermore, we propose a region-discriminative guided multimodal CMQA model, which enhances the accuracy of answering questions about tampered images by leveraging prompt about the differences and connections between the source and tampered domains. Extensive experiments demonstrate that our method provides a stronger benchmark for RS-CMQA compared to general VQA and RSVQA models. Our dataset and code are available at https://github.com/shenyedepisa/RSCMQA.

* 7 figs, 7 tables

Via

Access Paper or Ask Questions