Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Irem Ulku

Near-Infrared and Low-Rank Adaptation of Vision Transformers in Remote Sensing

May 28, 2024

Irem Ulku, O. Ozgur Tanriover, Erdem Akagündüz

Abstract:Plant health can be monitored dynamically using multispectral sensors that measure Near-Infrared reflectance (NIR). Despite this potential, obtaining and annotating high-resolution NIR images poses a significant challenge for training deep neural networks. Typically, large networks pre-trained on the RGB domain are utilized to fine-tune infrared images. This practice introduces a domain shift issue because of the differing visual traits between RGB and NIR images.As an alternative to fine-tuning, a method called low-rank adaptation (LoRA) enables more efficient training by optimizing rank-decomposition matrices while keeping the original network weights frozen. However, existing parameter-efficient adaptation strategies for remote sensing images focus on RGB images and overlook domain shift issues in the NIR domain. Therefore, this study investigates the potential benefits of using vision transformer (ViT) backbones pre-trained in the RGB domain, with low-rank adaptation for downstream tasks in the NIR domain. Extensive experiments demonstrate that employing LoRA with pre-trained ViT backbones yields the best performance for downstream tasks applied to NIR images.

* 7 pages, 3 figures, 3 tables

Via

Access Paper or Ask Questions

A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images

Dec 21, 2019

Irem Ulku, Erdem Akagunduz

Figure 1 for A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images

Figure 2 for A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images

Figure 3 for A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images

Figure 4 for A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D images

Abstract:Semantic segmentation is the pixel-wise labelling of an image. Since the problem is defined at the pixel level, determining image class labels only is not acceptable, but localising them at the original image pixel resolution is necessary. Boosted by the extraordinary ability of convolutional neural networks (CNN) in creating semantic, high level and hierarchical image features; excessive numbers of deep learning-based 2D semantic segmentation approaches have been proposed within the last decade. In this survey, we mainly focus on the recent scientific developments in semantic segmentation, specifically on deep learning-based methods using 2D images. We started with an analysis of the public image sets and leaderboards for 2D semantic segmantation, with an overview of the techniques employed in performance evaluation. In examining the evolution of the field, we chronologically categorised the approaches into three main periods, namely pre-and early deep learning era, the fully convolutional era, and the post-FCN era. We technically analysed the solutions put forward in terms of solving the fundamental problems of the field, such as fine-grained localisation and scale invariance. Before drawing our conclusions, we present a table of methods from all mentioned eras, with a brief summary of each approach that explains their contribution to the field. We conclude the survey by discussing the current challenges of the field and to what extent they have been solved.

* Submitted to Pattern Recognition Journal (13.12.2029)

Via

Access Paper or Ask Questions