Title: SEM-Net: Efficient Pixel Modelling for Image Inpainting with Spatially Enhanced SSM

Nov 10, 2024

Shuang Chen, Haozheng Zhang, Amir Atapour-Abarghouei, Hubert P. H. Shum

Abstract: Image inpainting aims to repair a partially damaged image based on information from its known regions. Achieving semantically plausible inpainting results is particularly challenging because it requires the reconstructed regions to exhibit patterns similar to the semantically consistent regions. This requires a model with a strong capacity to capture long-range dependencies (LRDs). Existing models struggle in this regard: the receptive field of Convolutional Neural Network (CNN) based methods grows slowly, and Transformer-based methods rely on patch-level interactions, both of which are ineffective for capturing LRDs. Motivated by this, we propose SEM-Net, a novel visual State Space Model (SSM) network that models corrupted images at the pixel level while capturing LRDs in state space with linear computational complexity. To address the inherent lack of spatial awareness in SSMs, we introduce the Snake Mamba Block (SMB) and the Spatially-Enhanced Feedforward Network. These innovations enable SEM-Net to outperform state-of-the-art inpainting methods on two distinct datasets, showing significant improvements in capturing LRDs and enhanced spatial consistency. Additionally, SEM-Net achieves state-of-the-art performance on motion deblurring, demonstrating its generalizability. Our source code will be released at https://github.com/ChrisChen1023/SEM-Net.

* Accepted by WACV 2025
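
The abstract does not spell out how pixels are ordered or how the state is updated, so the following is only a rough sketch of the general idea, not the authors' implementation. It assumes that "snake" refers to a boustrophedon traversal (every other row reversed, so neighbouring sequence elements stay spatially adjacent) and uses a toy scalar linear recurrence in place of a real Mamba layer, whose parameters are input-dependent. The names `snake_flatten`, `ssm_scan`, `a`, `b`, and `c` are hypothetical.

```python
import numpy as np


def snake_flatten(image):
    """Flatten an (H, W) array row by row, reversing every odd row.

    Assumption: this boustrophedon order is one plausible reading of
    "snake" scanning; it keeps consecutive elements spatially adjacent.
    """
    rows = [row if i % 2 == 0 else row[::-1] for i, row in enumerate(image)]
    return np.concatenate(rows)


def ssm_scan(x, a=0.9, b=1.0, c=1.0):
    """Toy scalar state space recurrence over a 1-D sequence.

    h_t = a * h_{t-1} + b * x_t,   y_t = c * h_t
    A single pass over the sequence, so cost grows linearly with length.
    The fixed scalars a, b, c are illustrative placeholders only.
    """
    h = 0.0
    y = np.empty(len(x), dtype=float)
    for t, x_t in enumerate(x):
        h = a * h + b * x_t
        y[t] = c * h
    return y


image = np.arange(12, dtype=float).reshape(3, 4)  # tiny 3x4 "image"
seq = snake_flatten(image)   # pixel-level sequence in snake order
out = ssm_scan(seq)          # one output per pixel, computed in O(H * W)
```

Each output depends on every earlier pixel in the sequence through the running state `h`, which is how a state space recurrence can carry long-range information at pixel level without the quadratic cost of pairwise attention.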
