Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PEAN: A Diffusion-based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution

Nov 29, 2023

Zuoyan Zhao, Shipeng Zhu, Pengfei Fang, Hui Xue

Figure 1 for PEAN: A Diffusion-based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution

Figure 2 for PEAN: A Diffusion-based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution

Figure 3 for PEAN: A Diffusion-based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution

Figure 4 for PEAN: A Diffusion-based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution

Share this with someone who'll enjoy it:

Abstract:Scene text image super-resolution (STISR) aims at simultaneously increasing the resolution and readability of low-resolution scene text images, thus boosting the performance of the downstream recognition task. Two factors in scene text images, semantic information and visual structure, affect the recognition performance significantly. To mitigate the effects from these factors, this paper proposes a Prior-Enhanced Attention Network (PEAN). Specifically, a diffusion-based module is developed to enhance the text prior, hence offering better guidance for the SR network to generate SR images with higher semantic accuracy. Meanwhile, the proposed PEAN leverages an attention-based modulation module to understand scene text images by neatly perceiving the local and global dependence of images, despite the shape of the text. A multi-task learning paradigm is employed to optimize the network, enabling the model to generate legible SR images. As a result, PEAN establishes new SOTA results on the TextZoom benchmark. Experiments are also conducted to analyze the importance of the enhanced text prior as a means of improving the performance of the SR network. Code will be made available at https://github.com/jdfxzzy/PEAN.

View paper on

Share this with someone who'll enjoy it:

Title:PEAN: A Diffusion-based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution

Paper and Code