Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

Nov 16, 2023

Jiaxin Zhang, Joy Rimchala, Lalla Mouatadid, Kamalika Das, Sricharan Kumar

Figure 1 for DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

Figure 2 for DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

Figure 3 for DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

Figure 4 for DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

Share this with someone who'll enjoy it:

Abstract:The performance of optical character recognition (OCR) heavily relies on document image quality, which is crucial for automatic document processing and document intelligence. However, most existing document enhancement methods require supervised data pairs, which raises concerns about data separation and privacy protection, and makes it challenging to adapt these methods to new domain pairs. To address these issues, we propose DECDM, an end-to-end document-level image translation method inspired by recent advances in diffusion models. Our method overcomes the limitations of paired training by independently training the source (noisy input) and target (clean output) models, making it possible to apply domain-specific diffusion models to other pairs. DECDM trains on one dataset at a time, eliminating the need to scan both datasets concurrently, and effectively preserving data privacy from the source or target domain. We also introduce simple data augmentation strategies to improve character-glyph conservation during translation. We compare DECDM with state-of-the-art methods on multiple synthetic data and benchmark datasets, such as document denoising and {\color{black}shadow} removal, and demonstrate the superiority of performance quantitatively and qualitatively.

* Accepted by IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

View paper on

Share this with someone who'll enjoy it:

Title:DECDM: Document Enhancement using Cycle-Consistent Diffusion Models

Paper and Code