Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shumpei Takezaki

Inverse Scene Text Removal

Jun 26, 2025

Takumi Yoshimatsu, Shumpei Takezaki, Seiichi Uchida

Abstract:Scene text removal (STR) aims to erase textual elements from images. It was originally intended for removing privacy-sensitiveor undesired texts from natural scene images, but is now also appliedto typographic images. STR typically detects text regions and theninpaints them. Although STR has advanced through neural networksand synthetic data, misuse risks have increased. This paper investi-gates Inverse STR (ISTR), which analyzes STR-processed images andfocuses on binary classification (detecting whether an image has un-dergone STR) and localizing removed text regions. We demonstrate inexperiments that these tasks are achievable with high accuracies, en-abling detection of potential misuse and improving STR. We also at-tempt to recover the removed text content by training a text recognizerto understand its difficulty.

* 17 pages

Via

Access Paper or Ask Questions

Guidance-base Diffusion Models for Improving Photoacoustic Image Quality

Feb 10, 2025

Tatsuhiro Eguchi, Shumpei Takezaki, Mihoko Shimano, Takayuki Yagi, Ryoma Bise

Figure 1 for Guidance-base Diffusion Models for Improving Photoacoustic Image Quality

Figure 2 for Guidance-base Diffusion Models for Improving Photoacoustic Image Quality

Figure 3 for Guidance-base Diffusion Models for Improving Photoacoustic Image Quality

Figure 4 for Guidance-base Diffusion Models for Improving Photoacoustic Image Quality

Abstract:Photoacoustic(PA) imaging is a non-destructive and non-invasive technology for visualizing minute blood vessel structures in the body using ultrasonic sensors. In PA imaging, the image quality of a single-shot image is poor, and it is necessary to improve the image quality by averaging many single-shot images. Therefore, imaging the entire subject requires high imaging costs. In our study, we propose a method to improve the quality of PA images using diffusion models. In our method, we improve the reverse diffusion process using sensor information of PA imaging and introduce a guidance method using imaging condition information to generate high-quality images.

* BMVC 2024

Via

Access Paper or Ask Questions

Self-Relaxed Joint Training: Sample Selection for Severity Estimation with Ordinal Noisy Labels

Oct 29, 2024

Shumpei Takezaki, Kiyohito Tanaka, Seiichi Uchida

Abstract:Severity level estimation is a crucial task in medical image diagnosis. However, accurately assigning severity class labels to individual images is very costly and challenging. Consequently, the attached labels tend to be noisy. In this paper, we propose a new framework for training with ``ordinal'' noisy labels. Since severity levels have an ordinal relationship, we can leverage this to train a classifier while mitigating the negative effects of noisy labels. Our framework uses two techniques: clean sample selection and dual-network architecture. A technical highlight of our approach is the use of soft labels derived from noisy hard labels. By appropriately using the soft and hard labels in the two techniques, we achieve more accurate sample selection and robust network training. The proposed method outperforms various state-of-the-art methods in experiments using two endoscopic ulcerative colitis (UC) datasets and a retinal Diabetic Retinopathy (DR) dataset. Our codes are available at https://github.com/shumpei-takezaki/Self-Relaxed-Joint-Training.

* Accepted at WACV2025

Via

Access Paper or Ask Questions

Cross-Domain Image Conversion by CycleDM

Mar 05, 2024

Sho Shimotsumagari, Shumpei Takezaki, Daichi Haraguchi, Seiichi Uchida

Abstract:The purpose of this paper is to enable the conversion between machine-printed character images (i.e., font images) and handwritten character images through machine learning. For this purpose, we propose a novel unpaired image-to-image domain conversion method, CycleDM, which incorporates the concept of CycleGAN into the diffusion model. Specifically, CycleDM has two internal conversion models that bridge the denoising processes of two image domains. These conversion models are efficiently trained without explicit correspondence between the domains. By applying machine-printed and handwritten character images to the two modalities, CycleDM realizes the conversion between them. Our experiments for evaluating the converted images quantitatively and qualitatively found that ours performs better than other comparable approaches.

Via

Access Paper or Ask Questions

An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels

Mar 01, 2024

Shumpei Takezaki, Seiichi Uchida

Abstract:Diffusion models have recently been used for medical image generation because of their high image quality. In this study, we focus on generating medical images with ordinal classes, which have ordinal relationships, such as severity levels. We propose an Ordinal Diffusion Model (ODM) that controls the ordinal relationships of the estimated noise images among the classes. Our model was evaluated experimentally by generating retinal and endoscopic images of multiple severity classes. ODM achieved higher performance than conventional generative models by generating realistic images, especially in high-severity classes with fewer training samples.

* Accepted at ISBI2024

Via

Access Paper or Ask Questions

Font Style Interpolation with Diffusion Models

Feb 22, 2024

Tetta Kondo, Shumpei Takezaki, Daichi Haraguchi, Seiichi Uchida

Figure 1 for Font Style Interpolation with Diffusion Models

Figure 2 for Font Style Interpolation with Diffusion Models

Figure 3 for Font Style Interpolation with Diffusion Models

Figure 4 for Font Style Interpolation with Diffusion Models

Abstract:Fonts have huge variations in their styles and give readers different impressions. Therefore, generating new fonts is worthy of giving new impressions to readers. In this paper, we employ diffusion models to generate new font styles by interpolating a pair of reference fonts with different styles. More specifically, we propose three different interpolation approaches, image-blending, condition-blending, and noise-blending, with the diffusion models. We perform qualitative and quantitative experimental analyses to understand the style generation ability of the three approaches. According to experimental results, three proposed approaches can generate not only expected font styles but also somewhat serendipitous font styles. We also compare the approaches with a state-of-the-art style-conditional Latin-font generative network model to confirm the validity of using the diffusion models for the style interpolation task.

Via

Access Paper or Ask Questions

Disease Severity Regression with Continuous Data Augmentation

Feb 24, 2023

Shumpei Takezaki, Kiyohito Tanaka, Seiichi Uchida, Takeaki Kadota

Figure 1 for Disease Severity Regression with Continuous Data Augmentation

Figure 2 for Disease Severity Regression with Continuous Data Augmentation

Figure 3 for Disease Severity Regression with Continuous Data Augmentation

Figure 4 for Disease Severity Regression with Continuous Data Augmentation

Abstract:Disease severity regression by a convolutional neural network (CNN) for medical images requires a sufficient number of image samples labeled with severity levels. Conditional generative adversarial network (cGAN)-based data augmentation (DA) is a possible solution, but it encounters two issues. The first issue is that existing cGANs cannot deal with real-valued severity levels as their conditions, and the second is that the severity of the generated images is not fully reliable. We propose continuous DA as a solution to the two issues. Our method uses continuous severity GAN to generate images at real-valued severity levels and dataset-disjoint multi-objective optimization to deal with the second issue. Our method was evaluated for estimating ulcerative colitis (UC) severity of endoscopic images and achieved higher classification performance than conventional DA methods.

* Accepted at ISBI2023

Via

Access Paper or Ask Questions