Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hyunju Lee

Artificial Intelligence Graduate School of Gwangju Institute of Science and Technology, School of Electrical Engineering and Computer Science of Gwangju Institute of Science and Technology

Quantum Supremacy in Tomographic Imaging: Advances in Quantum Tomography Algorithms

Feb 07, 2025

Hyunju Lee, Kyungtaek Jun

Abstract:Quantum computing has emerged as a transformative paradigm, capable of tackling complex computational problems that are infeasible for classical methods within a practical timeframe. At the core of this advancement lies the concept of quantum supremacy, which signifies the ability of quantum processors to surpass classical systems in specific tasks. In the context of tomographic image reconstruction, quantum optimization algorithms enable faster processing and clearer imaging than conventional methods. This study further substantiates quantum supremacy by reducing the required projection angles for tomographic reconstruction while enhancing robustness against image artifacts. Notably, our experiments demonstrated that the proposed algorithm accurately reconstructed tomographic images without artifacts, even when up to 50% error was introduced into the sinogram to induce ring artifacts. Furthermore, it achieved precise reconstructions using only 50% of the projection angles from the original sinogram spanning 0{\deg} to 180{\deg}. These findings highlight the potential of quantum algorithms to revolutionize tomographic imaging by enabling efficient and accurate reconstructions under challenging conditions, paving the way for broader applications in medical imaging, material science, and advanced tomography systems as quantum computing technologies continue to advance.

* 14 pages, 8 figures, 1 table

Via

Access Paper or Ask Questions

Efficient Few-Shot Neural Architecture Search by Counting the Number of Nonlinear Functions

Dec 19, 2024

Youngmin Oh, Hyunju Lee, Bumsub Ham

Abstract:Neural architecture search (NAS) enables finding the best-performing architecture from a search space automatically. Most NAS methods exploit an over-parameterized network (i.e., a supernet) containing all possible architectures (i.e., subnets) in the search space. However, the subnets that share the same set of parameters are likely to have different characteristics, interfering with each other during training. To address this, few-shot NAS methods have been proposed that divide the space into a few subspaces and employ a separate supernet for each subspace to limit the extent of weight sharing. They achieve state-of-the-art performance, but the computational cost increases accordingly. We introduce in this paper a novel few-shot NAS method that exploits the number of nonlinear functions to split the search space. To be specific, our method divides the space such that each subspace consists of subnets with the same number of nonlinear functions. Our splitting criterion is efficient, since it does not require comparing gradients of a supernet to split the space. In addition, we have found that dividing the space allows us to reduce the channel dimensions required for each supernet, which enables training multiple supernets in an efficient manner. We also introduce a supernet-balanced sampling (SBS) technique, sampling several subnets at each training step, to train different supernets evenly within a limited number of training steps. Extensive experiments on standard NAS benchmarks demonstrate the effectiveness of our approach. Our code is available at https://cvlab.yonsei.ac.kr/projects/EFS-NAS.

* Accepted to AAAI 2025

Via

Access Paper or Ask Questions

Illustrious: an Open Advanced Illustration Model

Sep 30, 2024

Sang Hyun Park, Jun Young Koh, Junha Lee, Joy Song, Dongha Kim, Hoyeon Moon, Hyunju Lee, Min Song

Figure 1 for Illustrious: an Open Advanced Illustration Model

Figure 2 for Illustrious: an Open Advanced Illustration Model

Figure 3 for Illustrious: an Open Advanced Illustration Model

Figure 4 for Illustrious: an Open Advanced Illustration Model

Abstract:In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three critical approaches for model improvement. First, we delve into the significance of the batch size and dropout control, which enables faster learning of controllable token based concept activations. Second, we increase the training resolution of images, affecting the accurate depiction of character anatomy in much higher resolution, extending its generation capability over 20MP with proper methods. Finally, we propose the refined multi-level captions, covering all tags and various natural language captions as a critical factor for model development. Through extensive analysis and experiments, Illustrious demonstrates state-of-the-art performance in terms of animation style, outperforming widely-used models in illustration domains, propelling easier customization and personalization with nature of open source. We plan to publicly release updated Illustrious model series sequentially as well as sustainable plans for improvements.

Via

Access Paper or Ask Questions

Deep Metric Loss for Multimodal Learning

Aug 21, 2023

Sehwan Moon, Hyunju Lee

Abstract:Multimodal learning often outperforms its unimodal counterparts by exploiting unimodal contributions and cross-modal interactions. However, focusing only on integrating multimodal features into a unified comprehensive representation overlooks the unimodal characteristics. In real data, the contributions of modalities can vary from instance to instance, and they often reinforce or conflict with each other. In this study, we introduce a novel \text{MultiModal} loss paradigm for multimodal learning, which subgroups instances according to their unimodal contributions. \text{MultiModal} loss can prevent inefficient learning caused by overfitting and efficiently optimize multimodal models. On synthetic data, \text{MultiModal} loss demonstrates improved classification performance by subgrouping difficult instances within certain modalities. On four real multimodal datasets, our loss is empirically shown to improve the performance of recent models. Ablation studies verify the effectiveness of our loss. Additionally, we show that our loss generates a reliable prediction score for each modality, which is essential for subgrouping. Our \text{MultiModal} loss is a novel loss function to subgroup instances according to the contribution of modalities in multimodal learning and is applicable to a variety of multimodal models with unimodal decisions. Our code is available at https://github.com/SehwanMoon/MultiModalLoss.

* 18 pages, 9 figures

Via

Access Paper or Ask Questions

PINNet: a deep neural network with pathway prior knowledge for Alzheimer's disease

Nov 27, 2022

Yeojin Kim, Hyunju Lee

Figure 1 for PINNet: a deep neural network with pathway prior knowledge for Alzheimer's disease

Figure 2 for PINNet: a deep neural network with pathway prior knowledge for Alzheimer's disease

Figure 3 for PINNet: a deep neural network with pathway prior knowledge for Alzheimer's disease

Figure 4 for PINNet: a deep neural network with pathway prior knowledge for Alzheimer's disease

Abstract:Identification of Alzheimer's Disease (AD)-related transcriptomic signatures from blood is important for early diagnosis of the disease. Deep learning techniques are potent classifiers for AD diagnosis, but most have been unable to identify biomarkers because of their lack of interpretability. To address these challenges, we propose a pathway information-based neural network (PINNet) to predict AD patients and analyze blood and brain transcriptomic signatures using an interpretable deep learning model. PINNet is a deep neural network (DNN) model with pathway prior knowledge from either the Gene Ontology or Kyoto Encyclopedia of Genes and Genomes databases. Then, a backpropagation-based model interpretation method was applied to reveal essential pathways and genes for predicting AD. We compared the performance of PINNet with a DNN model without a pathway. Performances of PINNet outperformed or were similar to those of DNN without a pathway using blood and brain gene expressions, respectively. Moreover, PINNet considers more AD-related genes as essential features than DNN without a pathway in the learning process. Pathway analysis of protein-protein interaction modules of highly contributed genes showed that AD-related genes in blood were enriched with cell migration, PI3K-Akt, MAPK signaling, and apoptosis in blood. The pathways enriched in the brain module included cell migration, PI3K-Akt, MAPK signaling, apoptosis, protein ubiquitination, and t-cell activation. Collectively, with prior knowledge about pathways, PINNet reveals essential pathways related to AD.

Via

Access Paper or Ask Questions

SDGCCA: Supervised Deep Generalized Canonical Correlation Analysis for Multi-omics Integration

Apr 17, 2022

Jeongyoung Hwang, Sehwan Moon, Hyunju Lee

Figure 1 for SDGCCA: Supervised Deep Generalized Canonical Correlation Analysis for Multi-omics Integration

Figure 2 for SDGCCA: Supervised Deep Generalized Canonical Correlation Analysis for Multi-omics Integration

Figure 3 for SDGCCA: Supervised Deep Generalized Canonical Correlation Analysis for Multi-omics Integration

Figure 4 for SDGCCA: Supervised Deep Generalized Canonical Correlation Analysis for Multi-omics Integration

Abstract:Integration of multi-omics data provides opportunities for revealing biological mechanisms related to certain phenotypes. We propose a novel method of multi-omics integration called supervised deep generalized canonical correlation analysis (SDGCCA) for modeling correlation structures between nonlinear multi-omics manifolds, aiming for improving classification of phenotypes and revealing biomarkers related to phenotypes. SDGCCA addresses the limitations of other canonical correlation analysis (CCA)-based models (e.g., deep CCA, deep generalized CCA) by considering complex/nonlinear cross-data correlations and discriminating phenotype groups. Although there are a few methods for nonlinear CCA projections for discriminant purposes of phenotypes, they only consider two views. On the other hand, SDGCCA is the nonlinear multiview CCA projection method for discrimination. When we applied SDGCCA to prediction of patients of Alzheimer's disease (AD) and discrimination of early- and late-stage cancers, it outperformed other CCA-based methods and other supervised methods. In addition, we demonstrate that SDGCCA can be used for feature selection to identify important multi-omics biomarkers. In the application on AD data, SDGCCA identified clusters of genes in multi-omics data, which are well known to be associated with AD.

* 10 pages, 5 figures

Via

Access Paper or Ask Questions

A molecular generative model with genetic algorithm and tree search for cancer samples

Dec 16, 2021

Sejin Park, Hyunju Lee

Figure 1 for A molecular generative model with genetic algorithm and tree search for cancer samples

Figure 2 for A molecular generative model with genetic algorithm and tree search for cancer samples

Figure 3 for A molecular generative model with genetic algorithm and tree search for cancer samples

Figure 4 for A molecular generative model with genetic algorithm and tree search for cancer samples

Abstract:Personalized medicine is expected to maximize the intended drug effects and minimize side effects by treating patients based on their genetic profiles. Thus, it is important to generate drugs based on the genetic profiles of diseases, especially in anticancer drug discovery. However, this is challenging because the vast chemical space and variations in cancer properties require a huge time resource to search for proper molecules. Therefore, an efficient and fast search method considering genetic profiles is required for de novo molecular design of anticancer drugs. Here, we propose a faster molecular generative model with genetic algorithm and tree search for cancer samples (FasterGTS). FasterGTS is constructed with a genetic algorithm and a Monte Carlo tree search with three deep neural networks: supervised learning, self-trained, and value networks, and it generates anticancer molecules based on the genetic profiles of a cancer sample. When compared to other methods, FasterGTS generated cancer sample-specific molecules with general chemical properties required for cancer drugs within the limited numbers of samplings. We expect that FasterGTS contributes to the anticancer drug generation.

Via

Access Paper or Ask Questions

Noise-resistant reconstruction algorithm based on the sinogram pattern

Nov 19, 2021

Byung Chun Kim, Hyunju Lee, Kyungtaek Jun

Figure 1 for Noise-resistant reconstruction algorithm based on the sinogram pattern

Figure 2 for Noise-resistant reconstruction algorithm based on the sinogram pattern

Abstract:We introduce a new CT image reconstruction algorithm that is less affected by various artifacts. The new reconstruction algorithm is a method of minimizing the difference between synchrotron X-ray tomography data and sinograms generated using Radon transform of CT images. The CT image is iteratively updated to reduce the difference from the sinogram of the data. This method can obtain clean CT images from the projection data, which can create ring artifacts or metal artifacts. Also, even if the sample size is larger than the CCD and/or the projection image does not satisfy the Beer-Lambert law, a clean CT image can be reconstructed. Our new reconstruction algorithm can also be applied to fan beam CT or cone beam CT

* 8 pages, 3 figures

Via

Access Paper or Ask Questions

Exploiting all samples in low-resource sentence classification: early stopping and initialization parameters

Nov 12, 2021

HongSeok Choi, Hyunju Lee

Figure 1 for Exploiting all samples in low-resource sentence classification: early stopping and initialization parameters

Figure 2 for Exploiting all samples in low-resource sentence classification: early stopping and initialization parameters

Figure 3 for Exploiting all samples in low-resource sentence classification: early stopping and initialization parameters

Figure 4 for Exploiting all samples in low-resource sentence classification: early stopping and initialization parameters

Abstract:In low resource settings, deep neural models have often shown lower performance due to overfitting. The primary method to solve the overfitting problem is to generalize model parameters. To this end, many researchers have depended on large external resources with various manipulation techniques. In this study, we discuss how to exploit all available samples in low resource settings, without external datasets and model manipulation. This study focuses on natural language processing task. We propose a simple algorithm to find out good initialization parameters that improve robustness to a small sample set. We apply early stopping techniques that enable the use of all samples for training. Finally, the proposed learning strategy is to train all samples with the good initialization parameters and stop the model with the early stopping techniques. Extensive experiments are conducted on seven public sentence classification datasets, and the results demonstrate that the proposed learning strategy achieves better performance than several state-of-the-art works across the seven datasets.

* 28 pages, 5 figures, submitted to Neurocomputing (in review)

Via

Access Paper or Ask Questions

Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training

Nov 04, 2020

Dongha Choi, Hyunju Lee

Figure 1 for Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training

Figure 2 for Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training

Figure 3 for Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training

Figure 4 for Extracting Chemical-Protein Interactions via Calibrated Deep Neural Network and Self-training

Abstract:The extraction of interactions between chemicals and proteins from several biomedical articles is important in many fields of biomedical research such as drug development and prediction of drug side effects. Several natural language processing methods, including deep neural network (DNN) models, have been applied to address this problem. However, these methods were trained with hard-labeled data, which tend to become over-confident, leading to degradation of the model reliability. To estimate the data uncertainty and improve the reliability, "calibration" techniques have been applied to deep learning models. In this study, to extract chemical--protein interactions, we propose a DNN-based approach incorporating uncertainty information and calibration techniques. Our model first encodes the input sequence using a pre-trained language-understanding model, following which it is trained using two calibration methods: mixup training and addition of a confidence penalty loss. Finally, the model is re-trained with augmented data that are extracted using the estimated uncertainties. Our approach has achieved state-of-the-art performance with regard to the Biocreative VI ChemProt task, while preserving higher calibration abilities than those of previous approaches. Furthermore, our approach also presents the possibilities of using uncertainty estimation for performance improvement.

* 10 pages, 4 figures, accepted for the Findings of EMNLP

Via

Access Paper or Ask Questions