Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Junghoon Lee

HyperCLOVA X Technical Report

Apr 13, 2024

Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim(+386 more)

Abstract:We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.

* 44 pages; updated authors list and fixed author names

Via

Access Paper or Ask Questions

Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification

Jul 22, 2021

Junghoon Lee, Jounghee Kim, Pilsung Kang

Figure 1 for Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification

Figure 2 for Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification

Figure 3 for Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification

Figure 4 for Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification

Abstract:Language models (LMs) pretrained on a large text corpus and fine-tuned on a downstream text corpus and fine-tuned on a downstream task becomes a de facto training strategy for several natural language processing (NLP) tasks. Recently, an adaptive pretraining method retraining the pretrained language model with task-relevant data has shown significant performance improvements. However, current adaptive pretraining methods suffer from underfitting on the task distribution owing to a relatively small amount of data to re-pretrain the LM. To completely use the concept of adaptive pretraining, we propose a back-translated task-adaptive pretraining (BT-TAPT) method that increases the amount of task-specific data for LM re-pretraining by augmenting the task data using back-translation to generalize the LM to the target task domain. The experimental results show that the proposed BT-TAPT yields improved classification accuracy on both low- and high-resource data and better robustness to noise than the conventional adaptive pretraining method.

Via

Access Paper or Ask Questions

Validating uncertainty in medical image translation

Feb 11, 2020

Jacob C. Reinhold, Yufan He, Shizhong Han, Yunqiang Chen, Dashan Gao, Junghoon Lee, Jerry L. Prince, Aaron Carass

Figure 1 for Validating uncertainty in medical image translation

Figure 2 for Validating uncertainty in medical image translation

Figure 3 for Validating uncertainty in medical image translation

Figure 4 for Validating uncertainty in medical image translation

Abstract:Medical images are increasingly used as input to deep neural networks to produce quantitative values that aid researchers and clinicians. However, standard deep neural networks do not provide a reliable measure of uncertainty in those quantitative values. Recent work has shown that using dropout during training and testing can provide estimates of uncertainty. In this work, we investigate using dropout to estimate epistemic and aleatoric uncertainty in a CT-to-MR image translation task. We show that both types of uncertainty are captured, as defined, providing confidence in the output uncertainty estimates.

* IEEE ISBI 2020

Via

Access Paper or Ask Questions

Finding novelty with uncertainty

Feb 11, 2020

Jacob C. Reinhold, Yufan He, Shizhong Han, Yunqiang Chen, Dashan Gao, Junghoon Lee, Jerry L. Prince, Aaron Carass

Figure 1 for Finding novelty with uncertainty

Figure 2 for Finding novelty with uncertainty

Figure 3 for Finding novelty with uncertainty

Abstract:Medical images are often used to detect and characterize pathology and disease; however, automatically identifying and segmenting pathology in medical images is challenging because the appearance of pathology across diseases varies widely. To address this challenge, we propose a Bayesian deep learning method that learns to translate healthy computed tomography images to magnetic resonance images and simultaneously calculates voxel-wise uncertainty. Since high uncertainty occurs in pathological regions of the image, this uncertainty can be used for unsupervised anomaly segmentation. We show encouraging experimental results on an unsupervised anomaly segmentation task by combining two types of uncertainty into a novel quantity we call scibilic uncertainty.

* SPIE Medical Imaging 2020

Via

Access Paper or Ask Questions

Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN

Sep 12, 2018

Heran Yang, Jian Sun, Aaron Carass, Can Zhao, Junghoon Lee, Zongben Xu, Jerry Prince

Figure 1 for Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN

Figure 2 for Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN

Figure 3 for Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN

Figure 4 for Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN

Abstract:The cycleGAN is becoming an influential method in medical image synthesis. However, due to a lack of direct constraints between input and synthetic images, the cycleGAN cannot guarantee structural consistency between these two images, and such consistency is of extreme importance in medical imaging. To overcome this, we propose a structure-constrained cycleGAN for brain MR-to-CT synthesis using unpaired data that defines an extra structure-consistency loss based on the modality independent neighborhood descriptor to constrain structural consistency. Additionally, we use a position-based selection strategy for selecting training images instead of a completely random selection scheme. Experimental results on synthesizing CT images from brain MR images demonstrate that our method is better than the conventional cycleGAN and approximates the cycleGAN trained with paired data.

* 8 pages, 5 figures, accepted by MICCAI 2018 Workshop: Deep Learning in Medical Image Analysis (DLMIA)

Via

Access Paper or Ask Questions