Abstract:Adapting pre-trained large language models to different domains in natural language processing involves two key challenges: high computational demands and the model's inability to adapt continually. To address both issues simultaneously, this paper presents COPAL (COntinual Pruning in Adaptive Language settings), an algorithm for pruning large language generative models under a continual model adaptation setting. While avoiding resource-heavy finetuning or retraining, our pruning process is guided by the proposed sensitivity analysis. This sensitivity measures the model's ability to withstand perturbations introduced by a new dataset and identifies the weights that remain relevant for all encountered datasets. As a result, COPAL allows seamless model adaptation to new domains while enhancing resource efficiency. Our empirical evaluation on LLMs of various sizes shows that COPAL outperforms baseline models, demonstrating its efficacy in efficiency and adaptability.
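The core idea of scoring weights by how strongly each newly encountered dataset perturbs them, and preserving the weights that matter across all datasets seen so far, can be sketched as follows. This is a minimal illustration rather than the paper's exact sensitivity measure: the `model_forward` closure, the |w * grad| proxy score, and the max-accumulation rule across datasets are all assumptions.

```python
import torch

def update_sensitivity(running_score, layer, calib_batch, model_forward):
    """Accumulate a per-weight sensitivity score from one calibration batch.

    Hypothetical criterion: |weight * gradient| as a first-order proxy for how
    much the loss would change if the weight were removed. COPAL's actual
    sensitivity analysis differs; this is only an illustrative stand-in.
    """
    loss = model_forward(calib_batch)          # scalar loss on the new dataset
    loss.backward()
    score = (layer.weight.detach() * layer.weight.grad.detach()).abs()
    layer.weight.grad = None
    # Keep the maximum over all datasets seen so far, so weights important for
    # *any* previously encountered domain are preserved during pruning.
    return score if running_score is None else torch.maximum(running_score, score)

def prune_by_sensitivity(layer, running_score, sparsity=0.5):
    """Zero out the weights with the lowest accumulated sensitivity."""
    k = int(sparsity * running_score.numel())
    threshold = running_score.flatten().kthvalue(k).values
    mask = (running_score > threshold).to(layer.weight.dtype)
    layer.weight.data.mul_(mask)               # no finetuning or retraining step
    return mask
```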
Abstract:Considering the functionality of situational awareness in safety-critical automation systems, the perception of risk in driving scenes and its explainability is of particular importance for autonomous and cooperative driving. Toward this goal, this paper proposes a new research direction of joint risk localization in driving scenes and its risk explanation as a natural language description. Due to the lack of standard benchmarks, we collected a large-scale dataset, DRAMA (Driving Risk Assessment Mechanism with A captioning module), which consists of 17,785 interactive driving scenarios collected in Tokyo, Japan. Our DRAMA dataset accommodates video- and object-level questions on driving risks with associated important objects to achieve the goal of visual captioning as a free-form language description utilizing closed and open-ended responses for multi-level questions, which can be used to evaluate a range of visual captioning capabilities in driving scenarios. We make this data available to the community for further research. Using DRAMA, we explore multiple facets of joint risk localization and captioning in interactive driving scenarios. In particular, we benchmark various multi-task prediction architectures and provide a detailed analysis of joint risk localization and risk captioning. The data set is available at https://usa.honda-ri.com/drama
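A generic multi-task setup of the kind benchmarked here pairs a shared visual encoder with a localization head and a caption decoder. The sketch below is a schematic stand-in, not one of the specific architectures evaluated in the paper; the feature dimensions, the LSTM decoder, and the box parameterization are assumptions.

```python
import torch
import torch.nn as nn

class JointRiskModel(nn.Module):
    """Schematic multi-task model for joint risk localization and captioning:
    a shared visual feature feeds (a) a box-regression head that localizes the
    important object and (b) an autoregressive caption head that generates the
    risk description."""

    def __init__(self, feat_dim=2048, hidden=512, vocab_size=10000):
        super().__init__()
        self.box_head = nn.Sequential(nn.Linear(feat_dim, hidden), nn.ReLU(),
                                      nn.Linear(hidden, 4))       # (x, y, w, h)
        self.embed = nn.Embedding(vocab_size, hidden)
        self.caption_rnn = nn.LSTM(hidden, hidden, batch_first=True)
        self.visual_proj = nn.Linear(feat_dim, hidden)
        self.word_out = nn.Linear(hidden, vocab_size)

    def forward(self, visual_feat, caption_tokens):
        box = self.box_head(visual_feat)
        # Condition the caption decoder on the visual feature via its initial state.
        h0 = self.visual_proj(visual_feat).unsqueeze(0)
        c0 = torch.zeros_like(h0)
        out, _ = self.caption_rnn(self.embed(caption_tokens), (h0, c0))
        return box, self.word_out(out)
```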
Abstract:Predicting the future trajectories of traffic agents in highly interactive environments is an essential and challenging problem for the safe operation of autonomous driving systems. Given that self-driving vehicles are equipped with various types of sensors (e.g., LiDAR scanner, RGB camera, radar, etc.), we propose a Cross-Modal Embedding framework that aims to benefit from the use of multiple input modalities. At training time, our model learns to embed a set of complementary features in a shared latent space by jointly optimizing the objective functions across different types of input data. At test time, only a single input modality (e.g., LiDAR data) is required to generate predictions from that modality's perspective (i.e., in the LiDAR space), while still benefiting from the model trained with multiple sensor modalities. An extensive evaluation on two benchmark driving datasets shows the efficacy of the proposed framework.
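The train-with-many, test-with-one pattern can be sketched with per-modality encoders that map into a shared latent space, tied together by an alignment term during training and used individually at test time. This is a minimal sketch under our own assumptions (MLP encoders, an L2 alignment loss, a single-layer trajectory decoder), not the paper's architecture or losses.

```python
import torch
import torch.nn as nn

class CrossModalTrajectoryModel(nn.Module):
    """Sketch: each modality has its own encoder into a shared latent space;
    one decoder predicts future (x, y) waypoints from that space."""

    def __init__(self, lidar_dim, image_dim, latent_dim, horizon):
        super().__init__()
        self.lidar_enc = nn.Sequential(nn.Linear(lidar_dim, latent_dim), nn.ReLU(),
                                       nn.Linear(latent_dim, latent_dim))
        self.image_enc = nn.Sequential(nn.Linear(image_dim, latent_dim), nn.ReLU(),
                                       nn.Linear(latent_dim, latent_dim))
        self.decoder = nn.Linear(latent_dim, horizon * 2)   # (x, y) per future step

    def forward(self, lidar_feat, image_feat=None):
        z_lidar = self.lidar_enc(lidar_feat)
        pred = self.decoder(z_lidar)
        if image_feat is None:                 # test time: LiDAR input only
            return pred, None
        z_image = self.image_enc(image_feat)
        align = ((z_lidar - z_image) ** 2).mean()   # pull modalities together
        return pred, align

def training_loss(pred, gt_traj, align, align_weight=0.1):
    # Joint objective: trajectory regression plus cross-modal alignment.
    return ((pred - gt_traj.flatten(1)) ** 2).mean() + align_weight * align
```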
Abstract:Given a set of image denoisers, each having a different denoising capability, is there a provably optimal way of combining these denoisers to produce an overall better result? An answer to this question is fundamental to designing ensembles of weak estimators for complex scenes. In this paper, we present an optimal procedure leveraging deep neural networks and convex optimization. The proposed framework, called the Consensus Neural Network (CsNet), introduces three new concepts in image denoising: (1) A deep neural network to estimate the mean squared error (MSE) of denoised images without needing the ground truths; (2) A provably optimal procedure to combine the denoised outputs via convex optimization; (3) An image boosting procedure using a deep neural network to improve contrast and to recover lost details of the combined images. Experimental results show that CsNet can consistently improve denoising performance for both deterministic and neural network denoisers.
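One concrete way to read concept (2) is as a small quadratic program over the probability simplex, driven by the MSE estimates from concept (1). The sketch below assumes uncorrelated denoiser errors (a diagonal objective) and an off-the-shelf SLSQP solver, which simplifies the paper's provably optimal procedure.

```python
import numpy as np
from scipy.optimize import minimize

def combine_denoisers(denoised_images, estimated_mse):
    """Blend K denoised estimates with convex weights chosen to minimize an
    estimated MSE of the combination.

    denoised_images: array of shape (K, H, W)
    estimated_mse:   array of shape (K,), e.g. from an MSE-estimating network
    """
    k = len(estimated_mse)
    objective = lambda w: float(np.sum(w ** 2 * estimated_mse))   # assumes uncorrelated errors
    constraints = ({"type": "eq", "fun": lambda w: np.sum(w) - 1.0},)
    bounds = [(0.0, 1.0)] * k                                      # weights stay on the simplex
    w0 = np.full(k, 1.0 / k)
    w = minimize(objective, w0, bounds=bounds, constraints=constraints,
                 method="SLSQP").x
    return np.tensordot(w, denoised_images, axes=1), w
```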
Abstract:We present a technique for significantly speeding up Alternating Least Squares (ALS) and Gradient Descent (GD), two widely used algorithms for tensor factorization. By exploiting properties of the Khatri-Rao product, we show how to efficiently address a computationally challenging sub-step of both algorithms. Our algorithm, DFacTo, only requires two sparse matrix-vector products and is easy to parallelize. DFacTo is not only scalable but also on average 4 to 10 times faster than competing algorithms on a variety of datasets. For instance, DFacTo only takes 480 seconds on 4 machines to perform one iteration of the ALS algorithm and 1,143 seconds to perform one iteration of the GD algorithm on a 6.5 million x 2.5 million x 1.5 million dimensional tensor with 1.2 billion non-zero entries.
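For reference, the computationally challenging sub-step in question is the matricized-tensor-times-Khatri-Rao-product (MTTKRP) that appears inside each ALS or GD iteration. The sketch below is a plain direct accumulation over the nonzeros of a third-order COO tensor, shown only to make the target computation concrete; it is not DFacTo's reformulation into two sparse matrix-vector products.

```python
import numpy as np

def mttkrp_mode1(indices, values, B, C):
    """Reference MTTKRP: M = X_(1) (C khatri-rao B) for a sparse 3rd-order tensor.

    indices: (nnz, 3) integer array of (i, j, k) coordinates
    values:  (nnz,) array of tensor entries
    B, C:    factor matrices of shape (J, R) and (K, R)
    """
    i, j, k = indices[:, 0], indices[:, 1], indices[:, 2]
    R = B.shape[1]
    M = np.zeros((i.max() + 1, R))
    for r in range(R):
        # Each nonzero x_ijk contributes x_ijk * B[j, r] * C[k, r] to M[i, r].
        np.add.at(M, (i, r), values * B[j, r] * C[k, r])
    return M
```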