Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Namil Kim

Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation

May 11, 2025

Seokjun Kwon, Jeongmin Shin, Namil Kim, Soonmin Hwang, Yukyung Choi

Abstract:In autonomous driving, thermal image semantic segmentation has emerged as a critical research area, owing to its ability to provide robust scene understanding under adverse visual conditions. In particular, unsupervised domain adaptation (UDA) for thermal image segmentation can be an efficient solution to address the lack of labeled thermal datasets. Nevertheless, since these methods do not effectively utilize the complementary information between RGB and thermal images, they significantly decrease performance during domain adaptation. In this paper, we present a comprehensive study on cross-spectral UDA for thermal image semantic segmentation. We first propose a novel masked mutual learning strategy that promotes complementary information exchange by selectively transferring results between each spectral model while masking out uncertain regions. Additionally, we introduce a novel prototypical self-supervised loss designed to enhance the performance of the thermal segmentation model in nighttime scenarios. This approach addresses the limitations of RGB pre-trained networks, which cannot effectively transfer knowledge under low illumination due to the inherent constraints of RGB sensors. In experiments, our method achieves higher performance over previous UDA methods and comparable performance to state-of-the-art supervised methods.

* 7 pages, 4 figures, International Conference on Robotics and Automation(ICRA) 2025

Via

Access Paper or Ask Questions

Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem

Mar 20, 2024

Mincheol Chang, Siyeong Lee, Jinkyu Kim, Namil Kim

Figure 1 for Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem

Figure 2 for Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem

Figure 3 for Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem

Figure 4 for Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem

Abstract:Typical LiDAR-based 3D object detection models are trained in a supervised manner with real-world data collection, which is often imbalanced over classes (or long-tailed). To deal with it, augmenting minority-class examples by sampling ground truth (GT) LiDAR points from a database and pasting them into a scene of interest is often used, but challenges still remain: inflexibility in locating GT samples and limited sample diversity. In this work, we propose to leverage pseudo-LiDAR point clouds generated (at a low cost) from videos capturing a surround view of miniatures or real-world objects of minor classes. Our method, called Pseudo Ground Truth Augmentation (PGT-Aug), consists of three main steps: (i) volumetric 3D instance reconstruction using a 2D-to-3D view synthesis model, (ii) object-level domain alignment with LiDAR intensity estimation and (iii) a hybrid context-aware placement method from ground and map information. We demonstrate the superiority and generality of our method through performance improvements in extensive experiments conducted on three popular benchmarks, i.e., nuScenes, KITTI, and Lyft, especially for the datasets with large domain gaps captured by different LiDAR configurations. Our code and data will be publicly available upon publication.

* 28 pages, 12 figures, 11 tables

Via

Access Paper or Ask Questions

PANDAS: Prototype-based Novel Class Discovery and Detection

Feb 27, 2024

Tyler L. Hayes, César R. de Souza, Namil Kim, Jiwon Kim, Riccardo Volpi, Diane Larlus

Figure 1 for PANDAS: Prototype-based Novel Class Discovery and Detection

Figure 2 for PANDAS: Prototype-based Novel Class Discovery and Detection

Figure 3 for PANDAS: Prototype-based Novel Class Discovery and Detection

Figure 4 for PANDAS: Prototype-based Novel Class Discovery and Detection

Abstract:Object detectors are typically trained once and for all on a fixed set of classes. However, this closed-world assumption is unrealistic in practice, as new classes will inevitably emerge after the detector is deployed in the wild. In this work, we look at ways to extend a detector trained for a set of base classes so it can i) spot the presence of novel classes, and ii) automatically enrich its repertoire to be able to detect those newly discovered classes together with the base ones. We propose PANDAS, a method for novel class discovery and detection. It discovers clusters representing novel classes from unlabeled data, and represents old and new classes with prototypes. During inference, a distance-based classifier uses these prototypes to assign a label to each detected object instance. The simplicity of our method makes it widely applicable. We experimentally demonstrate the effectiveness of PANDAS on the VOC 2012 and COCO-to-LVIS benchmarks. It performs favorably against the state of the art for this task while being computationally more affordable.

Via

Access Paper or Ask Questions

Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation

Oct 12, 2019

Seungmin Lee, Dongwan Kim, Namil Kim, Seong-Gyun Jeong

Figure 1 for Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation

Figure 2 for Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation

Figure 3 for Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation

Figure 4 for Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation

Abstract:Recent works on domain adaptation exploit adversarial training to obtain domain-invariant feature representations from the joint learning of feature extractor and domain discriminator networks. However, domain adversarial methods render suboptimal performances since they attempt to match the distributions among the domains without considering the task at hand. We propose Drop to Adapt (DTA), which leverages adversarial dropout to learn strongly discriminative features by enforcing the cluster assumption. Accordingly, we design objective functions to support robust domain adaptation. We demonstrate efficacy of the proposed method on various experiments and achieve consistent improvements in both image classification and semantic segmentation tasks. Our source code is available at https://github.com/postBG/DTA.pytorch.

* ICCV 2019

Via

Access Paper or Ask Questions

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

Oct 17, 2017

Seokju Lee, Junsik Kim, Jae Shin Yoon, Seunghak Shin, Oleksandr Bailo, Namil Kim, Tae-Hee Lee, Hyun Seok Hong, Seung-Hoon Han, In So Kweon

Figure 1 for VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

Figure 2 for VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

Figure 3 for VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

Figure 4 for VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

Abstract:In this paper, we propose a unified end-to-end trainable multi-task network that jointly handles lane and road marking detection and recognition that is guided by a vanishing point under adverse weather conditions. We tackle rainy and low illumination conditions, which have not been extensively studied until now due to clear challenges. For example, images taken under rainy days are subject to low illumination, while wet roads cause light reflection and distort the appearance of lane and road markings. At night, color distortion occurs under limited illumination. As a result, no benchmark dataset exists and only a few developed algorithms work under poor weather conditions. To address this shortcoming, we build up a lane and road marking benchmark which consists of about 20,000 images with 17 lane and road marking classes under four different scenarios: no rain, rain, heavy rain, and night. We train and evaluate several versions of the proposed multi-task network and validate the importance of each task. The resulting approach, VPGNet, can detect and classify lanes and road markings, and predict a vanishing point with a single forward pass. Experimental results show that our approach achieves high accuracy and robustness under various conditions in real-time (20 fps). The benchmark and the VPGNet model will be publicly available.

* To appear on ICCV 2017

Via

Access Paper or Ask Questions

Pixel-Level Domain Transfer

Nov 28, 2016

Donggeun Yoo, Namil Kim, Sunggyun Park, Anthony S. Paek, In So Kweon

Figure 1 for Pixel-Level Domain Transfer

Figure 2 for Pixel-Level Domain Transfer

Figure 3 for Pixel-Level Domain Transfer

Figure 4 for Pixel-Level Domain Transfer

Abstract:We present an image-conditional image generation model. The model transfers an input domain to a target domain in semantic level, and generates the target image in pixel level. To generate realistic target images, we employ the real/fake-discriminator as in Generative Adversarial Nets, but also introduce a novel domain-discriminator to make the generated image relevant to the input image. We verify our model through a challenging task of generating a piece of clothing from an input image of a dressed person. We present a high quality clothing dataset containing the two domains, and succeed in demonstrating decent results.

* Published in ECCV 2016. Code and dataset available at dgyoo.github.io

Via

Access Paper or Ask Questions

Fine-scale Surface Normal Estimation using a Single NIR Image

Mar 24, 2016

Youngjin Yoon, Gyeongmin Choe, Namil Kim, Joon-Young Lee, In So Kweon

Figure 1 for Fine-scale Surface Normal Estimation using a Single NIR Image

Figure 2 for Fine-scale Surface Normal Estimation using a Single NIR Image

Figure 3 for Fine-scale Surface Normal Estimation using a Single NIR Image

Figure 4 for Fine-scale Surface Normal Estimation using a Single NIR Image

Abstract:We present surface normal estimation using a single near infrared (NIR) image. We are focusing on fine-scale surface geometry captured with an uncalibrated light source. To tackle this ill-posed problem, we adopt a generative adversarial network which is effective in recovering a sharp output, which is also essential for fine-scale surface normal estimation. We incorporate angular error and integrability constraint into the objective function of the network to make estimated normals physically meaningful. We train and validate our network on a recent NIR dataset, and also evaluate the generality of our trained model by using new external datasets which are captured with a different camera under different environment.

Via

Access Paper or Ask Questions