Sunwoo Kim

On the Effect of Uncertainty on Layer-wise Inference Dynamics

Jul 09, 2025

Sunwoo Kim, Haneul Yoo, Alice Oh

Abstract:Understanding how large language models (LLMs) internally represent and process their predictions is central to detecting uncertainty and preventing hallucinations. While several studies have shown that models encode uncertainty in their hidden states, it is underexplored how this affects the way they process such hidden states. In this work, we demonstrate that the dynamics of output token probabilities across layers for certain and uncertain outputs are largely aligned, revealing that uncertainty does not seem to affect inference dynamics. Specifically, we use the Tuned Lens, a variant of the Logit Lens, to analyze the layer-wise probability trajectories of final prediction tokens across 11 datasets and 5 models. Using incorrect predictions as those with higher epistemic uncertainty, our results show aligned trajectories for certain and uncertain predictions that both observe abrupt increases in confidence at similar layers. We balance this finding by showing evidence that more competent models may learn to process uncertainty differently. Our findings challenge the feasibility of leveraging simplistic methods for detecting uncertainty at inference. More broadly, our work demonstrates how interpretability methods may be used to investigate the way uncertainty affects inference.

* Accepted to Actionable Interpretability Workshop - ICML 2025
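
The layer-wise probability trajectory described in the abstract can be illustrated with a minimal sketch. It uses the plain Logit Lens idea (projecting each layer's hidden state through the unembedding matrix), not the Tuned Lens, which additionally learns a per-layer translator. The random `hidden` and `W_U` arrays, the function names, and the omission of the final layer norm are all assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)

def token_trajectory(hidden_states, unembed):
    """Probability assigned to the final predicted token at every layer.

    hidden_states: (num_layers, d_model), last-position hidden state per layer
    unembed:       (d_model, vocab_size), hypothetical unembedding matrix
    """
    probs = softmax(hidden_states @ unembed)   # (num_layers, vocab_size)
    final_token = int(probs[-1].argmax())      # token predicted at the last layer
    return final_token, probs[:, final_token]

# Toy stand-ins for a real model's activations (assumption: random data).
rng = np.random.default_rng(0)
hidden = rng.normal(size=(6, 16))
W_U = rng.normal(size=(16, 50))
token, trajectory = token_trajectory(hidden, W_U)
```

Comparing such trajectories between correct and incorrect predictions is the kind of analysis the abstract refers to; with a real model, `hidden` would come from the per-layer hidden states at the position being predicted.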

'Hello, World!': Making GNNs Talk with LLMs

May 27, 2025

Sunwoo Kim, Soo Yong Lee, Jaemin Yoo, Kijung Shin

Abstract:While graph neural networks (GNNs) have shown remarkable performance across diverse graph-related tasks, their high-dimensional hidden representations render them black boxes. In this work, we propose Graph Lingual Network (GLN), a GNN built on large language models (LLMs), with hidden representations in the form of human-readable text. Through careful prompt design, GLN incorporates not only the message passing module of GNNs but also advanced GNN techniques, including graph attention and initial residual connection. The comprehensibility of GLN's hidden representations enables an intuitive analysis of how node representations change (1) across layers and (2) under advanced GNN techniques, shedding light on the inner workings of GNNs. Furthermore, we demonstrate that GLN achieves strong zero-shot performance on node classification and link prediction, outperforming existing LLM-based baseline methods.

* Code and datasets are in https://github.com/kswoo97/GLN-Code

A Domain-Agnostic Scalable AI Safety Ensuring Framework

Apr 30, 2025

Beomjun Kim, Kangyeon Kim, Sunwoo Kim, Heejin Ahn

Abstract:Ensuring the safety of AI systems has recently emerged as a critical priority for real-world deployment, particularly in physical AI applications. Current approaches to AI safety typically address predefined domain-specific safety conditions, limiting their ability to generalize across contexts. We propose a novel AI safety framework that ensures AI systems comply with any user-defined constraint, with any desired probability, and across various domains. In this framework, we combine an AI component (e.g., neural network) with an optimization problem to produce responses that minimize objectives while satisfying user-defined constraints with probabilities exceeding user-defined thresholds. For credibility assessment of the AI component, we propose internal test data, a supplementary set of safety-labeled data, and a conservative testing methodology that provides statistical validity of using internal test data. We also present an approximation method of a loss function and how to compute its gradient for training. We mathematically prove that probabilistic constraint satisfaction is guaranteed under specific, mild conditions and prove a scaling law between safety and the number of internal test data. We demonstrate our framework's effectiveness through experiments in diverse domains: demand prediction for production decision, safe reinforcement learning within the SafetyGym simulator, and guarding AI chatbot outputs. Through these experiments, we demonstrate that our method guarantees safety for user-specified constraints, outperforms existing methods by up to several orders of magnitude in low safety threshold regions, and scales effectively with respect to the size of internal test data.

* Theoretical supplementary material (Part 1) is available in the submitted files. Experimental supplementary material (Part 2) will be available before May 22, 23:59 AoE

Multi-Behavior Recommender Systems: A Survey

Mar 10, 2025

Kyungho Kim, Sunwoo Kim, Geon Lee, Jinhong Jung, Kijung Shin

Abstract:Traditional recommender systems primarily rely on a single type of user-item interaction, such as item purchases or ratings, to predict user preferences. However, in real-world scenarios, users engage in a variety of behaviors, such as clicking on items or adding them to carts, offering richer insights into their interests. Multi-behavior recommender systems leverage these diverse interactions to enhance recommendation quality, and research on this topic has grown rapidly in recent years. This survey provides a timely review of multi-behavior recommender systems, focusing on three key steps: (1) Data Modeling: representing multi-behaviors at the input level, (2) Encoding: transforming these inputs into vector representations (i.e., embeddings), and (3) Training: optimizing machine-learning models. We systematically categorize existing multi-behavior recommender systems based on the commonalities and differences in their approaches across the above steps. Additionally, we discuss promising future directions for advancing multi-behavior recommender systems.

* Accepted in the PAKDD 2025 Survey Track

Alignment without Over-optimization: Training-Free Solution for Diffusion Models

Jan 10, 2025

Sunwoo Kim, Minkyu Kim, Dongmin Park

Abstract:Diffusion models excel in generative tasks, but aligning them with specific objectives while maintaining their versatility remains challenging. Existing fine-tuning methods often suffer from reward over-optimization, while approximate guidance approaches fail to optimize target rewards effectively. Addressing these limitations, we propose a training-free sampling method based on Sequential Monte Carlo (SMC) to sample from the reward-aligned target distribution. Our approach, tailored for diffusion sampling and incorporating tempering techniques, achieves comparable or superior target rewards to fine-tuning methods while preserving diversity and cross-reward generalization. We demonstrate its effectiveness in single-reward optimization, multi-objective scenarios, and online black-box optimization. This work offers a robust solution for aligning diffusion models with diverse downstream objectives without compromising their general capabilities. Code is available at https://github.com/krafton-ai/DAS .
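
To make the idea of tempered Sequential Monte Carlo concrete, here is a toy sketch of tempered importance reweighting with resampling on a one-dimensional problem. It is not the paper's diffusion sampler: there is no diffusion model and no proposal or move step, and the target form prior(x) * exp(r(x) / alpha), the reward, the linear tempering schedule, and all names are assumptions chosen for illustration.

```python
import numpy as np

def tempered_smc(particles, reward, alpha, n_steps, rng):
    """Reweight prior samples toward prior(x) * exp(reward(x) / alpha).

    The reward is switched on gradually along a linear tempering schedule,
    and particles are resampled whenever the effective sample size drops.
    """
    n = len(particles)
    log_w = np.zeros(n)
    lambdas = np.linspace(0.0, 1.0, n_steps + 1)
    for prev, cur in zip(lambdas[:-1], lambdas[1:]):
        # Incremental weight for moving from temperature prev to cur.
        log_w += (cur - prev) * reward(particles) / alpha
        w = np.exp(log_w - log_w.max())
        w /= w.sum()
        ess = 1.0 / np.sum(w ** 2)
        if ess < n / 2:
            # Multinomial resampling, then reset to uniform weights.
            idx = rng.choice(n, size=n, p=w)
            particles = particles[idx]
            log_w = np.zeros(n)
    w = np.exp(log_w - log_w.max())
    return particles, w / w.sum()

rng = np.random.default_rng(0)
prior_samples = rng.normal(size=5000)          # stand-in for the pretrained sampler
reward = lambda x: -(x - 2.0) ** 2             # hypothetical reward
particles, weights = tempered_smc(prior_samples, reward, alpha=1.0, n_steps=10, rng=rng)
estimate = float(np.sum(weights * particles))  # weighted mean under the target
```

For this toy setup the target is Gaussian with mean 4/3, so `estimate` should land close to that value. Without a move step the particle set loses diversity after each resampling; in a sampler built on a diffusion process, the stochastic denoising steps would play that role.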

Quantum-MUSIC: Multiple Signal Classification for Quantum Wireless Sensing

Dec 31, 2024

Hanvit Kim, Hyunwoo Park, Sunwoo Kim

Abstract:This paper proposes Quantum-MUSIC, the first multiple signal classification (MUSIC) algorithm for multi-user quantum wireless sensing. Since an atomic receiver for quantum wireless sensing can only measure the magnitude of a received signal, traditional antenna-based signal processing algorithms inevitably suffer degraded sensing performance. To overcome this limitation, the proposed algorithm recovers the channel information and incorporates the traditional MUSIC algorithm, enabling multi-user sensing with magnitude-only measurements. Simulation results show that the proposed algorithm outperforms the existing MUSIC algorithm, validating the superior potential of quantum wireless sensing.

* 5 pages, 6 figures
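
For readers unfamiliar with the traditional MUSIC algorithm that the abstract builds on, the following is a minimal sketch of classical MUSIC for a uniform linear array with complex (magnitude and phase) measurements. It does not implement Quantum-MUSIC or the magnitude-only channel recovery step. The array size, half-wavelength spacing, source angles, noise level, and helper names are all assumptions for illustration.

```python
import numpy as np

def steering(theta_deg, n_antennas, spacing=0.5):
    """Steering vector of a uniform linear array (spacing in wavelengths)."""
    k = np.arange(n_antennas)
    return np.exp(2j * np.pi * spacing * k * np.sin(np.deg2rad(theta_deg)))

def music_spectrum(snapshots, n_sources, grid_deg, spacing=0.5):
    """Classical MUSIC pseudo-spectrum from (n_antennas, n_snapshots) data."""
    n_antennas, n_snapshots = snapshots.shape
    cov = snapshots @ snapshots.conj().T / n_snapshots
    _, eigvecs = np.linalg.eigh(cov)                    # eigenvalues ascending
    noise_space = eigvecs[:, : n_antennas - n_sources]  # smallest eigenvalues
    spectrum = np.empty(len(grid_deg))
    for i, theta in enumerate(grid_deg):
        proj = noise_space.conj().T @ steering(theta, n_antennas, spacing)
        spectrum[i] = 1.0 / np.vdot(proj, proj).real
    return spectrum

def top_peaks(spectrum, grid_deg, n_peaks):
    """Angles of the n_peaks largest interior local maxima, sorted."""
    is_peak = (spectrum[1:-1] > spectrum[:-2]) & (spectrum[1:-1] > spectrum[2:])
    idx = np.where(is_peak)[0] + 1
    best = idx[np.argsort(spectrum[idx])[-n_peaks:]]
    return np.sort(grid_deg[best])

# Simulated two-user scenario (assumption: synthetic data).
rng = np.random.default_rng(0)
true_angles = [-20.0, 30.0]
n_antennas, n_snapshots = 8, 200
A = np.stack([steering(t, n_antennas) for t in true_angles], axis=1)
S = rng.normal(size=(2, n_snapshots)) + 1j * rng.normal(size=(2, n_snapshots))
noise = 0.1 * (rng.normal(size=(n_antennas, n_snapshots))
               + 1j * rng.normal(size=(n_antennas, n_snapshots)))
X = A @ S + noise
grid = np.arange(-90.0, 91.0, 1.0)
estimated = top_peaks(music_spectrum(X, 2, grid), grid, 2)
```

The sketch relies on the phase of each antenna's sample through the covariance matrix, which is exactly the information a magnitude-only receiver lacks; this is the gap the abstract says the proposed algorithm addresses.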

Rethinking Reconstruction-based Graph-Level Anomaly Detection: Limitations and a Simple Remedy

Oct 27, 2024

Sunwoo Kim, Soo Yong Lee, Fanchen Bu, Shinhwan Kang, Kyungho Kim, Jaemin Yoo, Kijung Shin

Abstract:Graph autoencoders (Graph-AEs) learn representations of given graphs by aiming to accurately reconstruct them. A notable application of Graph-AEs is graph-level anomaly detection (GLAD), whose objective is to identify graphs with anomalous topological structures and/or node features compared to the majority of the graph population. Graph-AEs for GLAD regard a graph with a high mean reconstruction error (i.e., the mean of errors from all node pairs and/or nodes) as an anomaly. Namely, the methods rest on the assumption that they would better reconstruct graphs with similar characteristics to the majority. We, however, report non-trivial counter-examples, a phenomenon we call reconstruction flip, and highlight the limitations of the existing Graph-AE-based GLAD methods. Specifically, we empirically and theoretically investigate when this assumption holds and when it fails. Through our analyses, we further argue that, while the reconstruction errors for a given graph are effective features for GLAD, leveraging the multifaceted summaries of the reconstruction errors, beyond just mean, can further strengthen the features. Thus, we propose a novel and simple GLAD method, named MUSE. The key innovation of MUSE involves taking multifaceted summaries of reconstruction errors as graph features for GLAD. This surprisingly simple method obtains SOTA performance in GLAD, performing best overall among 14 methods across 10 datasets.

* Published as a conference paper at NeurIPS 2024
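
A small sketch can show why summaries beyond the mean may help. Two graphs can share the same mean reconstruction error while differing in how that error is distributed. The choice of mean and standard deviation as the summaries, the function name, and the toy error values are assumptions for illustration; the abstract does not specify which statistics MUSE uses.

```python
import numpy as np

def error_summaries(errors):
    """Summarize per-element reconstruction errors of one graph.

    Returns [mean, standard deviation]; the specific statistics are an
    illustrative assumption, not necessarily those used by MUSE.
    """
    errors = np.asarray(errors, dtype=float)
    return np.array([errors.mean(), errors.std()])

# Hypothetical per-node reconstruction errors for two graphs.
graph_a = [0.5, 0.5, 0.5, 0.5]   # uniformly moderate error
graph_b = [0.0, 0.0, 1.0, 1.0]   # half perfect, half badly reconstructed

features_a = error_summaries(graph_a)
features_b = error_summaries(graph_b)
```

A detector that thresholds only the mean cannot separate these two graphs, whereas the second feature distinguishes them; such feature vectors could then be passed to any downstream anomaly scorer.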

VOMTC: Vision Objects for Millimeter and Terahertz Communications

Sep 14, 2024

Sunwoo Kim, Yongjun Ahn, Daeyoung Park, Byonghyo Shim

Abstract:Recent advances in sensing and computer vision (CV) technologies have opened the door for the application of deep learning (DL)-based CV technologies in the realm of 6G wireless communications. For the successful application of this emerging technology, it is crucial to have a qualified vision dataset tailored for wireless applications (e.g., RGB images containing wireless devices such as laptops and cell phones). An aim of this paper is to propose a large-scale vision dataset referred to as Vision Objects for Millimeter and Terahertz Communications (VOMTC). The VOMTC dataset consists of 20,232 pairs of RGB and depth images obtained from a camera attached to the base station (BS), with each pair labeled with three representative object categories (person, cell phone, and laptop) and bounding boxes of the objects. Through experimental studies of the VOMTC datasets, we show that the beamforming technique exploiting the VOMTC-trained object detector outperforms conventional beamforming techniques.

* IEEE Transactions on Cognitive Communications and Networking, 2024

Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs

May 31, 2024

Langzhang Liang, Sunwoo Kim, Kijung Shin, Zenglin Xu, Shirui Pan, Yuan Qi

Abstract:Graph Neural Networks (GNNs) have gained significant attention as a powerful modeling and inference method, especially for homophilic graph-structured data. To empower GNNs in heterophilic graphs, where adjacent nodes exhibit dissimilar labels or features, Signed Message Passing (SMP) has been widely adopted. However, there is a lack of theoretical and empirical analysis regarding the limitations of SMP. In this work, we unveil some potential pitfalls of SMP and their remedies. We first identify two limitations of SMP: undesirable representation update for multi-hop neighbors and vulnerability against oversmoothing issues. To overcome these challenges, we propose a novel message passing function called Multiset to Multiset GNN (M2M-GNN). Our theoretical analyses and extensive experiments demonstrate that M2M-GNN effectively alleviates the aforementioned limitations of SMP, yielding superior performance in comparison

* Published as a conference paper at ICML 2024

Near-Field Localization with RIS via Two-Dimensional Signal Path Classification

May 29, 2024

Jeongwan Kang, Seung-Woo Ko, Sunwoo Kim

Abstract:In this paper, we propose two-dimensional signal path classification (2D-SPC) for reconfigurable intelligent surface (RIS)-assisted near-field (NF) localization. In the NF regime, multiple RIS-driven signal paths (SPs) can contribute to precise localization if these are decomposable and the reflected locations on the RIS are known, referred to as SP decomposition (SPD) and SP labeling (SPL), respectively. To this end, each RIS element modulates the incoming SP's phase by shifting it by one of the values in the phase shift profile (PSP) lists satisfying resolution requirements. By interworking with a conventional orthogonal frequency division multiplexing (OFDM) waveform, the user equipment can construct a 2D spectrum map that couples each SP's time of arrival (ToA) and PSP. Then, we design SPL by mapping SPs with the corresponding reflected RIS elements when they share the same PSP. Given two unlabeled SPs, we derive a geometric discriminant for checking whether the current label is correct. It can be extended to more than three SPs by sorting them using pairwise geometric discriminants between adjacent ones. Simulation results demonstrate that the proposed 2D-SPC achieves consistent localization accuracy even if insufficient PSPs are given.

* 15 pages, 12 figures, Submitted to IEEE Transactions on Wireless Communications
