Sungsu Lim

Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation

Apr 18, 2025

SoYoung Park, Hyewon Lee, Mingyu Choi, Seunghoon Han, Jong-Ryul Lee, Sungsu Lim, Tae-Ho Kim

Abstract:Anomaly segmentation is essential for industrial quality, maintenance, and stability. Existing text-guided zero-shot anomaly segmentation models are effective but rely on fixed prompts, limiting adaptability in diverse industrial scenarios. This highlights the need for flexible, context-aware prompting strategies. We propose Image-Aware Prompt Anomaly Segmentation (IAP-AS), which enhances anomaly segmentation by generating dynamic, context-aware prompts using an image tagging model and a large language model (LLM). IAP-AS extracts object attributes from images to generate context-aware prompts, improving adaptability and generalization in dynamic and unstructured industrial environments. In our experiments, IAP-AS improves the F1-max metric by up to 10%, demonstrating superior adaptability and generalization. It provides a scalable solution for anomaly segmentation across industries.

* Accepted to PAKDD 2025, 12 pages
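
F1-max, the metric cited in the abstract above, is commonly computed as the best F1 score over all thresholds applied to per-pixel anomaly scores. The sketch below illustrates that common definition, assuming scores and ground-truth labels are given as flat lists. The function name `f1_max` is hypothetical, and this is not the paper's evaluation code.

```python
def f1_max(scores, labels):
    """Return the best F1 over all thresholds taken from the scores.

    scores: anomaly scores (higher = more anomalous), one per pixel.
    labels: ground-truth labels, 1 for anomalous and 0 for normal.
    """
    best = 0.0
    for threshold in sorted(set(scores)):
        # Predict "anomalous" for every score at or above the threshold.
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        fn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 1)
        if tp == 0:
            continue  # F1 is zero (or undefined) without true positives.
        precision = tp / (tp + fp)
        recall = tp / (tp + fn)
        best = max(best, 2 * precision * recall / (precision + recall))
    return best
```

This brute-force loop is quadratic in the number of pixels; it is meant to make the definition concrete rather than to be efficient.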

Multi-Hyperbolic Space-based Heterogeneous Graph Attention Network

Nov 18, 2024

Jongmin Park, Seunghoon Han, Jong-Ryul Lee, Sungsu Lim

Abstract:To leverage the complex structures within heterogeneous graphs, recent studies on heterogeneous graph embedding use a hyperbolic space, characterized by a constant negative curvature and exponentially increasing space, which aligns with the structural properties of heterogeneous graphs. However, despite heterogeneous graphs inherently possessing diverse power-law structures, most hyperbolic heterogeneous graph embedding models use a single hyperbolic space for the entire heterogeneous graph, which may not effectively capture the diverse power-law structures within the heterogeneous graph. To address this limitation, we propose Multi-hyperbolic Space-based heterogeneous Graph Attention Network (MSGAT), which uses multiple hyperbolic spaces to effectively capture diverse power-law structures within heterogeneous graphs. We conduct comprehensive experiments to evaluate the effectiveness of MSGAT. The experimental results demonstrate that MSGAT outperforms state-of-the-art baselines in various graph machine learning tasks, effectively capturing the complex structures of heterogeneous graphs.

* Accepted in IEEE ICDM 2024

ANOMIX: A Simple yet Effective Hard Negative Generation via Mixing for Graph Anomaly Detection

Oct 27, 2024

Hwan Kim, Junghoon Kim, Sungsu Lim

Abstract:Graph contrastive learning (GCL) generally requires a large number of samples. One effective way to reduce the number of samples is to use hard negatives (e.g., Mixup). Designing a mixing-based approach for graph anomaly detection (GAD) can be difficult due to imbalanced data or a limited number of anomalies. We propose ANOMIX, a framework that consists of a novel graph mixing approach, ANOMIX-M, and multi-level contrasts for GAD. ANOMIX-M can effectively mix abnormality and normality from the input graph to generate hard negatives, which are important for efficient GCL. ANOMIX is (a) a first mixing approach: the first attempt at graph mixing to generate hard negatives for the GAD task, with node- and subgraph-level contrasts to distinguish underlying anomalies; (b) accurate: achieving the highest AUC, up to 5.49% higher and 1.76% faster; and (c) effective: reducing the number of samples by nearly 80% in GCL. Code is available at https://github.com/missinghwan/ANOMIX.

Hyperbolic Heterogeneous Graph Attention Networks

Apr 15, 2024

Jongmin Park, Seunghoon Han, Soohwan Jeong, Sungsu Lim

Abstract:Most previous heterogeneous graph embedding models represent elements in a heterogeneous graph as vector representations in a low-dimensional Euclidean space. However, because heterogeneous graphs inherently possess complex structures, such as hierarchical or power-law structures, distortions can occur when representing them in Euclidean space. To overcome this limitation, we propose Hyperbolic Heterogeneous Graph Attention Networks (HHGAT) that learn vector representations in hyperbolic spaces with meta-path instances. We conducted experiments on three real-world heterogeneous graph datasets, demonstrating that HHGAT outperforms state-of-the-art heterogeneous graph embedding models in node classification and clustering tasks.

* Accepted in ACM THE WEB CONFERENCE 2024 short paper track
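
The abstract above does not say which model of hyperbolic space HHGAT uses. As a general illustration of why hyperbolic geometry suits hierarchical or power-law structures, the sketch below computes the standard distance in the Poincaré ball with curvature -1, where distances grow rapidly toward the boundary. The function name `poincare_distance` is hypothetical, and this is not the authors' implementation.

```python
import math


def poincare_distance(u, v):
    """Distance between two points of the Poincare ball (curvature -1).

    u, v: coordinate sequences with Euclidean norm strictly below 1.
    """
    sq_norm_u = sum(x * x for x in u)
    sq_norm_v = sum(x * x for x in v)
    if sq_norm_u >= 1.0 or sq_norm_v >= 1.0:
        raise ValueError("points must lie strictly inside the unit ball")
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    # d(u, v) = arcosh(1 + 2 * |u - v|^2 / ((1 - |u|^2) * (1 - |v|^2)))
    return math.acosh(1.0 + 2.0 * sq_diff / ((1.0 - sq_norm_u) * (1.0 - sq_norm_v)))
```

Two points separated by the same Euclidean gap are farther apart in this metric when they sit near the boundary, which is what leaves room for tree-like branching.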

Deep Semi-supervised Anomaly Detection with Metapath-based Context Knowledge

Aug 21, 2023

Hwan Kim, Junghoon Kim, Byung Suk Lee, Sungsu Lim

Abstract:Graph anomaly detection has attracted considerable attention in recent years. This paper introduces a novel approach that leverages metapath-based semi-supervised learning, addressing the limitations of previous methods. We present a new framework, Metapath-based Semi-supervised Anomaly Detection (MSAD), incorporating GCN layers in both the encoder and decoder to efficiently propagate context information between abnormal and normal nodes. The design of metapath-based context information and a specifically crafted anomaly community enhance the process of learning differences in structures and attributes, both globally and locally. Through a comprehensive set of experiments conducted on seven real-world networks, this paper demonstrates the superiority of the MSAD method compared to state-of-the-art techniques. The promising results of this study pave the way for future investigations, focusing on the optimization and analysis of metapath patterns to further enhance the effectiveness of anomaly detection on attributed networks.

A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation

Apr 02, 2023

Bo-Kyeong Kim, Jaemin Kang, Daeun Seo, Hancheol Park, Shinkook Choi, Hyungshin Kim, Sungsu Lim

Abstract:Virtual humans have gained considerable attention in numerous industries, e.g., entertainment and e-commerce. As a core technology, synthesizing photorealistic face frames from target speech and facial identity has been actively studied with generative adversarial networks. Despite remarkable results of modern talking-face generation models, they often entail high computational burdens, which limit their efficient deployment. This study aims to develop a lightweight model for speech-driven talking-face synthesis. We build a compact generator by removing the residual blocks and reducing the channel width from Wav2Lip, a popular talking-face generator. We also present a knowledge distillation scheme to stably yet effectively train the small-capacity generator without adversarial learning. We reduce the number of parameters and MACs by 28$\times$ while retaining the performance of the original model. Moreover, to alleviate a severe performance drop when converting the whole generator to INT8 precision, we adopt a selective quantization method that uses FP16 for the quantization-sensitive layers and INT8 for the other layers. Using this mixed precision, we achieve up to a 19$\times$ speedup on edge GPUs without noticeably compromising the generation quality.

* MLSys Workshop on On-Device Intelligence, 2023

Graph Anomaly Detection with Graph Neural Networks: Current Status and Challenges

Oct 04, 2022

Hwan Kim, Byung Suk Lee, Won-Yong Shin, Sungsu Lim

Abstract:Graphs are used widely to model complex systems, and detecting anomalies in a graph is an important task in the analysis of complex systems. Graph anomalies are patterns in a graph that do not conform to normal patterns expected of the attributes and/or structures of the graph. In recent years, graph neural networks (GNNs) have been studied extensively and have successfully performed difficult machine learning tasks in node classification, link prediction, and graph classification thanks to the highly expressive capability via message passing in effectively learning graph representations. To solve the graph anomaly detection problem, GNN-based methods leverage information about the graph attributes (or features) and/or structures to learn to score anomalies appropriately. In this survey, we review the recent advances made in detecting graph anomalies using GNN models. Specifically, we summarize GNN-based methods according to the graph type (i.e., static and dynamic), the anomaly type (i.e., node, edge, subgraph, and whole graph), and the network architecture (e.g., graph autoencoder, graph convolutional network). To the best of our knowledge, this survey is the first comprehensive review of graph anomaly detection methods based on GNNs.

* 9 pages, 2 figures, 1 table; to appear in IEEE Access (Please cite our journal version.)

SiReN: Sign-Aware Recommendation Using Graph Neural Networks

Aug 19, 2021

Changwon Seo, Kyeong-Joong Jeong, Sungsu Lim, Won-Yong Shin

Abstract:In recent years, many recommender systems using network embedding (NE) such as graph neural networks (GNNs) have been extensively studied with the aim of improving recommendation accuracy. However, such attempts have focused mostly on utilizing only the information of positive user-item interactions with high ratings. Thus, a challenge remains in how to make use of low rating scores to represent users' preferences, since low ratings can still be informative in designing NE-based recommender systems. In this study, we present SiReN, a new sign-aware recommender system based on GNN models. Specifically, SiReN has three key components: 1) constructing a signed bipartite graph for more precisely representing users' preferences, which is split into two edge-disjoint graphs with positive and negative edges each, 2) generating two embeddings for the partitioned graphs with positive and negative edges via a GNN model and a multi-layer perceptron (MLP), respectively, and then using an attention model to obtain the final embeddings, and 3) establishing a sign-aware Bayesian personalized ranking (BPR) loss function in the process of optimization. Through comprehensive experiments, we empirically demonstrate that SiReN consistently outperforms state-of-the-art NE-aided recommendation methods.

* 14 pages, 5 figures, 6 tables
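
The first component described in the abstract above, splitting a signed bipartite graph into edge-disjoint positive and negative graphs, can be sketched roughly as follows. The threshold of 3.5, the `(user, item, rating)` tuple layout, the tie handling, and the function name `split_signed_edges` are all assumptions made for illustration; the abstract does not specify them.

```python
def split_signed_edges(ratings, threshold=3.5):
    """Partition rated user-item pairs into positive and negative edge lists.

    ratings: iterable of (user, item, rating) tuples.
    threshold: assumed cut-off; ratings above it count as positive.
    Each returned edge carries the signed weight (rating - threshold).
    """
    positive, negative = [], []
    for user, item, rating in ratings:
        weight = rating - threshold
        # Ties (weight == 0) go to the negative list, an arbitrary choice here.
        (positive if weight > 0 else negative).append((user, item, weight))
    return positive, negative
```

Because every rated pair lands in exactly one of the two lists, the resulting graphs share no edges, and each could then be fed to a separate embedding model.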

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences

Jul 17, 2021

Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White

Abstract:Approximate Policy Iteration (API) algorithms alternate between (approximate) policy evaluation and (approximate) greedification. Many different approaches have been explored for approximate policy evaluation, but less is understood about approximate greedification and what choices guarantee policy improvement. In this work, we investigate approximate greedification when reducing the KL divergence between the parameterized policy and the Boltzmann distribution over action values. In particular, we investigate the difference between the forward and reverse KL divergences, with varying degrees of entropy regularization. We show that the reverse KL has stronger policy improvement guarantees, but that reducing the forward KL can result in a worse policy. We also demonstrate, however, that a large enough reduction of the forward KL can induce improvement under additional assumptions. Empirically, we show on simple continuous-action environments that the forward KL can induce more exploration, but at the cost of a more suboptimal policy. No significant differences were observed in the discrete-action setting or on a suite of benchmark problems. Throughout, we highlight that many policy gradient methods can be seen as an instance of API, with either the forward or reverse KL for the policy update, and discuss next steps for understanding and improving our policy optimization algorithms.

* Submitted to JMLR
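
For a discrete action set, the two objectives compared in the abstract above can be written down directly. The sketch below assumes the usual convention that the reverse KL places the policy first, KL(policy || Boltzmann), and the forward KL places the Boltzmann target first, KL(Boltzmann || policy). The function names and the temperature parameter are illustrative, not taken from the paper.

```python
import math


def boltzmann(action_values, temperature=1.0):
    """Softmax over action values, shifted by the maximum for stability."""
    top = max(action_values)
    weights = [math.exp((q - top) / temperature) for q in action_values]
    total = sum(weights)
    return [w / total for w in weights]


def kl(p, q):
    """KL(p || q) for discrete distributions; assumes q > 0 wherever p > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def forward_kl(policy, target):
    """KL(target || policy): penalizes a policy that misses target mass."""
    return kl(target, policy)


def reverse_kl(policy, target):
    """KL(policy || target): penalizes policy mass where the target has little."""
    return kl(policy, target)
```

For example, with `target = boltzmann([1.0, 0.0])` and a uniform `policy = [0.5, 0.5]`, the two divergences are both positive but not equal, so reducing one is not the same as reducing the other.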

TornadoAggregate: Accurate and Scalable Federated Learning via the Ring-Based Architecture

Dec 06, 2020

Jin-woo Lee, Jaehoon Oh, Sungsu Lim, Se-Young Yun, Jae-Gil Lee

Abstract:Federated learning has emerged as a new paradigm of collaborative machine learning; however, many prior studies have used global aggregation along a star topology without much consideration of communication scalability or of the diurnal property that depends on the variety of clients' local times. In contrast, a ring architecture can resolve the scalability issue and even satisfy the diurnal property by iterating over nodes without an aggregation. Nevertheless, such ring-based algorithms can inherently suffer from the high-variance problem. To this end, we propose a novel algorithm called TornadoAggregate that improves both accuracy and scalability by facilitating the ring architecture. In particular, to improve the accuracy, we reformulate the loss minimization into a variance reduction problem and establish three principles to reduce variance: Ring-Aware Grouping, Small Ring, and Ring Chaining. Experimental results show that TornadoAggregate improved the test accuracy by up to 26.7% and achieved near-linear scalability.
