Jinu Gong

Improving Generalizability of Kolmogorov-Arnold Networks via Error-Correcting Output Codes

May 09, 2025

Youngjoon Lee, Jinu Gong, Joonhyuk Kang

Abstract: Kolmogorov-Arnold Networks (KAN) offer universal function approximation using univariate spline compositions without nonlinear activations. In this work, we integrate Error-Correcting Output Codes (ECOC) into the KAN framework to transform multi-class classification into multiple binary tasks, improving robustness via Hamming-distance decoding. Our proposed KAN with ECOC method outperforms vanilla KAN on a challenging blood cell classification dataset, achieving higher accuracy under diverse hyperparameter settings. Ablation studies further confirm that ECOC consistently enhances performance across FastKAN and FasterKAN variants. These results demonstrate that ECOC integration significantly boosts KAN generalizability in critical healthcare AI applications. To the best of our knowledge, this is the first integration of ECOC with KAN for enhancing multi-class medical image classification performance.

* 4 pages
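
The Hamming-distance decoding mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the 4-class, 7-bit codebook below is the standard exhaustive code for four classes and is chosen here only for illustration, and the names `CODEBOOK`, `binarize`, and `decode` are hypothetical. The seven probabilities stand in for the outputs of seven binary classifiers.

```python
from typing import List, Sequence

# Hypothetical codebook (an assumption, not taken from the paper):
# 4 classes, 7 binary tasks, pairwise Hamming distance 4 between codewords.
CODEBOOK: List[List[int]] = [
    [1, 1, 1, 1, 1, 1, 1],  # class 0
    [0, 0, 0, 0, 1, 1, 1],  # class 1
    [0, 0, 1, 1, 0, 0, 1],  # class 2
    [0, 1, 0, 1, 0, 1, 0],  # class 3
]


def hamming_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Count the positions where two bit strings differ."""
    return sum(x != y for x, y in zip(a, b))


def binarize(probabilities: Sequence[float], threshold: float = 0.5) -> List[int]:
    """Turn per-task probabilities into hard bits."""
    return [1 if p >= threshold else 0 for p in probabilities]


def decode(bits: Sequence[int], codebook: Sequence[Sequence[int]] = CODEBOOK) -> int:
    """Return the class whose codeword is closest in Hamming distance."""
    distances = [hamming_distance(bits, codeword) for codeword in codebook]
    return distances.index(min(distances))


# Outputs for a class-2 sample in which the first binary task is wrong.
outputs = [0.6, 0.1, 0.9, 0.8, 0.2, 0.3, 0.7]
predicted = decode(binarize(outputs))
```

Because the codewords in this codebook are at distance 4 from one another, a single wrong binary output still decodes to the intended class, which is the robustness effect the abstract attributes to ECOC.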

A Unified Benchmark of Federated Learning with Kolmogorov-Arnold Networks for Medical Imaging

Apr 28, 2025

Youngjoon Lee, Jinu Gong, Joonhyuk Kang

Abstract: Federated Learning (FL) enables model training across decentralized devices without sharing raw data, thereby preserving privacy in sensitive domains like healthcare. In this paper, we evaluate Kolmogorov-Arnold Network (KAN) architectures against traditional MLPs across six state-of-the-art FL algorithms on a blood cell classification dataset. Notably, our experiments demonstrate that KAN can effectively replace MLP in federated environments, achieving superior performance with simpler architectures. Furthermore, we analyze the impact of key hyperparameters (grid size and network architecture) on KAN performance under varying degrees of Non-IID data distribution. Additionally, our ablation studies reveal that optimizing KAN width while maintaining minimal depth yields the best performance in federated settings. As a result, these findings establish KAN as a promising alternative for privacy-preserving medical imaging applications in distributed healthcare. To the best of our knowledge, this is the first comprehensive benchmark of KAN in FL settings for medical imaging tasks.

* 5 pages

Revisit the Stability of Vanilla Federated Learning Under Diverse Conditions

Feb 27, 2025

Youngjoon Lee, Jinu Gong, Sun Choi, Joonhyuk Kang

Abstract: Federated Learning (FL) is a distributed machine learning paradigm enabling collaborative model training across decentralized clients while preserving data privacy. In this paper, we revisit the stability of the vanilla FedAvg algorithm under diverse conditions. Despite its conceptual simplicity, FedAvg exhibits remarkably stable performance compared to more advanced FL techniques. Our experiments assess the performance of various FL methods on blood cell and skin lesion classification tasks using Vision Transformer (ViT). Additionally, we evaluate the impact of different representative classification models and analyze sensitivity to hyperparameter variations. The results consistently demonstrate that, regardless of dataset, classification model employed, or hyperparameter settings, FedAvg maintains robust performance. Given its stability and its robust performance without the need for extensive hyperparameter tuning, FedAvg is a safe and efficient choice for FL deployments in resource-constrained hospitals handling medical data. These findings underscore the enduring value of the vanilla FedAvg approach as a trusted baseline for clinical practice.

* 10 pages
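
For readers unfamiliar with FedAvg, its server-side aggregation step is a sample-size-weighted average of the client parameters. The sketch below shows only that step, under the assumption that each client's model is flattened into a plain list of floats; the function name `fedavg` and the toy numbers are illustrative and do not come from the paper.

```python
from typing import List, Sequence


def fedavg(client_weights: Sequence[Sequence[float]],
           client_sizes: Sequence[int]) -> List[float]:
    """Average client parameter vectors, weighted by local dataset size."""
    if len(client_weights) != len(client_sizes) or not client_weights:
        raise ValueError("need one dataset size per client")
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(n * w[d] for w, n in zip(client_weights, client_sizes)) / total
        for d in range(dim)
    ]


# Two hypothetical clients: the second holds three times as much data,
# so the aggregate sits closer to its parameters.
global_weights = fedavg([[1.0, 2.0], [3.0, 4.0]], [1, 3])
```

Apart from the local training settings, this step has nothing to tune, which is consistent with the abstract's point that FedAvg needs little hyperparameter tuning.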

Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation

Jan 30, 2025

Youngjoon Lee, Taehyun Park, Yunho Lee, Jinu Gong, Joonhyuk Kang

Abstract: Federated Learning (FL) is increasingly being adopted in military collaborations to develop Large Language Models (LLMs) while preserving data sovereignty. However, prompt injection attacks (malicious manipulations of input prompts) pose new threats that may undermine operational security, disrupt decision-making, and erode trust among allies. This perspective paper highlights four potential vulnerabilities in federated military LLMs: secret data leakage, free-rider exploitation, system disruption, and misinformation spread. To address these potential risks, we propose a human-AI collaborative framework that introduces both technical and policy countermeasures. On the technical side, our framework uses red/blue team wargaming and quality assurance to detect and mitigate adversarial behaviors of shared LLM weights. On the policy side, it promotes joint AI-human policy development and verification of security protocols. Our findings will guide future research and emphasize proactive strategies for emerging military contexts.

* 7 pages

Embedding Byzantine Fault Tolerance into Federated Learning via Virtual Data-Driven Consistency Scoring Plugin

Nov 15, 2024

Youngjoon Lee, Jinu Gong, Joonhyuk Kang

Abstract: Given sufficient data from multiple edge devices, federated learning (FL) enables training a shared model without transmitting private data to a central server. However, FL is generally vulnerable to Byzantine attacks from compromised edge devices, which can significantly degrade the model performance. In this paper, we propose an intuitive plugin that can be integrated into existing FL techniques to achieve Byzantine resilience. The key idea is to generate virtual data samples and evaluate model consistency scores across local updates to effectively filter out compromised edge devices. By utilizing this scoring mechanism before the aggregation phase, the proposed plugin enables existing FL techniques to become robust against Byzantine attacks while maintaining their original benefits. Numerical results on a medical image classification task validate that plugging the proposed approach into representative FL algorithms effectively achieves Byzantine resilience. Furthermore, the proposed plugin maintains the original convergence properties of the base FL algorithms when no Byzantine attacks are present.

* 7 pages
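
The abstract does not say how the virtual data are generated or how the consistency score is computed, so the following is only a hypothetical illustration of the general idea of scoring local updates on shared synthetic inputs before aggregation. Everything in it is an assumption: the local models are plain linear models, the virtual inputs are random Gaussian vectors, and the score is the mean absolute deviation from the per-sample median prediction.

```python
import random
from statistics import median
from typing import List, Sequence


def predict(weights: Sequence[float], x: Sequence[float]) -> float:
    """Output of a linear model (a stand-in for a real local model)."""
    return sum(w * xi for w, xi in zip(weights, x))


def consistency_scores(updates: Sequence[Sequence[float]],
                       virtual_inputs: Sequence[Sequence[float]]) -> List[float]:
    """Mean absolute deviation of each update from the median prediction."""
    outputs = [[predict(w, x) for x in virtual_inputs] for w in updates]
    reference = [median(column) for column in zip(*outputs)]
    return [
        sum(abs(o - r) for o, r in zip(row, reference)) / len(reference)
        for row in outputs
    ]


def filter_updates(updates: Sequence[Sequence[float]],
                   virtual_inputs: Sequence[Sequence[float]],
                   keep: int) -> List[int]:
    """Return the indices of the `keep` most consistent updates."""
    scores = consistency_scores(updates, virtual_inputs)
    ranked = sorted(range(len(updates)), key=lambda i: scores[i])
    return sorted(ranked[:keep])


rng = random.Random(0)
virtual = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(20)]
honest = [[1.0, 2.0, 3.0], [1.1, 1.9, 3.0], [0.9, 2.1, 3.1]]
byzantine = [[-10.0, 10.0, -10.0]]
kept = filter_updates(honest + byzantine, virtual, keep=3)
```

The retained indices would then be passed to whatever aggregation rule the base FL algorithm uses, which matches the abstract's description of scoring before the aggregation phase.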

Generative AI-Powered Plugin for Robust Federated Learning in Heterogeneous IoT Networks

Oct 31, 2024

Youngjoon Lee, Jinu Gong, Joonhyuk Kang

Abstract: Federated learning enables edge devices to collaboratively train a global model while maintaining data privacy by keeping data localized. However, the Non-IID nature of data distribution across devices often hinders model convergence and reduces performance. In this paper, we propose a novel plugin for federated optimization techniques that approximates Non-IID data distributions to IID through generative AI-enhanced data augmentation and a balanced sampling strategy. The key idea is to synthesize additional data for underrepresented classes on each edge device, leveraging generative AI to create a more balanced dataset across the FL network. Additionally, a balanced sampling approach at the central server selectively includes only the most IID-like devices, accelerating convergence while maximizing the global model's performance. Experimental results validate that our approach significantly improves convergence speed and robustness against data imbalance, establishing a flexible, privacy-preserving FL plugin that is applicable even in data-scarce environments.

* 8 pages

Compressed Particle-Based Federated Bayesian Learning and Unlearning

Sep 19, 2022

Jinu Gong, Osvaldo Simeone, Joonhyuk Kang

Abstract: Conventional frequentist FL schemes are known to yield overconfident decisions. Bayesian FL addresses this issue by allowing agents to process and exchange uncertainty information encoded in distributions over the model parameters. However, this comes at the cost of a larger per-iteration communication overhead. This letter investigates whether Bayesian FL can still provide advantages in terms of calibration when constraining communication bandwidth. We present compressed particle-based Bayesian FL protocols for FL and federated "unlearning" that apply quantization and sparsification across multiple particles. The experimental results confirm that the benefits of Bayesian FL are robust to bandwidth constraints.

* Submitted for publication
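
As a rough illustration of what "quantization and sparsification across multiple particles" can look like, the sketch below applies top-k sparsification followed by uniform quantization to each particle independently. These two compressors are common choices but are assumptions here; the abstract does not state which operators the paper uses, and the function names are hypothetical.

```python
from typing import List, Sequence


def top_k_sparsify(vector: Sequence[float], k: int) -> List[float]:
    """Keep the k largest-magnitude entries and zero out the rest."""
    order = sorted(range(len(vector)), key=lambda i: abs(vector[i]), reverse=True)
    keep = set(order[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(vector)]


def uniform_quantize(vector: Sequence[float], num_levels: int) -> List[float]:
    """Round each entry to the nearest of num_levels evenly spaced values
    on the interval [-scale, scale], where scale is the largest magnitude."""
    if num_levels < 2:
        raise ValueError("num_levels must be at least 2")
    scale = max((abs(v) for v in vector), default=0.0)
    if scale == 0.0:
        return [0.0] * len(vector)
    step = 2.0 * scale / (num_levels - 1)
    return [round((v + scale) / step) * step - scale for v in vector]


def compress_particles(particles: Sequence[Sequence[float]],
                       k: int, num_levels: int) -> List[List[float]]:
    """Sparsify, then quantize, every particle before transmission."""
    return [uniform_quantize(top_k_sparsify(p, k), num_levels) for p in particles]


# One toy particle, keeping 2 entries and using a 3-level grid {-3, 0, 3}.
compressed = compress_particles([[0.1, -3.0, 0.5, 2.0]], k=2, num_levels=3)
```

With only three levels the surviving entry 2.0 is rounded up to 3.0, which shows the kind of distortion a bandwidth constraint introduces and whose effect on calibration the letter investigates.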

Forget-SVGD: Particle-Based Bayesian Federated Unlearning

Nov 23, 2021

Jinu Gong, Osvaldo Simeone, Rahif Kassab, Joonhyuk Kang

Abstract: Variational particle-based Bayesian learning methods have the advantage of not being limited by the bias affecting more conventional parametric techniques. This paper proposes to leverage the flexibility of non-parametric Bayesian approximate inference to develop a novel Bayesian federated unlearning method, referred to as Forget-Stein Variational Gradient Descent (Forget-SVGD). Forget-SVGD builds on SVGD, a particle-based approximate Bayesian inference scheme using gradient-based deterministic updates, and on its distributed (federated) extension known as Distributed SVGD (DSVGD). Upon the completion of federated learning, when one or more participating agents request that their data be "forgotten", Forget-SVGD carries out local SVGD updates at the agents whose data need to be "unlearned", which are interleaved with communication rounds with a parameter server. The proposed method is validated via performance comparisons with non-parametric schemes that train from scratch by excluding data to be forgotten, as well as with existing parametric Bayesian unlearning methods.

* Submitted for conference publication
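
The sketch below shows the basic SVGD update that Forget-SVGD builds on, not Forget-SVGD or DSVGD themselves. It is a one-dimensional toy with an RBF kernel of fixed bandwidth and a Gaussian target N(2, 1); the target, bandwidth, step size, and function names are all illustrative assumptions.

```python
import math
from typing import Callable, List, Sequence


def rbf_kernel(x: float, y: float, bandwidth: float) -> float:
    """RBF kernel k(x, y) = exp(-(x - y)^2 / (2 h^2))."""
    return math.exp(-((x - y) ** 2) / (2.0 * bandwidth ** 2))


def svgd_step(particles: Sequence[float],
              grad_log_p: Callable[[float], float],
              step_size: float = 0.1,
              bandwidth: float = 1.0) -> List[float]:
    """One deterministic SVGD update of all particles."""
    n = len(particles)
    updated = []
    for xi in particles:
        phi = 0.0
        for xj in particles:
            k = rbf_kernel(xj, xi, bandwidth)
            attraction = k * grad_log_p(xj)            # pulls toward high density
            repulsion = k * (xi - xj) / bandwidth ** 2  # derivative of k w.r.t. xj
            phi += attraction + repulsion
        updated.append(xi + step_size * phi / n)
    return updated


def grad_log_gaussian(x: float) -> float:
    """Gradient of the log-density of the toy target N(2, 1)."""
    return -(x - 2.0)


particles = [-1.0, -0.5, 0.0, 0.5, 1.0]
for _ in range(1000):
    particles = svgd_step(particles, grad_log_gaussian)
mean = sum(particles) / len(particles)
```

The attraction term moves particles toward regions of high target density, while the repulsion term keeps them from collapsing onto a single point, so the particle set as a whole approximates the target distribution.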

Bayesian Variational Federated Learning and Unlearning in Decentralized Networks

Apr 08, 2021

Jinu Gong, Osvaldo Simeone, Joonhyuk Kang

Abstract: Federated Bayesian learning offers a principled framework for the definition of collaborative training algorithms that are able to quantify epistemic uncertainty and to produce trustworthy decisions. Upon the completion of collaborative training, an agent may decide to exercise her legal "right to be forgotten", which calls for her contribution to the jointly trained model to be deleted and discarded. This paper studies federated learning and unlearning in a decentralized network within a Bayesian framework. It specifically develops federated variational inference (VI) solutions based on the decentralized solution of local free energy minimization problems within exponential-family models and on local gossip-driven communication. The proposed protocols are demonstrated to yield efficient unlearning mechanisms.

* Submitted for conference publication
