Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexander Jung

Enforcing Fundamental Relations via Adversarial Attacks on Input Parameter Correlations

Jan 09, 2025

Timo Saala, Lucie Flek, Alexander Jung, Akbar Karimi, Alexander Schmidt, Matthias Schott, Philipp Soldin, Christopher Wiebusch

Abstract:Correlations between input parameters play a crucial role in many scientific classification tasks, since these are often related to fundamental laws of nature. For example, in high energy physics, one of the common deep learning use-cases is the classification of signal and background processes in particle collisions. In many such cases, the fundamental principles of the correlations between observables are often better understood than the actual distributions of the observables themselves. In this work, we present a new adversarial attack algorithm called Random Distribution Shuffle Attack (RDSA), emphasizing the correlations between observables in the network rather than individual feature characteristics. Correct application of the proposed novel attack can result in a significant improvement in classification performance - particularly in the context of data augmentation - when using the generated adversaries within adversarial training. Given that correlations between input features are also crucial in many other disciplines. We demonstrate the RDSA effectiveness on six classification tasks, including two particle collision challenges (using CERN Open Data), hand-written digit recognition (MNIST784), human activity recognition (HAR), weather forecasting (Rain in Australia), and ICU patient mortality (MIMIC-IV), demonstrating a general use case beyond fundamental physics for this new type of adversarial attack algorithms.

* 12 pages, 8 figures (Without appendix)

Via

Access Paper or Ask Questions

Engineering Trustworthy AI: A Developer Guide for Empirical Risk Minimization

Oct 25, 2024

Diana Pfau, Alexander Jung

Abstract:AI systems increasingly shape critical decisions across personal and societal domains. While empirical risk minimization (ERM) drives much of the AI success, it typically prioritizes accuracy over trustworthiness, often resulting in biases, opacity, and other adverse effects. This paper discusses how key requirements for trustworthy AI can be translated into design choices for the components of ERM. We hope to provide actionable guidance for building AI systems that meet emerging standards for trustworthiness of AI.

Via

Access Paper or Ask Questions

Personalized Federated Learning via Active Sampling

Sep 03, 2024

Alexander Jung, Yasmin SarcheshmehPour, Amirhossein Mohammadi

Abstract:Consider a collection of data generators which could represent, e.g., humans equipped with a smart-phone or wearables. We want to train a personalized (or tailored) model for each data generator even if they provide only small local datasets. The available local datasets might fail to provide sufficient statistical power to train high-dimensional models (such as deep neural networks) effectively. One possible solution is to identify similar data generators and pool their local datasets to obtain a sufficiently large training set. This paper proposes a novel method for sequentially identifying similar (or relevant) data generators. Our method is similar in spirit to active sampling methods but does not require exchange of raw data. Indeed, our method evaluates the relevance of a data generator by evaluating the effect of a gradient step using its local dataset. This evaluation can be performed in a privacy-friendly fashion without sharing raw data. We extend this method to non-parametric models by a suitable generalization of the gradient step to update a hypothesis using the local dataset provided by a data generator.

Via

Access Paper or Ask Questions

Moreau Envelope ADMM for Decentralized Weakly Convex Optimization

Aug 31, 2023

Reza Mirzaeifard, Naveen K. D. Venkategowda, Alexander Jung, Stefan Werner

Figure 1 for Moreau Envelope ADMM for Decentralized Weakly Convex Optimization

Figure 2 for Moreau Envelope ADMM for Decentralized Weakly Convex Optimization

Figure 3 for Moreau Envelope ADMM for Decentralized Weakly Convex Optimization

Abstract:This paper proposes a proximal variant of the alternating direction method of multipliers (ADMM) for distributed optimization. Although the current versions of ADMM algorithm provide promising numerical results in producing solutions that are close to optimal for many convex and non-convex optimization problems, it remains unclear if they can converge to a stationary point for weakly convex and locally non-smooth functions. Through our analysis using the Moreau envelope function, we demonstrate that MADM can indeed converge to a stationary point under mild conditions. Our analysis also includes computing the bounds on the amount of change in the dual variable update step by relating the gradient of the Moreau envelope function to the proximal function. Furthermore, the results of our numerical experiments indicate that our method is faster and more robust than widely-used approaches.

Via

Access Paper or Ask Questions

Rethinking Drone-Based Search and Rescue with Aerial Person Detection

Nov 17, 2021

Pasi Pyrrö, Hassan Naseri, Alexander Jung

Figure 1 for Rethinking Drone-Based Search and Rescue with Aerial Person Detection

Figure 2 for Rethinking Drone-Based Search and Rescue with Aerial Person Detection

Figure 3 for Rethinking Drone-Based Search and Rescue with Aerial Person Detection

Figure 4 for Rethinking Drone-Based Search and Rescue with Aerial Person Detection

Abstract:The visual inspection of aerial drone footage is an integral part of land search and rescue (SAR) operations today. Since this inspection is a slow, tedious and error-prone job for humans, we propose a novel deep learning algorithm to automate this aerial person detection (APD) task. We experiment with model architecture selection, online data augmentation, transfer learning, image tiling and several other techniques to improve the test performance of our method. We present the novel Aerial Inspection RetinaNet (AIR) algorithm as the combination of these contributions. The AIR detector demonstrates state-of-the-art performance on a commonly used SAR test data set in terms of both precision (~21 percentage point increase) and speed. In addition, we provide a new formal definition for the APD problem in SAR missions. That is, we propose a novel evaluation scheme that ranks detectors in terms of real-world SAR localization requirements. Finally, we propose a novel postprocessing method for robust, approximate object localization: the merging of overlapping bounding boxes (MOB) algorithm. This final processing stage used in the AIR detector significantly improves its performance and usability in the face of real-world aerial SAR missions.

* 10 pages, 5 figures, 3 tables, 1 algorithm

Via

Access Paper or Ask Questions

Networked Federated Multi-Task Learning

May 26, 2021

Yasmin SarcheshmehPour, Yu Tian, Linli Zhang, Alexander Jung

Figure 1 for Networked Federated Multi-Task Learning

Figure 2 for Networked Federated Multi-Task Learning

Figure 3 for Networked Federated Multi-Task Learning

Abstract:Many important application domains generate distributed collections of heterogeneous local datasets. These local datasets are often related via an intrinsic network structure that arises from domain-specific notions of similarity between local datasets. Different notions of similarity are induced by spatiotemporal proximity, statistical dependencies, or functional relations. We use this network structure to adaptively pool similar local datasets into nearly homogenous training sets for learning tailored models. Our main conceptual contribution is to formulate networked federated learning using the concept of generalized total variation (GTV) minimization as a regularizer. This formulation is highly flexible and can be combined with almost any parametric model including Lasso or deep neural networks. We unify and considerably extend some well-known approaches to federated multi-task learning. Our main algorithmic contribution is a novel federated learning algorithm that is well suited for distributed computing environments such as edge computing over wireless networks. This algorithm is robust against model misspecification and numerical errors arising from limited computational resources including processing time or wireless channel bandwidth. As our main technical contribution, we offer precise conditions on the local models as well on their network structure such that our algorithm learns nearly optimal local models. Our analysis reveals an interesting interplay between the (information-) geometry of local models and the (cluster-) geometry of their network.

Via

Access Paper or Ask Questions

Local Graph Clustering with Network Lasso

Apr 25, 2020

Alexander Jung

Figure 1 for Local Graph Clustering with Network Lasso

Figure 2 for Local Graph Clustering with Network Lasso

Figure 3 for Local Graph Clustering with Network Lasso

Abstract:We study the statistical and computational properties of a network Lasso method for local graph clustering. The clusters delivered by nLasso can be characterized elegantly via network flows between cluster boundary and seed nodes. While spectral clustering methods are guided by a minimization of the graph Laplacian quadratic form, nLasso minimizes the total variation of cluster indicator signals. As demonstrated theoretically and numerically, nLasso methods can handle very sparse clusters (chain-like) which are difficult for spectral clustering. We also verify that a primal-dual method for non-smooth optimization allows to approximate nLasso solutions with optimal worst-case convergence rate.

Via

Access Paper or Ask Questions

An Information-Theoretic Approach to Personalized Explainable Machine Learning

Mar 15, 2020

Alexander Jung, Pedro H. J. Nardelli

Figure 1 for An Information-Theoretic Approach to Personalized Explainable Machine Learning

Figure 2 for An Information-Theoretic Approach to Personalized Explainable Machine Learning

Abstract:Automated decision making is used routinely throughout our everyday life. Recommender systems decide which jobs, movies, or other user profiles might be interesting to us. Spell checkers help us to make good use of language. Fraud detection systems decide if a credit card transactions should be verified more closely. Many of these decision making systems use machine learning methods that fit complex models to massive datasets. The successful deployment of machine learning (ML) methods to many (critical) application domains crucially depends on its explainability. Indeed, humans have a strong desire to get explanations that resolve the uncertainty about experienced phenomena like the predictions and decisions obtained from ML methods. Explainable ML is challenging since explanations must be tailored (personalized) to individual users with varying backgrounds. Some users might have received university-level education in ML, while other users might have no formal training in linear algebra. Linear regression with few features might be perfectly interpretable for the first group but might be considered a black-box by the latter. We propose a simple probabilistic model for the predictions and user knowledge. This model allows to study explainable ML using information theory. Explaining is here considered as the task of reducing the "surprise" incurred by a prediction. We quantify the effect of an explanation by the conditional mutual information between the explanation and prediction, given the user background.

Via

Access Paper or Ask Questions

Basic Principles of Clustering Methods

Dec 10, 2019

Alexander Jung, Ivan Baranov

Figure 1 for Basic Principles of Clustering Methods

Figure 2 for Basic Principles of Clustering Methods

Figure 3 for Basic Principles of Clustering Methods

Figure 4 for Basic Principles of Clustering Methods

Abstract:Clustering methods group a set of data points into a few coherent groups or clusters of similar data points. As an example, consider clustering pixels in an image (or video) if they belong to the same object. Different clustering methods are obtained by using different notions of similarity and different representations of data points.

Via

Access Paper or Ask Questions

Clustering in Partially Labeled Stochastic Block Models via Total Variation Minimization

Nov 03, 2019

Alexander Jung

Figure 1 for Clustering in Partially Labeled Stochastic Block Models via Total Variation Minimization

Figure 2 for Clustering in Partially Labeled Stochastic Block Models via Total Variation Minimization

Figure 3 for Clustering in Partially Labeled Stochastic Block Models via Total Variation Minimization

Abstract:A main task in data analysis is to organize data points into coherent groups or clusters. The stochastic block model is a probabilistic model for the cluster structure. This model prescribes different probabilities for the presence of edges within a cluster and between different clusters. We assume that the cluster assignments are known for at least one data point in each cluster. In such a partially labeled stochastic block model, clustering amounts to estimating the cluster assignments of the remaining data points. We study total variation minimization as a method for this clustering task. We implement the resulting clustering algorithm as a highly scalable message passing protocol. We also provide a condition on the model parameters such that total variation minimization allows for accurate clustering.

Via

Access Paper or Ask Questions