Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Christopher Zach

Certifiably Optimal Anisotropic Rotation Averaging

Mar 10, 2025

Carl Olsson, Yaroslava Lochman, Johan Malmport, Christopher Zach

Abstract:Rotation averaging is a key subproblem in applications of computer vision and robotics. Many methods for solving this problem exist, and there are also several theoretical results analyzing difficulty and optimality. However, one aspect that most of these have in common is a focus on the isotropic setting, where the intrinsic uncertainties in the measurements are not fully incorporated into the resulting optimization task. Recent empirical results suggest that moving to an anisotropic framework, where these uncertainties are explicitly included, can result in an improvement of solution quality. However, global optimization for rotation averaging has remained a challenge in this scenario. In this paper we show how anisotropic costs can be incorporated in certifiably optimal rotation averaging. We also demonstrate how existing solvers, designed for isotropic situations, fail in the anisotropic setting. Finally, we propose a stronger relaxation and show empirically that it is able to recover global optima in all tested datasets and leads to a more accurate reconstruction in all but one of the scenes.

Via

Access Paper or Ask Questions

Strengthening the Internal Adversarial Robustness in Lifted Neural Networks

Mar 10, 2025

Christopher Zach

Abstract:Lifted neural networks (i.e. neural architectures explicitly optimizing over respective network potentials to determine the neural activities) can be combined with a type of adversarial training to gain robustness for internal as well as input layers, in addition to improved generalization performance. In this work we first investigate how adversarial robustness in this framework can be further strengthened by solely modifying the training loss. In a second step we fix some remaining limitations and arrive at a novel training loss for lifted neural networks, that combines targeted and untargeted adversarial perturbations.

* 13 pages

Via

Access Paper or Ask Questions

Text in the Dark: Extremely Low-Light Text Image Enhancement

Apr 22, 2024

Che-Tsung Lin, Chun Chet Ng, Zhi Qin Tan, Wan Jun Nah, Xinyu Wang, Jie Long Kew, Pohao Hsu, Shang Hong Lai, Chee Seng Chan, Christopher Zach

Figure 1 for Text in the Dark: Extremely Low-Light Text Image Enhancement

Figure 2 for Text in the Dark: Extremely Low-Light Text Image Enhancement

Figure 3 for Text in the Dark: Extremely Low-Light Text Image Enhancement

Figure 4 for Text in the Dark: Extremely Low-Light Text Image Enhancement

Abstract:Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text tasks. Further research is also hindered by the lack of extremely low-light text datasets. To address these limitations, we propose a novel encoder-decoder framework with an edge-aware attention module to focus on scene text regions during enhancement. Our proposed method uses novel text detection and edge reconstruction losses to emphasize low-level scene text features, leading to successful text extraction. Additionally, we present a Supervised Deep Curve Estimation (Supervised-DCE) model to synthesize extremely low-light images based on publicly available scene text datasets such as ICDAR15 (IC15). We also labeled texts in the extremely low-light See In the Dark (SID) and ordinary LOw-Light (LOL) datasets to allow for objective assessment of extremely low-light image enhancement through scene text tasks. Extensive experiments show that our model outperforms state-of-the-art methods in terms of both image quality and scene text metrics on the widely-used LOL, SID, and synthetic IC15 datasets. Code and dataset will be released publicly at https://github.com/chunchet-ng/Text-in-the-Dark.

* The first two authors contributed equally to this work

Via

Access Paper or Ask Questions

Two Tales of Single-Phase Contrastive Hebbian Learning

Feb 13, 2024

Rasmus Kjær Høier, Christopher Zach

Abstract:The search for "biologically plausible" learning algorithms has converged on the idea of representing gradients as activity differences. However, most approaches require a high degree of synchronization (distinct phases during learning) and introduce substantial computational overhead, which raises doubts regarding their biological plausibility as well as their potential utility for neuromorphic computing. Furthermore, they commonly rely on applying infinitesimal perturbations (nudges) to output units, which is impractical in noisy environments. Recently it has been shown that by modelling artificial neurons as dyads with two oppositely nudged compartments, it is possible for a fully local learning algorithm named ``dual propagation'' to bridge the performance gap to backpropagation, without requiring separate learning phases or infinitesimal nudging. However, the algorithm has the drawback that its numerical stability relies on symmetric nudging, which may be restrictive in biological and analog implementations. In this work we first provide a solid foundation for the objective underlying the dual propagation method, which also reveals a surprising connection with adversarial robustness. Second, we demonstrate how dual propagation is related to a particular adjoint state method, which is stable regardless of asymmetric nudging.

* 18 pages

Via

Access Paper or Ask Questions

Fully Variational Noise-Contrastive Estimation

Apr 04, 2023

Christopher Zach

Abstract:By using the underlying theory of proper scoring rules, we design a family of noise-contrastive estimation (NCE) methods that are tractable for latent variable models. Both terms in the underlying NCE loss, the one using data samples and the one using noise samples, can be lower-bounded as in variational Bayes, therefore we call this family of losses fully variational noise-contrastive estimation. Variational autoencoders are a particular example in this family and therefore can be also understood as separating real data from synthetic samples using an appropriate classification loss. We further discuss other instances in this family of fully variational NCE objectives and indicate differences in their empirical behavior.

* SCIA 2023, 13 pages

Via

Access Paper or Ask Questions

Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons

Feb 02, 2023

Rasmus Høier, D. Staudt, Christopher Zach

Figure 1 for Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons

Figure 2 for Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons

Figure 3 for Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons

Figure 4 for Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons

Abstract:Activity difference based learning algorithms-such as contrastive Hebbian learning and equilibrium propagation-have been proposed as biologically plausible alternatives to error back-propagation. However, on traditional digital chips these algorithms suffer from having to solve a costly inference problem twice, making these approaches more than two orders of magnitude slower than back-propagation. In the analog realm equilibrium propagation may be promising for fast and energy efficient learning, but states still need to be inferred and stored twice. Inspired by lifted neural networks and compartmental neuron models we propose a simple energy based compartmental neuron model, termed dual propagation, in which each neuron is a dyad with two intrinsic states. At inference time these intrinsic states encode the error/activity duality through their difference and their mean respectively. The advantage of this method is that only a single inference phase is needed and that inference can be solved in layerwise closed-form. Experimentally we show on common computer vision datasets, including Imagenet32x32, that dual propagation performs equivalently to back-propagation both in terms of accuracy and runtime.

Via

Access Paper or Ask Questions

Extremely Low-light Image Enhancement with Scene Text Restoration

Apr 01, 2022

Pohao Hsu, Che-Tsung Lin, Chun Chet Ng, Jie-Long Kew, Mei Yih Tan, Shang-Hong Lai, Chee Seng Chan, Christopher Zach

Figure 1 for Extremely Low-light Image Enhancement with Scene Text Restoration

Figure 2 for Extremely Low-light Image Enhancement with Scene Text Restoration

Figure 3 for Extremely Low-light Image Enhancement with Scene Text Restoration

Figure 4 for Extremely Low-light Image Enhancement with Scene Text Restoration

Abstract:Deep learning-based methods have made impressive progress in enhancing extremely low-light images - the image quality of the reconstructed images has generally improved. However, we found out that most of these methods could not sufficiently recover the image details, for instance, the texts in the scene. In this paper, a novel image enhancement framework is proposed to precisely restore the scene texts, as well as the overall quality of the image simultaneously under extremely low-light images conditions. Mainly, we employed a self-regularised attention map, an edge map, and a novel text detection loss. In addition, leveraging synthetic low-light images is beneficial for image enhancement on the genuine ones in terms of text detection. The quantitative and qualitative experimental results have shown that the proposed model outperforms state-of-the-art methods in image restoration, text detection, and text spotting on See In the Dark and ICDAR15 datasets.

Via

Access Paper or Ask Questions

AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks

Dec 06, 2021

Huu Le, Rasmus Kjær Høier, Che-Tsung Lin, Christopher Zach

Figure 1 for AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks

Figure 2 for AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks

Figure 3 for AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks

Figure 4 for AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks

Abstract:We propose a new algorithm for training deep neural networks (DNNs) with binary weights. In particular, we first cast the problem of training binary neural networks (BiNNs) as a bilevel optimization instance and subsequently construct flexible relaxations of this bilevel program. The resulting training method shares its algorithmic simplicity with several existing approaches to train BiNNs, in particular with the straight-through gradient estimator successfully employed in BinaryConnect and subsequent methods. In fact, our proposed method can be interpreted as an adaptive variant of the original straight-through estimator that conditionally (but not always) acts like a linear mapping in the backward pass of error propagation. Experimental results demonstrate that our new algorithm offers favorable performance compared to existing approaches.

* 18 pages

Via

Access Paper or Ask Questions

BabelCalib: A Universal Approach to Calibrating Central Cameras

Sep 20, 2021

Yaroslava Lochman, Kostiantyn Liepieshov, Jianhui Chen, Michal Perdoch, Christopher Zach, James Pritts

Figure 1 for BabelCalib: A Universal Approach to Calibrating Central Cameras

Figure 2 for BabelCalib: A Universal Approach to Calibrating Central Cameras

Figure 3 for BabelCalib: A Universal Approach to Calibrating Central Cameras

Figure 4 for BabelCalib: A Universal Approach to Calibrating Central Cameras

Abstract:Existing calibration methods occasionally fail for large field-of-view cameras due to the non-linearity of the underlying problem and the lack of good initial values for all parameters of the used camera model. This might occur because a simpler projection model is assumed in an initial step, or a poor initial guess for the internal parameters is pre-defined. A lot of the difficulties of general camera calibration lie in the use of a forward projection model. We side-step these challenges by first proposing a solver to calibrate the parameters in terms of a back-projection model and then regress the parameters for a target forward model. These steps are incorporated in a robust estimation framework to cope with outlying detections. Extensive experiments demonstrate that our approach is very reliable and returns the most accurate calibration parameters as measured on the downstream task of absolute pose estimation on test sets. The code is released at https://github.com/ylochman/babelcalib.

Via

Access Paper or Ask Questions

Bilevel Programming and Deep Learning: A Unifying View on Inference Learning Methods

May 15, 2021

Christopher Zach

Figure 1 for Bilevel Programming and Deep Learning: A Unifying View on Inference Learning Methods

Figure 2 for Bilevel Programming and Deep Learning: A Unifying View on Inference Learning Methods

Abstract:In this work we unify a number of inference learning methods, that are proposed in the literature as alternative training algorithms to the ones based on regular error back-propagation. These inference learning methods were developed with very diverse motivations, mainly aiming to enhance the biological plausibility of deep neural networks and to improve the intrinsic parallelism of training methods. We show that these superficially very different methods can all be obtained by successively applying a particular reformulation of bilevel optimization programs. As a by-product it becomes also evident that all considered inference learning methods include back-propagation as a special case, and therefore at least approximate error back-propagation in typical settings. Finally, we propose Fenchel back-propagation, that replaces the propagation of infinitesimal corrections performed in standard back-propagation with finite targets as the learning signal. Fenchel back-propagation can therefore be seen as an instance of learning via explicit target propagation.

* 16 pages

Via

Access Paper or Ask Questions