Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thomas Köhler

AAA-Gaussians: Anti-Aliased and Artifact-Free 3D Gaussian Rendering

Apr 17, 2025

Michael Steiner, Thomas Köhler, Lukas Radl, Felix Windisch, Dieter Schmalstieg, Markus Steinberger

Abstract:Although 3D Gaussian Splatting (3DGS) has revolutionized 3D reconstruction, it still faces challenges such as aliasing, projection artifacts, and view inconsistencies, primarily due to the simplification of treating splats as 2D entities. We argue that incorporating full 3D evaluation of Gaussians throughout the 3DGS pipeline can effectively address these issues while preserving rasterization efficiency. Specifically, we introduce an adaptive 3D smoothing filter to mitigate aliasing and present a stable view-space bounding method that eliminates popping artifacts when Gaussians extend beyond the view frustum. Furthermore, we promote tile-based culling to 3D with screen-space planes, accelerating rendering and reducing sorting costs for hierarchical rasterization. Our method achieves state-of-the-art quality on in-distribution evaluation sets and significantly outperforms other approaches for out-of-distribution views. Our qualitative evaluations further demonstrate the effective removal of aliasing, distortions, and popping artifacts, ensuring real-time, artifact-free rendering.

Via

Access Paper or Ask Questions

LORD: Leveraging Open-Set Recognition with Unknown Data

Aug 24, 2023

Tobias Koch, Christian Riess, Thomas Köhler

Figure 1 for LORD: Leveraging Open-Set Recognition with Unknown Data

Figure 2 for LORD: Leveraging Open-Set Recognition with Unknown Data

Figure 3 for LORD: Leveraging Open-Set Recognition with Unknown Data

Figure 4 for LORD: Leveraging Open-Set Recognition with Unknown Data

Abstract:Handling entirely unknown data is a challenge for any deployed classifier. Classification models are typically trained on a static pre-defined dataset and are kept in the dark for the open unassigned feature space. As a result, they struggle to deal with out-of-distribution data during inference. Addressing this task on the class-level is termed open-set recognition (OSR). However, most OSR methods are inherently limited, as they train closed-set classifiers and only adapt the downstream predictions to OSR. This work presents LORD, a framework to Leverage Open-set Recognition by exploiting unknown Data. LORD explicitly models open space during classifier training and provides a systematic evaluation for such approaches. We identify three model-agnostic training strategies that exploit background data and applied them to well-established classifiers. Due to LORD's extensive evaluation protocol, we consistently demonstrate improved recognition of unknown data. The benchmarks facilitate in-depth analysis across various requirement levels. To mitigate dependency on extensive and costly background datasets, we explore mixup as an off-the-shelf data generation technique. Our experiments highlight mixup's effectiveness as a substitute for background datasets. Lightweight constraints on mixup synthesis further improve OSR performance.

* Accepted at ICCV 2023 Workshop (Out-Of-Distribution Generalization in Computer Vision)

Via

Access Paper or Ask Questions

Exploring the Open World Using Incremental Extreme Value Machines

May 30, 2022

Tobias Koch, Felix Liebezeit, Christian Riess, Vincent Christlein, Thomas Köhler

Figure 1 for Exploring the Open World Using Incremental Extreme Value Machines

Figure 2 for Exploring the Open World Using Incremental Extreme Value Machines

Figure 3 for Exploring the Open World Using Incremental Extreme Value Machines

Figure 4 for Exploring the Open World Using Incremental Extreme Value Machines

Abstract:Dynamic environments require adaptive applications. One particular machine learning problem in dynamic environments is open world recognition. It characterizes a continuously changing domain where only some classes are seen in one batch of the training data and such batches can only be learned incrementally. Open world recognition is a demanding task that is, to the best of our knowledge, addressed by only a few methods. This work introduces a modification of the widely known Extreme Value Machine (EVM) to enable open world recognition. Our proposed method extends the EVM with a partial model fitting function by neglecting unaffected space during an update. This reduces the training time by a factor of 28. In addition, we provide a modified model reduction using weighted maximum K-set cover to strictly bound the model complexity and reduce the computational effort by a factor of 3.5 from 2.1 s to 0.6 s. In our experiments, we rigorously evaluate openness with two novel evaluation protocols. The proposed method achieves superior accuracy of about 12 % and computational efficiency in the tasks of image classification and face recognition.

* Accepted at ICPR 2022

Via

Access Paper or Ask Questions

Joint Super-Resolution and Rectification for Solar Cell Inspection

Nov 10, 2020

Mathis Hoffmann, Thomas Köhler, Bernd Doll, Frank Schebesch, Florian Talkenberg, Ian Marius Peters, Christoph J. Brabec, Andreas Maier, Vincent Christlein

Figure 1 for Joint Super-Resolution and Rectification for Solar Cell Inspection

Figure 2 for Joint Super-Resolution and Rectification for Solar Cell Inspection

Figure 3 for Joint Super-Resolution and Rectification for Solar Cell Inspection

Figure 4 for Joint Super-Resolution and Rectification for Solar Cell Inspection

Abstract:Visual inspection of solar modules is an important monitoring facility in photovoltaic power plants. Since a single measurement of fast CMOS sensors is limited in spatial resolution and often not sufficient to reliably detect small defects, we apply multi-frame super-resolution (MFSR) to a sequence of low resolution measurements. In addition, the rectification and removal of lens distortion simplifies subsequent analysis. Therefore, we propose to fuse this pre-processing with standard MFSR algorithms. This is advantageous, because we omit a separate processing step, the motion estimation becomes more stable and the spacing of high-resolution (HR) pixels on the rectified module image becomes uniform w.r.t. the module plane, regardless of perspective distortion. We present a comprehensive user study showing that MFSR is beneficial for defect recognition by human experts and that the proposed method performs better than the state of the art. Furthermore, we apply automated crack segmentation and show that the proposed method performs 3x better than bicubic upsampling and 2x better than the state of the art for automated inspection.

Via

Access Paper or Ask Questions

Merging-ISP: Multi-Exposure High Dynamic Range Image Signal Processing

Nov 12, 2019

Prashant Chaudhari, Franziska Schirrmacher, Andreas Maier, Christian Riess, Thomas Köhler

Figure 1 for Merging-ISP: Multi-Exposure High Dynamic Range Image Signal Processing

Figure 2 for Merging-ISP: Multi-Exposure High Dynamic Range Image Signal Processing

Figure 3 for Merging-ISP: Multi-Exposure High Dynamic Range Image Signal Processing

Figure 4 for Merging-ISP: Multi-Exposure High Dynamic Range Image Signal Processing

Abstract:The image signal processing pipeline (ISP) is a core element of digital cameras to capture high-quality displayable images from raw data. In high dynamic range (HDR) imaging, ISPs include steps like demosaicing of raw color filter array (CFA) data at different exposure times, alignment of the exposures, conversion to HDR domain, and exposure merging into an HDR image. Traditionally, such pipelines are built by cascading algorithms addressing the individual subtasks. However, cascaded designs suffer from error propagations since simply combining multiple processing steps is not necessarily optimal for the entire imaging task. This paper proposes a multi-exposure high dynamic range image signal processing pipeline (Merging-ISP) to jointly solve all subtasks for HDR imaging. Our pipeline is modeled by a deep neural network architecture. As such, it is end-to-end trainable, circumvents the use of complex, hand-crafted algorithms in its core, and mitigates error propagation. Merging-ISP enables direct reconstructions of HDR images from multiple differently exposed raw CFA images captured from dynamic scenes. We compared Merging-ISP against different alternative cascaded pipelines. End-to-end learning leads to HDR reconstructions of high perceptual quality and quantitatively outperforms competing ISPs by more than 1 dB in terms of PSNR.

* 8 pages, 6 figures, 2 tables, WACV-2020 conference

Via

Access Paper or Ask Questions

Multi-Frame Super-Resolution Reconstruction with Applications to Medical Imaging

Dec 21, 2018

Thomas Köhler

Abstract:The optical resolution of a digital camera is one of its most crucial parameters with broad relevance for consumer electronics, surveillance systems, remote sensing, or medical imaging. However, resolution is physically limited by the optics and sensor characteristics. In addition, practical and economic reasons often stipulate the use of out-dated or low-cost hardware. Super-resolution is a class of retrospective techniques that aims at high-resolution imagery by means of software. Multi-frame algorithms approach this task by fusing multiple low-resolution frames to reconstruct high-resolution images. This work covers novel super-resolution methods along with new applications in medical imaging.

* Ph.D. thesis at the Friedrich-Alexander-Universit\"at (FAU) Erlangen-N\"urnberg; source code is available at https://www5.cs.fau.de/de/forschung/software/multi-frame-super-resolution-toolbox/ . https://opus4.kobv.de/opus4-fau/frontdoor/index/index/docId/9145

Via

Access Paper or Ask Questions

Bridging the Simulated-to-Real Gap: Benchmarking Super-Resolution on Real Data

Sep 17, 2018

Thomas Köhler, Michel Bätz, Farzad Naderi, André Kaup, Andreas Maier, Christian Riess

Figure 1 for Bridging the Simulated-to-Real Gap: Benchmarking Super-Resolution on Real Data

Figure 2 for Bridging the Simulated-to-Real Gap: Benchmarking Super-Resolution on Real Data

Figure 3 for Bridging the Simulated-to-Real Gap: Benchmarking Super-Resolution on Real Data

Figure 4 for Bridging the Simulated-to-Real Gap: Benchmarking Super-Resolution on Real Data

Abstract:Capturing ground truth data to benchmark super-resolution (SR) is challenging. Therefore, current quantitative studies are mainly evaluated on simulated data artificially sampled from ground truth images. We argue that such evaluations overestimate the actual performance of SR methods compared to their behavior on real images. To bridge this simulated-to-real gap, we introduce the Super-Resolution Erlangen (SupER) database, the first comprehensive laboratory SR database of all-real acquisitions with pixel-wise ground truth. It consists of more than 80k images of 14 scenes combining different facets: CMOS sensor noise, real sampling at four resolution levels, nine scene motion types, two photometric conditions, and lossy video coding at five levels. As such, the database exceeds existing benchmarks by an order of magnitude in quality and quantity. This paper also benchmarks 19 popular single-image and multi-frame algorithms on our data. The benchmark comprises a quantitative study by exploiting ground truth data and qualitative evaluations in a large-scale observer study. We also rigorously investigate agreements between both evaluations from a statistical perspective. One interesting result is that top-performing methods on simulated data may be surpassed by others on real data. Our insights can spur further algorithm development, and the publicy available dataset can foster future evaluations.

* Data and source code available at https://superresolution.tf.fau.de/

Via

Access Paper or Ask Questions

Temporal and Volumetric Denoising via Quantile Sparse Image (QuaSI) Prior in Optical Coherence Tomography and Beyond

Jul 05, 2018

Franziska Schirrmacher, Thomas Köhler, Tobias Lindenberger, Lennart Husvogt, Jürgen Endres, James G. Fujimoto, Joachim Hornegger, Arnd Dörfler, Philip Hoelter, Andreas K. Maier

Figure 1 for Temporal and Volumetric Denoising via Quantile Sparse Image (QuaSI) Prior in Optical Coherence Tomography and Beyond

Figure 2 for Temporal and Volumetric Denoising via Quantile Sparse Image (QuaSI) Prior in Optical Coherence Tomography and Beyond

Figure 3 for Temporal and Volumetric Denoising via Quantile Sparse Image (QuaSI) Prior in Optical Coherence Tomography and Beyond

Figure 4 for Temporal and Volumetric Denoising via Quantile Sparse Image (QuaSI) Prior in Optical Coherence Tomography and Beyond

Abstract:This paper introduces an universal and structure-preserving regularization term, called quantile sparse image (QuaSI) prior. The prior is suitable for denoising images from various medical imaging modalities. We demonstrate its effectiveness on volumetric optical coherence tomography (OCT) and computed tomography (CT) data, which show different noise and image characteristics. OCT offers high-resolution scans of the human retina but is inherently impaired by speckle noise. CT on the other hand has a lower resolution and shows high-frequency noise. For the purpose of denoising, we propose a variational framework based on the QuaSI prior and a Huber data fidelity model that can handle 3-D and 3-D+t data. Efficient optimization is facilitated through the use of an alternating direction method of multipliers (ADMM) scheme and the linearization of the quantile filter. Experiments on multiple datasets emphasize the excellent performance of the proposed method.

* Accepted for MICCAI2017 special issue

Via

Access Paper or Ask Questions

Learning from a Handful Volumes: MRI Resolution Enhancement with Volumetric Super-Resolution Forests

Feb 15, 2018

Aline Sindel, Katharina Breininger, Johannes Käßer, Andreas Hess, Andreas Maier, Thomas Köhler

Figure 1 for Learning from a Handful Volumes: MRI Resolution Enhancement with Volumetric Super-Resolution Forests

Figure 2 for Learning from a Handful Volumes: MRI Resolution Enhancement with Volumetric Super-Resolution Forests

Figure 3 for Learning from a Handful Volumes: MRI Resolution Enhancement with Volumetric Super-Resolution Forests

Figure 4 for Learning from a Handful Volumes: MRI Resolution Enhancement with Volumetric Super-Resolution Forests

Abstract:Magnetic resonance imaging (MRI) enables 3-D imaging of anatomical structures. However, the acquisition of MR volumes with high spatial resolution leads to long scan times. To this end, we propose volumetric super-resolution forests (VSRF) to enhance MRI resolution retrospectively. Our method learns a locally linear mapping between low-resolution and high-resolution volumetric image patches by employing random forest regression. We customize features suitable for volumetric MRI to train the random forest and propose a median tree ensemble for robust regression. VSRF outperforms state-of-the-art example-based super-resolution in term of image quality and efficiency for model training and inference in different MRI datasets. It is also superior to unsupervised methods with just a handful or even a single volume to assemble training data.

* Preprint submitted to ICIP 2018

Via

Access Paper or Ask Questions

Benchmarking Super-Resolution Algorithms on Real Data

Sep 08, 2017

Thomas Köhler, Michel Bätz, Farzad Naderi, André Kaup, Andreas K. Maier, Christian Riess

Abstract:Over the past decades, various super-resolution (SR) techniques have been developed to enhance the spatial resolution of digital images. Despite the great number of methodical contributions, there is still a lack of comparative validations of SR under practical conditions, as capturing real ground truth data is a challenging task. Therefore, current studies are either evaluated 1) on simulated data or 2) on real data without a pixel-wise ground truth. To facilitate comprehensive studies, this paper introduces the publicly available Super-Resolution Erlangen (SupER) database that includes real low-resolution images along with high-resolution ground truth data. Our database comprises image sequences with more than 20k images captured from 14 scenes under various types of motions and photometric conditions. The datasets cover four spatial resolution levels using camera hardware binning. With this database, we benchmark 15 single-image and multi-frame SR algorithms. Our experiments quantitatively analyze SR accuracy and robustness under realistic conditions including independent object and camera motion or photometric variations.

Via

Access Paper or Ask Questions