Matthias Rapp

OSSA: Unsupervised One-Shot Style Adaptation

Oct 01, 2024

Robin Gerster, Holger Caesar, Matthias Rapp, Alexander Wolpert, Michael Teutsch

Abstract: Despite their success in various vision tasks, deep neural network architectures often underperform in out-of-distribution scenarios due to the difference between training and target domain style. To address this limitation, we introduce One-Shot Style Adaptation (OSSA), a novel unsupervised domain adaptation method for object detection that utilizes a single, unlabeled target image to approximate the target domain style. Specifically, OSSA generates diverse target styles by perturbing the style statistics derived from a single target image and then applies these styles to a labeled source dataset at the feature level using Adaptive Instance Normalization (AdaIN). Extensive experiments show that OSSA establishes a new state-of-the-art among one-shot domain adaptation methods by a significant margin, and in some cases, even outperforms strong baselines that use thousands of unlabeled target images. By applying OSSA in various scenarios, including weather, simulated-to-real (sim2real), and visual-to-thermal adaptations, our study explores the overarching significance of the style gap in these contexts. OSSA's simplicity and efficiency allow easy integration into existing frameworks, providing a potentially viable solution for practical applications with limited data availability. Code is available at https://github.com/RobinGerster7/OSSA
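
The style-transfer step described in the abstract can be sketched roughly as follows. This is a toy illustration in plain Python, not the authors' implementation: feature maps are lists of channels with flat activation lists, and the helper names (`channel_stats`, `perturb_stats`, `adain`) and the multiplicative Gaussian perturbation are assumptions made for this example. Only the AdaIN formula itself, which rescales normalized source features to the target mean and standard deviation per channel, is standard.

```python
import random
from statistics import fmean, pstdev

EPS = 1e-5  # keeps the division stable for near-constant channels


def channel_stats(features):
    """Return (mean, std) per channel; each channel is a flat list of activations."""
    return [(fmean(channel), pstdev(channel)) for channel in features]


def perturb_stats(stats, scale, rng):
    """Jitter style statistics with multiplicative Gaussian noise.

    The noise model is an assumption for illustration; the abstract only
    says that the statistics are perturbed.
    """
    return [
        (mu * (1.0 + rng.gauss(0.0, scale)), abs(sd * (1.0 + rng.gauss(0.0, scale))))
        for mu, sd in stats
    ]


def adain(source, style_stats):
    """AdaIN: normalize each source channel, then apply the style mean and std."""
    stylized = []
    for channel, (t_mu, t_sd) in zip(source, style_stats):
        s_mu, s_sd = fmean(channel), pstdev(channel)
        stylized.append([t_sd * (v - s_mu) / (s_sd + EPS) + t_mu for v in channel])
    return stylized


rng = random.Random(0)
# Hypothetical 3-channel feature maps of a labeled source image and the single target image.
source = [[rng.gauss(0.0, 1.0) for _ in range(16)] for _ in range(3)]
target = [[rng.gauss(2.0, 0.5) for _ in range(16)] for _ in range(3)]

style = perturb_stats(channel_stats(target), 0.1, rng)  # one sampled target style
stylized = adain(source, style)
```

Calling `perturb_stats` repeatedly on the same target statistics yields a different style each time, which is one way to obtain diverse styles from a single image.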

Efficient Dictionary Learning with Sparseness-Enforcing Projections

Apr 16, 2016

Markus Thom, Matthias Rapp, Günther Palm

Abstract: Learning dictionaries suitable for sparse coding instead of using engineered bases has proven effective in a variety of image processing tasks. This paper studies the optimization of dictionaries on image data where the representation is enforced to be explicitly sparse with respect to a smooth, normalized sparseness measure. This involves the computation of Euclidean projections onto level sets of the sparseness measure. While previous algorithms for this optimization problem had at least quasi-linear time complexity, here the first algorithm with linear time complexity and constant space complexity is proposed. The key for this is the mathematically rigorous derivation of a characterization of the projection's result based on a soft-shrinkage function. This theory is applied in an original algorithm called Easy Dictionary Learning (EZDL), which learns dictionaries with a simple and fast-to-compute Hebbian-like learning rule. The new algorithm is efficient, expressive and particularly simple to implement. It is demonstrated that despite its simplicity, the proposed learning algorithm is able to generate a rich variety of dictionaries, in particular a topographic organization of atoms or separable atoms. Further, the dictionaries are as expressive as those of benchmark learning algorithms in terms of the reproduction quality on entire images, and result in an equivalent denoising performance. EZDL learns approximately 30% faster than the already very efficient Online Dictionary Learning algorithm, and is therefore eligible for rapid data set analysis and problems with vast quantities of learning samples.

* International Journal of Computer Vision, vol. 114, no. 2, pp. 168-194, 2015
* The final publication is available at Springer via http://dx.doi.org/10.1007/s11263-015-0799-8
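
The link between soft-shrinkage and sparseness mentioned in the abstract can be illustrated with a small sketch. It assumes the sparseness measure is Hoyer's normalized L1/L2 ratio, which the abstract does not name, and it searches for the shrinkage offset by bisection. That is deliberately not the paper's linear-time algorithm, and the function names are hypothetical. The input is assumed to be a nonzero vector with at least two entries.

```python
import math


def hoyer_sparseness(x):
    """Normalized sparseness in [0, 1]: 0 for equal magnitudes, 1 for a single nonzero entry."""
    n = len(x)
    l1 = sum(abs(v) for v in x)
    l2 = math.sqrt(sum(v * v for v in x))
    if l2 == 0.0:
        return 1.0  # convention chosen here for the all-zero vector
    return (math.sqrt(n) - l1 / l2) / (math.sqrt(n) - 1.0)


def soft_shrink(x, alpha):
    """Move every entry toward zero by alpha, clamping at zero and keeping the sign."""
    return [math.copysign(max(abs(v) - alpha, 0.0), v) for v in x]


def sparsify(x, target, iterations=60):
    """Find a shrinkage offset by bisection so the result reaches the target sparseness.

    Bisection works because the sparseness of the shrunk vector does not
    decrease as alpha grows. The result is scaled to unit Euclidean norm.
    """
    lo, hi = 0.0, max(abs(v) for v in x)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if hoyer_sparseness(soft_shrink(x, mid)) < target:
            lo = mid
        else:
            hi = mid
    shrunk = soft_shrink(x, lo)
    norm = math.sqrt(sum(v * v for v in shrunk))
    return [v / norm for v in shrunk]


x = [0.5, -1.0, 2.0, 0.1, -0.3, 1.5]
sparse_x = sparsify(x, target=0.8)
```

In this example the small entries are shrunk to zero while the two largest keep their signs, so the result is sparser than `x` but still points in a similar direction.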
