Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dennis Ritter

CAD Models to Real-World Images: A Practical Approach to Unsupervised Domain Adaptation in Industrial Object Classification

Oct 07, 2023

Dennis Ritter, Mike Hemberger, Marc Hönig, Volker Stopp, Erik Rodner, Kristian Hildebrand

Abstract:In this paper, we systematically analyze unsupervised domain adaptation pipelines for object classification in a challenging industrial setting. In contrast to standard natural object benchmarks existing in the field, our results highlight the most important design choices when only category-labeled CAD models are available but classification needs to be done with real-world images. Our domain adaptation pipeline achieves SoTA performance on the VisDA benchmark, but more importantly, drastically improves recognition performance on our new open industrial dataset comprised of 102 mechanical parts. We conclude with a set of guidelines that are relevant for practitioners needing to apply state-of-the-art unsupervised domain adaptation in practice. Our code is available at https://github.com/dritter-bht/synthnet-transfer-learning.

* Presented at ECML-PKDD 2023 Workshop "Adapting to Change: Reliable Multimodal Learning Across Domains", Student Paper Award

Via

Access Paper or Ask Questions

Pose-Guided Sign Language Video GAN with Dynamic Lambda

May 06, 2021

Christopher Kissel, Christopher Kümmel, Dennis Ritter, Kristian Hildebrand

Figure 1 for Pose-Guided Sign Language Video GAN with Dynamic Lambda

Figure 2 for Pose-Guided Sign Language Video GAN with Dynamic Lambda

Figure 3 for Pose-Guided Sign Language Video GAN with Dynamic Lambda

Figure 4 for Pose-Guided Sign Language Video GAN with Dynamic Lambda

Abstract:We propose a novel approach for the synthesis of sign language videos using GANs. We extend the previous work of Stoll et al. by using the human semantic parser of the Soft-Gated Warping-GAN from to produce photorealistic videos guided by region-level spatial layouts. Synthesizing target poses improves performance on independent and contrasting signers. Therefore, we have evaluated our system with the highly heterogeneous MS-ASL dataset with over 200 signers resulting in a SSIM of 0.893. Furthermore, we introduce a periodic weighting approach to the generator that reactivates the training and leads to quantitatively better results.

Via

Access Paper or Ask Questions