Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andreas Espersen

Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation

Dec 30, 2024

Shubh Singhal, Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo

Figure 1 for Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation

Figure 2 for Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation

Figure 3 for Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation

Figure 4 for Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation

Abstract:Accurate segmentation of wind turbine blade (WTB) images is critical for effective assessments, as it directly influences the performance of automated damage detection systems. Despite advancements in large universal vision models, these models often underperform in domain-specific tasks like WTB segmentation. To address this, we extend Intrinsic LoRA for image segmentation, and propose a novel dual-space augmentation strategy that integrates both image-level and latent-space augmentations. The image-space augmentation is achieved through linear interpolation between image pairs, while the latent-space augmentation is accomplished by introducing a noise-based latent probabilistic model. Our approach significantly boosts segmentation accuracy, surpassing current state-of-the-art methods in WTB image segmentation.

* Authors Shubh Singhal and Ra\"ul P\'erez-Gonzalo contributed equally to this work. Accepted to ICASSP 2025

Via

Access Paper or Ask Questions

Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios

Jun 10, 2024

Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo

Figure 1 for Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios

Figure 2 for Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios

Figure 3 for Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios

Figure 4 for Generalized Nested Latent Variable Models for Lossy Coding applied to Wind Turbine Scenarios

Abstract:Rate-distortion optimization through neural networks has accomplished competitive results in compression efficiency and image quality. This learning-based approach seeks to minimize the compromise between compression rate and reconstructed image quality by automatically extracting and retaining crucial information, while discarding less critical details. A successful technique consists in introducing a deep hyperprior that operates within a 2-level nested latent variable model, enhancing compression by capturing complex data dependencies. This paper extends this concept by designing a generalized L-level nested generative model with a Markov chain structure. We demonstrate as L increases that a trainable prior is detrimental and explore a common dimensionality along the distinct latent variables to boost compression performance. As this structured framework can represent autoregressive coders, we outperform the hyperprior model and achieve state-of-the-art performance while reducing substantially the computational cost. Our experimental evaluation is performed on wind turbine scenarios to study its application on visual inspections

* Accepted to ICIP 2024

Via

Access Paper or Ask Questions

Robust Wind Turbine Blade Segmentation from RGB Images in the Wild

Jun 26, 2023

Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo

Abstract:With the relentless growth of the wind industry, there is an imperious need to design automatic data-driven solutions for wind turbine maintenance. As structural health monitoring mainly relies on visual inspections, the first stage in any automatic solution is to identify the blade region on the image. Thus, we propose a novel segmentation algorithm that strengthens the U-Net results by a tailored loss, which pools the focal loss with a contiguity regularization term. To attain top performing results, a set of additional steps are proposed to ensure a reliable, generic, robust and efficient algorithm. First, we leverage our prior knowledge on the images by filling the holes enclosed by temporarily-classified blade pixels and by the image boundaries. Subsequently, the mislead classified pixels are successfully amended by training an on-the-fly random forest. Our algorithm demonstrates its effectiveness reaching a non-trivial 97.39% of accuracy.

* Accepted to ICIP 2023

Via

Access Paper or Ask Questions