Abstract: The use of multiple camera technologies in a combined multimodal monitoring system for plant phenotyping offers promising benefits. Compared to configurations that utilize only a single camera technology, such systems can record cross-modal patterns that allow a more comprehensive assessment of plant phenotypes. However, the effective utilization of cross-modal patterns depends on precise image registration to achieve pixel-accurate alignment, a challenge often complicated by the parallax and occlusion effects inherent in plant canopy imaging. In this study, we propose a novel multimodal 3D image registration method that addresses these challenges by integrating depth information from a time-of-flight camera into the registration process. By leveraging depth data, our method mitigates parallax effects and thus facilitates more accurate pixel alignment across camera modalities. Additionally, we introduce an automated mechanism to identify and differentiate different types of occlusions, thereby minimizing the introduction of registration errors. To evaluate the efficacy of our approach, we conduct experiments on a diverse image dataset comprising six distinct plant species with varying leaf geometries. Our results demonstrate the robustness of the proposed registration algorithm, showcasing its ability to achieve accurate alignment across different plant types and camera configurations. Unlike previous methods, it does not rely on detecting plant-specific image features and can therefore be utilized for a wide variety of applications in plant sciences. The registration approach scales in principle to an arbitrary number of cameras with different resolutions and wavelengths. Overall, our study contributes to advancing the field of plant phenotyping by offering a robust and reliable solution for multimodal image registration.
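To make the core idea concrete, the following is a minimal sketch (not the authors' implementation) of depth-assisted pixel reprojection between two calibrated cameras: each source pixel is back-projected to 3D using the time-of-flight depth and then projected into the second camera's image plane, so the mapping accounts for parallax. All names, signatures, and the validity check are illustrative assumptions.

```python
# Hypothetical sketch of depth-based reprojection between two cameras,
# assuming known intrinsics K_src, K_dst and a rigid transform [R | t]
# from the source (ToF) camera to the destination camera.
import numpy as np

def reproject_with_depth(depth, K_src, K_dst, R, t):
    """Map each source pixel with a valid depth into the destination image.

    depth        : (H, W) array of metric depths from the ToF camera.
    K_src, K_dst : 3x3 intrinsic matrices of source and destination cameras.
    R, t         : rotation (3x3) and translation (3,), source -> destination.
    Returns an (H, W, 2) array of destination pixel coordinates (NaN if invalid).
    """
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(W), np.arange(H))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).T  # 3 x N

    # Back-project source pixels to 3D rays and scale by the measured depth;
    # this is the step that makes the alignment parallax-aware.
    rays = np.linalg.inv(K_src) @ pix          # 3 x N
    pts3d = rays * depth.reshape(1, -1)        # 3D points in the source frame

    # Transform into the destination camera frame and project to pixels.
    pts_dst = R @ pts3d + t.reshape(3, 1)
    proj = K_dst @ pts_dst
    z = proj[2]
    uv = np.full((2, pts3d.shape[1]), np.nan)
    valid = z > 1e-6                           # keep points in front of the camera
    uv[:, valid] = proj[:2, valid] / z[valid]
    return uv.T.reshape(H, W, 2)
```

In a fuller pipeline, one would additionally z-buffer the projected points so that a surface hidden behind a nearer leaf in the destination view is flagged as occluded rather than sampled, which is the kind of occlusion handling the abstract refers to.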
Abstract: We describe a proof of concept for annotating real estate images using simple iterative rule-based semi-supervised learning. The study yields important insights into the content characteristics and distinctiveness of individual image classes, as well as essential requirements for a practical implementation.
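The abstract does not spell out the loop, but iterative rule-based semi-supervised labeling typically follows a pattern like the sketch below: apply hand-written rules to unlabeled images, accept only unambiguous matches, and repeat until a fixed point. The rule representation, feature dictionaries, and acceptance criterion here are assumptions for illustration, not the paper's actual method.

```python
# Illustrative iterative rule-based pseudo-labeling loop (assumed structure,
# not the paper's implementation). Rules are predicates over feature dicts.
def iterative_rule_labeling(unlabeled, rules, max_rounds=5):
    """Assign a class label to an image whenever exactly one rule fires.

    unlabeled : list of per-image feature dicts.
    rules     : dict mapping class name -> predicate over a feature dict.
    Returns (labeled: dict index -> class, remaining: list of feature dicts).
    """
    labeled = {}
    for _ in range(max_rounds):
        newly = {}
        for i, feats in enumerate(unlabeled):
            if i in labeled:
                continue
            hits = [cls for cls, rule in rules.items() if rule(feats)]
            if len(hits) == 1:        # accept only unambiguous matches
                newly[i] = hits[0]
        if not newly:                 # fixed point reached: stop iterating
            break
        labeled.update(newly)
        # A fuller system would refine the rules here from the new labels.
    remaining = [f for i, f in enumerate(unlabeled) if i not in labeled]
    return labeled, remaining
```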
Abstract: The assessment and valuation of real estate requires large datasets with real estate information. Unfortunately, real estate databases are usually sparse in practice, i.e., not every important attribute is available for each property. In this paper, we study the potential of predicting high-level real estate attributes from visual data, specifically from two visual modalities: indoor (interior) and outdoor (facade) photos. We design three models using different multimodal fusion strategies and evaluate them for three different use cases. A particular challenge here is handling missing modalities. We evaluate different fusion strategies, present baselines for the different prediction tasks, and find that enriching the training data with additional incomplete samples can improve prediction accuracy. Furthermore, fusing information from indoor and outdoor photos yields a performance boost of up to 5% in Macro F1-score.
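One common way to handle a missing modality in this setting is late fusion over whichever branches are present, as in the sketch below: each modality gets its own encoder and classification head, and logits are averaged over the available inputs, so a sample with only a facade photo still produces a prediction. The encoder interfaces, head structure, and averaging strategy are assumptions for illustration, not the paper's actual models.

```python
# Hypothetical late-fusion classifier tolerating a missing modality
# (assumed design, not the paper's architecture).
import torch
import torch.nn as nn

class LateFusionClassifier(nn.Module):
    def __init__(self, indoor_encoder, outdoor_encoder, feat_dim, n_classes):
        super().__init__()
        self.indoor_encoder = indoor_encoder    # maps images -> (B, feat_dim)
        self.outdoor_encoder = outdoor_encoder
        self.head_in = nn.Linear(feat_dim, n_classes)
        self.head_out = nn.Linear(feat_dim, n_classes)

    def forward(self, indoor=None, outdoor=None):
        logits = []
        if indoor is not None:                  # indoor photos available
            logits.append(self.head_in(self.indoor_encoder(indoor)))
        if outdoor is not None:                 # facade photos available
            logits.append(self.head_out(self.outdoor_encoder(outdoor)))
        if not logits:
            raise ValueError("at least one modality is required")
        # Average logits over the available modalities; a sample with a
        # missing modality is still classified by the remaining branch.
        return torch.stack(logits).mean(dim=0)
```

A design like this also makes it straightforward to train on incomplete samples, which is consistent with the abstract's finding that adding incomplete training data can improve accuracy.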