Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hongjian You

Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks

Aug 14, 2024

Liting Jiang, Feng Wang, Wenyi Zhang, Peifeng Li, Hongjian You, Yuming Xiang

Figure 1 for Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks

Figure 2 for Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks

Figure 3 for Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks

Figure 4 for Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks

Abstract:Stereo matching, a critical step of 3D reconstruction, has fully shifted towards deep learning due to its strong feature representation of remote sensing images. However, ground truth for stereo matching task relies on expensive airborne LiDAR data, thus making it difficult to obtain enough samples for supervised learning. To improve the generalization ability of stereo matching networks on cross-domain data from different sensors and scenarios, in this paper, we dedicate to study key training factors from three perspectives. (1) For the selection of training dataset, it is important to select data with similar regional target distribution as the test set instead of utilizing data from the same sensor. (2) For model structure, cascaded structure that flexibly adapts to different sizes of features is preferred. (3) For training manner, unsupervised methods generalize better than supervised methods, and we design an unsupervised early-stop strategy to help retain the best model with pre-trained weights as the basis. Extensive experiments are conducted to support the previous findings, on the basis of which we present an unsupervised stereo matching network with good generalization performance. We release the source code and the datasets at https://github.com/Elenairene/RKF_RSSM to reproduce the results and encourage future work.

* submitted to IEEE jstars

Via

Access Paper or Ask Questions

Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction

Aug 14, 2024

Liting Jiang, Yuming Xiang, Feng Wang, Hongjian You

Figure 1 for Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction

Figure 2 for Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction

Figure 3 for Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction

Figure 4 for Unsupervised Stereo Matching Network For VHR Remote Sensing Images Based On Error Prediction

Abstract:Stereo matching in remote sensing has recently garnered increased attention, primarily focusing on supervised learning. However, datasets with ground truth generated by expensive airbone Lidar exhibit limited quantity and diversity, constraining the effectiveness of supervised networks. In contrast, unsupervised learning methods can leverage the increasing availability of very-high-resolution (VHR) remote sensing images, offering considerable potential in the realm of stereo matching. Motivated by this intuition, we propose a novel unsupervised stereo matching network for VHR remote sensing images. A light-weight module to bridge confidence with predicted error is introduced to refine the core model. Robust unsupervised losses are formulated to enhance network convergence. The experimental results on US3D and WHU-Stereo datasets demonstrate that the proposed network achieves superior accuracy compared to other unsupervised networks and exhibits better generalization capabilities than supervised models. Our code will be available at https://github.com/Elenairene/CBEM.

* Accepted to International Geoscience and Remote Sensing Symposium (IGARSS), 2024

Via

Access Paper or Ask Questions