Abstract: Object counting and localization problems are commonly addressed with point-supervised learning, which allows the use of less labor-intensive point annotations. However, learning from point annotations poses challenges due to the high imbalance between the sets of annotated and unannotated pixels, which is often mitigated by Gaussian smoothing of the point annotations and by focal loss. Nevertheless, these approaches still focus on the pixels in the immediate vicinity of the point annotations and exploit the rest of the data only indirectly. In this work, we propose a novel approach termed CeDiRNet for point-supervised learning that uses a dense regression of directions pointing towards the nearest object centers, i.e., center-directions. This provides greater support for each center point, since it arises from many surrounding pixels pointing towards the object center. We propose a formulation of center-directions that allows the problem to be split into a domain-specific dense regression of center-directions and a final localization task based on a small, lightweight, and domain-agnostic localization network that can be trained with synthetic data completely independent of the target domain. We demonstrate the performance of the proposed method on six different datasets for object counting and localization, and show that it outperforms the existing state-of-the-art methods. The code is available on GitHub at https://github.com/vicoslab/CeDiRNet.git.
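The abstract describes a dense regression of per-pixel directions towards the nearest object center. Below is a minimal sketch of how such center-direction training targets could be derived from point annotations, assuming a (cos, sin) encoding of the direction angle; the exact formulation used by CeDiRNet may differ.

```python
# A minimal sketch of per-pixel center-direction training targets derived from
# point annotations; the exact encoding used by CeDiRNet may differ.
import numpy as np
from scipy.spatial import cKDTree

def center_direction_targets(points_xy, height, width):
    """For every pixel, compute the direction towards the nearest annotated
    object center, encoded as (cos, sin) of the direction angle."""
    ys, xs = np.mgrid[0:height, 0:width]
    pixels = np.stack([xs.ravel(), ys.ravel()], axis=1).astype(np.float32)

    # nearest annotated center for each pixel
    centers = np.asarray(points_xy, dtype=np.float32)
    _, idx = cKDTree(centers).query(pixels)
    nearest = centers[idx]

    # direction angle from each pixel to its nearest center
    dx = nearest[:, 0] - pixels[:, 0]
    dy = nearest[:, 1] - pixels[:, 1]
    angle = np.arctan2(dy, dx)

    cos_map = np.cos(angle).reshape(height, width)
    sin_map = np.sin(angle).reshape(height, width)
    return cos_map, sin_map  # dense regression targets for the network

# example: two annotated centers in a 480x640 image
cos_t, sin_t = center_direction_targets([(100, 200), (400, 350)], 480, 640)
```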
Abstract:Object grasping is a fundamental challenge in robotics and computer vision, critical for advancing robotic manipulation capabilities. Deformable objects, like fabrics and cloths, pose additional challenges due to their non-rigid nature. In this work, we introduce CeDiRNet-3DoF, a deep-learning model for grasp point detection, with a particular focus on cloth objects. CeDiRNet-3DoF employs center direction regression alongside a localization network, attaining first place in the perception task of ICRA 2023's Cloth Manipulation Challenge. Recognizing the lack of standardized benchmarks in the literature that hinder effective method comparison, we present the ViCoS Towel Dataset. This extensive benchmark dataset comprises 8,000 real and 12,000 synthetic images, serving as a robust resource for training and evaluating contemporary data-driven deep-learning approaches. Extensive evaluation revealed CeDiRNet-3DoF's robustness in real-world performance, outperforming state-of-the-art methods, including the latest transformer-based models. Our work bridges a crucial gap, offering a robust solution and benchmark for cloth grasping in computer vision and robotics. Code and dataset are available at: https://github.com/vicoslab/CeDiRNet-3DoF
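As a complement to the target-encoding sketch above, the following illustrates how a predicted center-direction field could, in principle, be decoded into candidate grasp-point locations by simple pixel voting. This voting scheme is a hypothetical simplification for illustration only; CeDiRNet-3DoF instead delegates this step to a small learned localization network.

```python
# Hypothetical decoding of a predicted center-direction field into candidate
# center locations via pixel voting (CeDiRNet-3DoF uses a learned localization
# network for this step instead).
import numpy as np

def vote_for_centers(cos_map, sin_map, step=1.0, n_steps=50):
    """Each pixel casts votes along its predicted direction; local maxima of
    the accumulated votes indicate likely grasp/object centers."""
    h, w = cos_map.shape
    acc = np.zeros((h, w), dtype=np.float32)
    ys, xs = np.mgrid[0:h, 0:w]
    for s in range(1, n_steps + 1):
        vx = np.clip(np.round(xs + s * step * cos_map), 0, w - 1).astype(int)
        vy = np.clip(np.round(ys + s * step * sin_map), 0, h - 1).astype(int)
        np.add.at(acc, (vy, vx), 1.0)
    return acc  # peaks of `acc` can be extracted with non-maximum suppression
```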
Abstract: Small-sized unmanned surface vehicles (USVs) are coastal-water devices with a broad range of applications such as environmental control and surveillance. A crucial capability for autonomous operation is obstacle detection for timely reaction and collision avoidance, which has recently been explored in the context of camera-based visual scene interpretation. Owing to curated datasets, substantial advances in scene interpretation have been made in the related field of unmanned ground vehicles. However, the current maritime datasets do not adequately capture the complexity of real-world USV scenes, and the evaluation protocols are not standardised, which makes cross-paper comparison of different methods difficult and hinders progress. To address these issues, we introduce a new obstacle detection benchmark, MODS, which considers two major perception tasks: maritime object detection and the more general maritime obstacle segmentation. We present a new, diverse maritime evaluation dataset containing approximately 81k stereo images synchronized with an on-board IMU, with over 60k annotated objects. We propose a new obstacle segmentation performance evaluation protocol that reflects the detection accuracy in a way meaningful for practical USV navigation. Seventeen recent state-of-the-art object detection and obstacle segmentation methods are evaluated using the proposed protocol, creating a benchmark to facilitate development of the field.
Abstract: We address the problem of optical decalibration in mobile stereo-camera setups, especially in the context of autonomous vehicles. In real-world conditions, an optical system is subject to various sources of anticipated and unanticipated mechanical stress (vibration, rough handling, collisions). Mechanical stress changes the geometry between the cameras that make up the stereo pair, and as a consequence, the pre-calculated epipolar geometry is no longer valid. Our method is based on optimization of camera geometry parameters and plugs directly into the output of the stereo matching algorithm. It is therefore able to recover calibration parameters on image pairs obtained from a decalibrated stereo system with minimal use of additional computing resources. The number of successfully recovered depth pixels is used as an objective function, which we aim to maximize. Our simulation confirms that the method can run constantly in parallel with stereo estimation and thus help keep the system calibrated in real time. Results confirm that the method is able to recalibrate all the parameters except for the baseline distance, which scales the absolute depth readings. However, that scaling factor could be uniquely determined using any absolute range-finding method (e.g., a single-beam time-of-flight sensor).
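The core idea of the abstract, treating the number of successfully recovered depth pixels as an objective and searching over camera geometry parameters to maximize it, can be illustrated with a minimal sketch. The pure-rotation parameterization, the OpenCV SGBM matcher, and the derivative-free optimizer below are illustrative assumptions, not the exact formulation from the paper.

```python
# A minimal sketch: maximize the number of valid-disparity pixels over a small
# rotation correction of the right camera. Assumes 8-bit grayscale inputs and a
# pure-rotation decalibration model (an illustrative simplification).
import cv2
import numpy as np
from scipy.optimize import minimize

def valid_disparity_count(rot_vec, left_gray, right_gray, K):
    """Warp the right image by a candidate rotation and count valid disparities."""
    R, _ = cv2.Rodrigues(np.asarray(rot_vec, dtype=np.float64))
    H = K @ R @ np.linalg.inv(K)            # homography induced by a pure rotation
    h, w = right_gray.shape
    right_warped = cv2.warpPerspective(right_gray, H, (w, h))

    matcher = cv2.StereoSGBM_create(minDisparity=0, numDisparities=128, blockSize=5)
    disparity = matcher.compute(left_gray, right_warped)
    return int((disparity > 0).sum())       # pixels with a recovered depth

def recalibrate(left_gray, right_gray, K):
    # maximizing the valid-pixel count = minimizing its negative (derivative-free)
    result = minimize(lambda p: -valid_disparity_count(p, left_gray, right_gray, K),
                      x0=np.zeros(3), method="Nelder-Mead")
    return result.x  # estimated rotation correction (Rodrigues vector)
```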