Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Grzegorz Kurzejamski

ESC: Evolutionary Stitched Camera Calibration in the Wild

Apr 19, 2024

Grzegorz Rypeść, Grzegorz Kurzejamski

Abstract:This work introduces a novel end-to-end approach for estimating extrinsic parameters of cameras in multi-camera setups on real-life sports fields. We identify the source of significant calibration errors in multi-camera environments and address the limitations of existing calibration methods, particularly the disparity between theoretical models and actual sports field characteristics. We propose the Evolutionary Stitched Camera calibration (ESC) algorithm to bridge this gap. It consists of image segmentation followed by evolutionary optimization of a novel loss function, providing a unified and accurate multi-camera calibration solution with high visual fidelity. The outcome allows the creation of virtual stitched views from multiple video sources, being as important for practical applications as numerical accuracy. We demonstrate the superior performance of our approach compared to state-of-the-art methods across diverse real-life football fields with varying physical characteristics.

* Accepted for IEEE CEC 2024

Via

Access Paper or Ask Questions

Beyond Grids: Exploring Elastic Input Sampling for Vision Transformers

Sep 23, 2023

Adam Pardyl, Grzegorz Kurzejamski, Jan Olszewski, Tomasz Trzciński, Bartosz Zieliński

Abstract:Vision transformers have excelled in various computer vision tasks but mostly rely on rigid input sampling using a fixed-size grid of patches. This limits their applicability in real-world problems, such as in the field of robotics and UAVs, where one can utilize higher input elasticity to boost model performance and efficiency. Our paper addresses this limitation by formalizing the concept of input elasticity for vision transformers and introducing an evaluation protocol, including dedicated metrics for measuring input elasticity. Moreover, we propose modifications to the transformer architecture and training regime, which increase its elasticity. Through extensive experimentation, we spotlight opportunities and challenges associated with input sampling strategies.

Via

Access Paper or Ask Questions

Active Visual Exploration Based on Attention-Map Entropy

Mar 11, 2023

Adam Pardyl, Grzegorz Rypeść, Grzegorz Kurzejamski, Bartosz Zieliński, Tomasz Trzciński

Figure 1 for Active Visual Exploration Based on Attention-Map Entropy

Figure 2 for Active Visual Exploration Based on Attention-Map Entropy

Figure 3 for Active Visual Exploration Based on Attention-Map Entropy

Figure 4 for Active Visual Exploration Based on Attention-Map Entropy

Abstract:Active visual exploration addresses the issue of limited sensor capabilities in real-world scenarios, where successive observations are actively chosen based on the environment. To tackle this problem, we introduce a new technique called Attention-Map Entropy (AME). It leverages the internal uncertainty of the transformer-based model to determine the most informative observations. In contrast to existing solutions, it does not require additional loss components, which simplifies the training. Through experiments, which also mimic retina-like sensors, we show that such simplified training significantly improves the performance of reconstruction and classification on publicly available datasets.

Via

Access Paper or Ask Questions

Sports Camera Pose Refinement Using an Evolution Strategy

Nov 03, 2022

Grzegorz Rypeść, Grzegorz Kurzejamski, Jacek Komorowski

Abstract:This paper presents a robust end-to-end method for sports cameras extrinsic parameters optimization using a novel evolution strategy. First, we developed a neural network architecture for an edge or area-based segmentation of a sports field. Secondly, we implemented the evolution strategy, which purpose is to refine extrinsic camera parameters given a single, segmented sports field image. Experimental comparison with state-of-the-art camera pose refinement methods on real-world data demonstrates the superiority of the proposed algorithm. We also perform an ablation study and propose a way to generalize the method to additionally refine the intrinsic camera matrix.

* Conference paper at 2022 IEEE Congress on Evolutionary Computation (CEC)

Via

Access Paper or Ask Questions

Graph-Based Multi-Camera Soccer Player Tracker

Nov 03, 2022

Jacek Komorowski, Grzegorz Kurzejamski

Abstract:The paper presents a multi-camera tracking method intended for tracking soccer players in long shot video recordings from multiple calibrated cameras installed around the playing field. The large distance to the camera makes it difficult to visually distinguish individual players, which adversely affects the performance of traditional solutions relying on the appearance of tracked objects. Our method focuses on individual player dynamics and interactions between neighborhood players to improve tracking performance. To overcome the difficulty of reliably merging detections from multiple cameras in the presence of calibration errors, we propose the novel tracking approach, where the tracker operates directly on raw detection heat maps from multiple cameras. Our model is trained on a large synthetic dataset generated using Google Research Football Environment and fine-tuned using real-world data to reduce costs involved with ground truth preparation.

Via

Access Paper or Ask Questions

SuperNCN: Neighbourhood consensus network for robust outdoor scenes matching

Dec 10, 2019

Grzegorz Kurzejamski, Jacek Komorowski, Lukasz Dabala, Konrad Czarnota, Simon Lynen, Tomasz Trzcinski

Figure 1 for SuperNCN: Neighbourhood consensus network for robust outdoor scenes matching

Figure 2 for SuperNCN: Neighbourhood consensus network for robust outdoor scenes matching

Figure 3 for SuperNCN: Neighbourhood consensus network for robust outdoor scenes matching

Figure 4 for SuperNCN: Neighbourhood consensus network for robust outdoor scenes matching

Abstract:In this paper, we present a framework for computing dense keypoint correspondences between images under strong scene appearance changes. Traditional methods, based on nearest neighbour search in the feature descriptor space, perform poorly when environmental conditions vary, e.g. when images are taken at different times of the day or seasons. Our method improves finding keypoint correspondences in such difficult conditions. First, we use Neighbourhood Consensus Networks to build spatially consistent matching grid between two images at a coarse scale. Then, we apply Superpoint-like corner detector to achieve pixel-level accuracy. Both parts use features learned with domain adaptation to increase robustness against strong scene appearance variations. The framework has been tested on a RobotCar Seasons dataset, proving large improvement on pose estimation task under challenging environmental conditions.

Via

Access Paper or Ask Questions

FootAndBall: Integrated player and ball detector

Dec 10, 2019

Jacek Komorowski, Grzegorz Kurzejamski, Grzegorz Sarwas

Figure 1 for FootAndBall: Integrated player and ball detector

Figure 2 for FootAndBall: Integrated player and ball detector

Figure 3 for FootAndBall: Integrated player and ball detector

Figure 4 for FootAndBall: Integrated player and ball detector

Abstract:The paper describes a deep neural network-based detector dedicated for ball and players detection in high resolution, long shot, video recordings of soccer matches. The detector, dubbed FootAndBall, has an efficient fully convolutional architecture and can operate on input video stream with an arbitrary resolution. It produces ball confidence map encoding the position of the detected ball, player confidence map and player bounding boxes tensor encoding players' positions and bounding boxes. The network uses Feature Pyramid Network desing pattern, where lower level features with higher spatial resolution are combined with higher level features with bigger receptive field. This improves discriminability of small objects (the ball) as larger visual context around the object of interest is taken into account for the classification. Due to its specialized design, the network has two orders of magnitude less parameters than a generic deep neural network-based object detector, such as SSD or YOLO. This allows real-time processing of high resolution input video stream.

* arXiv admin note: text overlap with arXiv:1902.07304

Via

Access Paper or Ask Questions

DeepBall: Deep Neural-Network Ball Detector

Feb 19, 2019

Jacek Komorowski, Grzegorz Kurzejamski, Grzegorz Sarwas

Figure 1 for DeepBall: Deep Neural-Network Ball Detector

Figure 2 for DeepBall: Deep Neural-Network Ball Detector

Figure 3 for DeepBall: Deep Neural-Network Ball Detector

Figure 4 for DeepBall: Deep Neural-Network Ball Detector

Abstract:The paper describes a deep network based object detector specialized for ball detection in long shot videos. Due to its fully convolutional design, the method operates on images of any size and produces \emph{ball confidence map} encoding the position of detected ball. The network uses hypercolumn concept, where feature maps from different hierarchy levels of the deep convolutional network are combined and jointly fed to the convolutional classification layer. This allows boosting the detection accuracy as larger visual context around the object of interest is taken into account. The method achieves state-of-the-art results when tested on publicly available ISSIA-CNR Soccer Dataset.

* Conference: VISAPP 2019

Via

Access Paper or Ask Questions

SConE: Siamese Constellation Embedding Descriptor for Image Matching

Sep 28, 2018

Tomasz Trzcinski, Jacek Komorowski, Lukasz Dabala, Konrad Czarnota, Grzegorz Kurzejamski, Simon Lynen

Figure 1 for SConE: Siamese Constellation Embedding Descriptor for Image Matching

Figure 2 for SConE: Siamese Constellation Embedding Descriptor for Image Matching

Figure 3 for SConE: Siamese Constellation Embedding Descriptor for Image Matching

Figure 4 for SConE: Siamese Constellation Embedding Descriptor for Image Matching

Abstract:Numerous computer vision applications rely on local feature descriptors, such as SIFT, SURF or FREAK, for image matching. Although their local character makes image matching processes more robust to occlusions, it often leads to geometrically inconsistent keypoint matches that need to be filtered out, e.g. using RANSAC. In this paper we propose a novel, more discriminative, descriptor that includes not only local feature representation, but also information about the geometric layout of neighbouring keypoints. To that end, we use a Siamese architecture that learns a low-dimensional feature embedding of keypoint constellation by maximizing the distances between non-corresponding pairs of matched image patches, while minimizing it for correct matches. The 48-dimensional oating point descriptor that we train is built on top of the state-of-the-art FREAK descriptor achieves significant performance improvement over the competitors on a challenging TUM dataset.

Via

Access Paper or Ask Questions

Robust Method of Vote Aggregation and Proposition Verification for Invariant Local Features

Jan 05, 2016

Grzegorz Kurzejamski, Jacek Zawistowski, Grzegorz Sarwas

Figure 1 for Robust Method of Vote Aggregation and Proposition Verification for Invariant Local Features

Figure 2 for Robust Method of Vote Aggregation and Proposition Verification for Invariant Local Features

Figure 3 for Robust Method of Vote Aggregation and Proposition Verification for Invariant Local Features

Figure 4 for Robust Method of Vote Aggregation and Proposition Verification for Invariant Local Features

Abstract:This paper presents a method for analysis of the vote space created from the local features extraction process in a multi-detection system. The method is opposed to the classic clustering approach and gives a high level of control over the clusters composition for further verification steps. Proposed method comprises of the graphical vote space presentation, the proposition generation, the two-pass iterative vote aggregation and the cascade filters for verification of the propositions. Cascade filters contain all of the minor algorithms needed for effective object detection verification. The new approach does not have the drawbacks of the classic clustering approaches and gives a substantial control over process of detection. Method exhibits an exceptionally high detection rate in conjunction with a low false detection chance in comparison to alternative methods.

* 8 pages Short Paper, presented at VISAPP 2015 Conference in Berlin, March. Proceedings of the 10th International Conference on Computer Vision Theory and Applications, 252-259, 2015, Berlin, Germany, ISBN 978-989-758-090-1

Via

Access Paper or Ask Questions