Abstract:Data embeddings from CLIP and ImageBind provide powerful features for the analysis of multimedia and multimodal data. We assess their performance for classification using a Gaussian Mixture Model (GMM) based layer as an alternative to the standard softmax layer. GMM-based classifiers have recently been shown to perform well as part of deep learning pipelines trained end-to-end. Our first contribution is to investigate GMM-based classification performance in the CLIP and ImageBind embedding spaces. Our second contribution is a GMM-based classifier with a lower parameter count than previously proposed alternatives. We find that, in most cases on the tested embedding spaces, a single Gaussian component per class is enough, and we hypothesize that this is due to the contrastive loss used to train these embeddings, which naturally concentrates the features of each class. We also observe that ImageBind often outperforms CLIP for classification of image datasets, even when the embedding spaces are compressed using PCA.
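A minimal sketch of this kind of classifier follows, assuming precomputed embeddings; the diagonal covariance, single component per class, and optional PCA compression are illustrative choices, not necessarily the paper's exact low-parameter classifier.

```python
# Hedged sketch: fit one GMM per class on precomputed CLIP/ImageBind embeddings and
# classify by maximum class-conditional log-likelihood plus log prior.
# `train_emb`, `train_labels`, `test_emb` are assumed NumPy arrays produced elsewhere.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.mixture import GaussianMixture

def fit_gmm_classifier(train_emb, train_labels, n_components=1, pca_dim=None):
    pca = PCA(n_components=pca_dim).fit(train_emb) if pca_dim else None
    feats = pca.transform(train_emb) if pca else train_emb
    classes = np.unique(train_labels)
    gmms, log_priors = {}, {}
    for c in classes:
        x = feats[train_labels == c]
        gmms[c] = GaussianMixture(n_components=n_components,
                                  covariance_type="diag").fit(x)
        log_priors[c] = np.log(len(x) / len(feats))
    return pca, classes, gmms, log_priors

def predict(model, emb):
    pca, classes, gmms, log_priors = model
    feats = pca.transform(emb) if pca else emb
    scores = np.stack([gmms[c].score_samples(feats) + log_priors[c] for c in classes], axis=1)
    return classes[np.argmax(scores, axis=1)]
```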
Abstract:We propose a pipeline for combined multi-class object geolocation and height estimation from street-level RGB imagery, treated as the single available input modality. Our solution is formulated as a Markov Random Field optimization with deterministic output. The proposed technique uses image metadata together with the image-plane coordinates of objects detected by a custom-trained Convolutional Neural Network. Computing object height with our methodology, in addition to object geolocation, has a negligible effect on the overall computational cost. Accuracy is demonstrated experimentally for water drains and road signs, for which we achieve an average elevation estimation error below 20 cm.
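The geometric core of such a setup can be sketched as follows; this is a hedged illustration only, the MRF optimisation is not reproduced, and the pinhole camera model, horizontal field of view, and two-view triangulation are simplifying assumptions rather than the paper's method.

```python
# Sketch: convert a detection's pixel coordinates plus camera heading into a bearing and
# elevation angle, then intersect bearings from two camera positions (local metric frame)
# to get the object location and a height estimate relative to the camera.
import numpy as np

def pixel_to_bearing(u, v, img_w, img_h, heading_deg, hfov_deg=90.0):
    """Return (bearing, elevation) in radians for a detection at pixel (u, v)."""
    f = (img_w / 2) / np.tan(np.radians(hfov_deg / 2))    # focal length in pixels (assumed)
    bearing = np.radians(heading_deg) + np.arctan2(u - img_w / 2, f)
    elevation = np.arctan2(img_h / 2 - v, f)              # positive above the optical axis
    return bearing, elevation

def triangulate(cam1_xy, obs1, cam2_xy, obs2):
    """Intersect two ground-plane rays; return object (x, y) and height above camera 1."""
    (b1, e1), (b2, _) = obs1, obs2
    d1 = np.array([np.sin(b1), np.cos(b1)])               # unit direction, east/north
    d2 = np.array([np.sin(b2), np.cos(b2)])
    A = np.column_stack([d1, -d2])
    rhs = np.asarray(cam2_xy, dtype=float) - np.asarray(cam1_xy, dtype=float)
    t = np.linalg.lstsq(A, rhs, rcond=None)[0]
    xy = np.asarray(cam1_xy, dtype=float) + t[0] * d1
    height = np.linalg.norm(xy - cam1_xy) * np.tan(e1)    # height relative to camera 1
    return xy, height
```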
Abstract:Deep learning based pipelines for semantic segmentation often ignore the structural information available in the annotated images used for training. We propose a novel post-processing module that enforces structural knowledge about the objects of interest to improve the segmentation results provided by deep learning. This module corresponds to a "many-to-one-or-none" inexact graph matching approach and is formulated as a quadratic assignment problem. Our approach is compared to CNN-based segmentation (for various CNN backbones) on two public datasets: one for face segmentation from 2D RGB images (FASSEG) and one for brain segmentation from 3D MRIs (IBSR). Evaluations are performed using two types of structural information (distances and directional relations, this choice being a hyperparameter of our generic framework). On FASSEG data, results show that our module improves the accuracy of the CNN by about 6.3% (the Hausdorff distance decreases from 22.11 to 20.71). On IBSR data, the improvement is about 51% (the Hausdorff distance decreases from 11.01 to 5.4). In addition, our approach is shown to be resilient to the small training datasets that often limit the performance of deep learning methods: the improvement increases as the size of the training dataset decreases.
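A toy sketch of the graph-matching step is given below; the "many-to-one-or-none" relaxation is not shown (equal-sized graphs only), and using pairwise centroid distances as the structural relation is an illustrative assumption.

```python
# Sketch: match a model graph of annotated regions to a graph of segmented regions by
# (approximately) solving a quadratic assignment problem over their distance structures.
import numpy as np
from scipy.optimize import quadratic_assignment
from scipy.spatial.distance import cdist

def match_regions(model_centroids, segmented_centroids):
    """Assign segmented regions to model regions by maximising structural agreement."""
    A = cdist(model_centroids, model_centroids)          # pairwise distances in the model graph
    B = cdist(segmented_centroids, segmented_centroids)  # pairwise distances in the segmentation
    res = quadratic_assignment(A, B, options={"maximize": True})  # approximate QAP solver
    return res.col_ind                                   # model node i -> segmented region col_ind[i]

# Hypothetical toy centroids:
model = np.array([[0.0, 0.0], [2.0, 0.0], [0.0, 1.0]])
seg = np.array([[0.05, 1.02], [0.1, -0.03], [1.9, 0.05]])
print(match_regions(model, seg))    # permutation mapping model nodes to segmented regions
```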
Abstract:We propose to compute classification estimates directly by using PCA to learn features encoded together with their class scores. The resulting model has an encoder-decoder structure suited to supervised learning; it is computationally efficient and performs well for classification on several datasets.
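One hedged reading of this idea is sketched below: a joint PCA basis is learned on features concatenated with one-hot class scores, the latent code of a test sample is inferred from its feature block alone, and the class-score block is decoded. The number of components and label scaling are assumptions, not the paper's exact settings.

```python
# Sketch of a PCA encoder-decoder classifier on concatenated [features | class scores].
import numpy as np

class PCAClassifier:
    def fit(self, X, y, n_components=32, label_scale=1.0):
        Y = np.eye(y.max() + 1)[y] * label_scale              # one-hot class scores
        Z = np.hstack([X, Y])
        self.mean_ = Z.mean(axis=0)
        _, _, Vt = np.linalg.svd(Z - self.mean_, full_matrices=False)
        self.W_ = Vt[:n_components].T                         # joint PCA basis (d + K, c)
        self.d_ = X.shape[1]
        return self

    def predict(self, X):
        Wx, Wy = self.W_[:self.d_], self.W_[self.d_:]         # feature / class-score blocks
        mx, my = self.mean_[:self.d_], self.mean_[self.d_:]
        codes = np.linalg.lstsq(Wx, (X - mx).T, rcond=None)[0]  # encode from features only
        scores = Wy @ codes + my[:, None]                     # decode class scores
        return scores.argmax(axis=0)
```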
Abstract:Accurate retinal vessel segmentation is an important task for many computer-aided diagnosis systems. Yet it remains a challenging problem due to the complex vessel structures of the eye. Numerous vessel segmentation methods have been proposed recently; however, more research is needed to address the poor segmentation of thin and tiny vessels. To this end, we propose a new deep learning pipeline combining the efficiency of residual dense net blocks with residual squeeze-and-excitation blocks. We validate our approach experimentally on three datasets and show that our pipeline outperforms current state-of-the-art techniques on the sensitivity metric, which is relevant for assessing the capture of small vessels.
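One of the named building blocks can be sketched in PyTorch as follows; the channel count and reduction ratio are assumptions, and the full pipeline with residual dense net blocks is not reproduced here.

```python
# Sketch of a residual squeeze-and-excitation block: two convolutions, channel re-weighting
# via global pooling and a small gating MLP, and a residual connection.
import torch
import torch.nn as nn

class ResidualSEBlock(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1), nn.BatchNorm2d(channels), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1), nn.BatchNorm2d(channels))
        self.se = nn.Sequential(                    # squeeze (global pool) and excitation (gating)
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1), nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1), nn.Sigmoid())

    def forward(self, x):
        y = self.conv(x)
        y = y * self.se(y)            # re-weight channels
        return torch.relu(y + x)      # residual connection
```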
Abstract:In this paper we propose an approach to semantic segmentation of 3D point cloud data that imports geographic information from a 2D GIS layer (OpenStreetMap). The proposed automatic procedure identifies meaningful units such as buildings and adjusts their locations to achieve the best fit between the GIS polygonal perimeters and the point cloud. Our processing pipeline is presented and illustrated by segmenting point cloud data of the Trinity College Dublin (Ireland) campus, constructed from optical imagery collected by a drone.
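A simplified sketch of the alignment step is shown below: a brute-force 2D offset search rather than the paper's actual procedure, with a hypothetical footprint and synthetic points standing in for the OSM layer and the drone-derived point cloud.

```python
# Sketch: translate a GIS building footprint so that it covers as many projected
# point-cloud points as possible (exhaustive search over a small offset grid).
import numpy as np
from shapely.geometry import Point, Polygon
from shapely.affinity import translate

def best_offset(footprint, points_xy, search=2.0, step=0.25):
    """Return the (dx, dy) translation of the polygon that maximises covered points."""
    offsets = np.arange(-search, search + step, step)
    best, best_count = (0.0, 0.0), -1
    for dx in offsets:
        for dy in offsets:
            poly = translate(footprint, xoff=dx, yoff=dy)
            count = sum(poly.contains(Point(x, y)) for x, y in points_xy)
            if count > best_count:
                best, best_count = (dx, dy), count
    return best

rng = np.random.default_rng(0)
footprint = Polygon([(0, 0), (10, 0), (10, 6), (0, 6)])       # OSM footprint, local metres
points_xy = rng.random((500, 2)) * [10, 6] + [1.3, -0.7]      # points offset by (1.3, -0.7)
print(best_offset(footprint, points_xy))                      # roughly (1.25, -0.75)
```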
Abstract:Localization of street objects from images has received considerable attention in recent years. We propose an approach that improves asset geolocation from street-view imagery by enhancing the quality of the metadata associated with the images using Structure from Motion. The predicted object geolocation is further refined by imposing contextual geographic information extracted from OpenStreetMap. Our pipeline is validated experimentally against state-of-the-art approaches for geotagging traffic lights.
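One hypothetical form of the OpenStreetMap-based refinement is sketched below (snapping a prediction to nearby road geometry); the paper's actual contextual rules and the Structure-from-Motion stage are not shown, and the road geometries here are made up.

```python
# Sketch: refine a predicted asset geolocation by snapping it to the nearest point on
# nearby OSM road geometry, assuming the asset lies on or beside the road network.
from shapely.geometry import LineString, Point
from shapely.ops import nearest_points

roads = [LineString([(0, 0), (100, 0)]),              # hypothetical OSM ways, local metres
         LineString([(50, -40), (50, 40)])]

def refine(prediction_xy, roads, max_snap=10.0):
    pred = Point(prediction_xy)
    candidates = [nearest_points(road, pred)[0] for road in roads]
    best = min(candidates, key=pred.distance)
    return (best.x, best.y) if pred.distance(best) <= max_snap else prediction_xy

print(refine((52.0, 3.5), roads))   # snapped onto the vertical road at (50.0, 3.5)
```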
Abstract:We propose a new $\mathcal{L}_2$-distance based method that maps one $N$-dimensional distribution onto another, taking into account available information about correspondences. We solve the high-dimensional problem in 1D space using an iterative projection approach. To demonstrate the potential of this mapping, we apply it to colour transfer between two images that exhibit overlapping scenes. Experiments show quantitatively and qualitatively competitive results compared with state-of-the-art colour transfer methods.
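The iterative 1D-projection idea can be sketched generically as below; this is a sliced-style transfer in the spirit of the abstract, not the exact $\mathcal{L}_2$ formulation with correspondences. For colour transfer, `source` and `target` would be (num_pixels, 3) RGB arrays.

```python
# Sketch: repeatedly project source and target samples onto random directions and match
# their sorted 1D distributions, shifting the source along each direction.
import numpy as np

def iterative_projection_transfer(source, target, n_iter=50, rng=np.random.default_rng(0)):
    """Map `source` samples (n, N) towards the distribution of `target` samples (m, N)."""
    x = source.astype(float).copy()
    for _ in range(n_iter):
        d = rng.standard_normal(x.shape[1])
        d /= np.linalg.norm(d)                       # random unit projection direction
        px, pt = x @ d, target @ d                   # 1D projections of both point sets
        # 1D transport: move each projected source sample to the matching target quantile
        matched = np.interp(np.argsort(np.argsort(px)) / (len(px) - 1),
                            np.linspace(0, 1, len(pt)), np.sort(pt))
        x += np.outer(matched - px, d)               # shift along the projection direction
    return x
```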
Abstract:We show how parameter redundancy in Convolutional Neural Network (CNN) filters can be effectively reduced by pruning in the spectral domain. Specifically, the representation extracted via the Discrete Cosine Transform (DCT) is more amenable to pruning than the original weight space. By combining weight tensor reshaping and reordering, we achieve high levels of layer compression with only minor accuracy loss. Our approach is applied to compress pretrained CNNs, and we show that minor additional fine-tuning allows our method to recover the original model performance after a significant parameter reduction. We validate our approach on the ResNet-50 and MobileNet-V2 architectures for the ImageNet classification task.
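A simplified sketch of spectral pruning follows; the specific reshaping, the absence of reordering, and the 90% sparsity level are illustrative assumptions rather than the paper's scheme.

```python
# Sketch: reshape a convolution weight tensor, take its DCT, zero the smallest-magnitude
# spectral coefficients, and transform back.
import numpy as np
from scipy.fft import dct, idct

def prune_in_dct_domain(weights, sparsity=0.9):
    """weights: conv kernel of shape (out_ch, in_ch, k, k); returns a pruned copy."""
    w2d = weights.reshape(weights.shape[0], -1)               # one row per output filter
    coeffs = dct(w2d, axis=1, norm="ortho")                   # spectral representation
    threshold = np.quantile(np.abs(coeffs), sparsity)
    coeffs[np.abs(coeffs) < threshold] = 0.0                  # prune small spectral coefficients
    return idct(coeffs, axis=1, norm="ortho").reshape(weights.shape)

w = np.random.randn(64, 32, 3, 3)                             # stand-in for a pretrained layer
w_pruned = prune_in_dct_domain(w, sparsity=0.9)
```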
Abstract:We propose a new Nadaraya-Watson based method that maps one N-dimensional distribution to another, taking into account available information about correspondences. We extend the 2D/3D problem to higher dimensions by encoding overlapping neighborhoods of data points, and we solve the high-dimensional problem in 1D space using an iterative projection approach. To demonstrate the potential of this mapping, we apply it to colour transfer between two images that exhibit an overlapping scene. Experiments show quantitative and qualitative improvements over previous state-of-the-art colour transfer methods.
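A minimal sketch of a Nadaraya-Watson mapping from correspondences is given below; the Gaussian kernel, bandwidth, and toy colour data are assumptions, and the paper's neighbourhood encoding and iterative 1D projection scheme are not shown.

```python
# Sketch: given matched samples from the overlapping region, map any new sample by
# Nadaraya-Watson kernel regression (a kernel-weighted average of the target samples).
import numpy as np

def nadaraya_watson_map(src_corr, tgt_corr, query, bandwidth=0.1):
    """src_corr, tgt_corr: (n, N) corresponding samples; query: (m, N) points to map."""
    d2 = ((query[:, None, :] - src_corr[None, :, :]) ** 2).sum(-1)   # squared distances (m, n)
    w = np.exp(-d2 / (2 * bandwidth ** 2))                           # Gaussian kernel weights
    w /= w.sum(axis=1, keepdims=True) + 1e-12
    return w @ tgt_corr                                              # weighted average of targets

# Colour-transfer style usage with hypothetical RGB values in [0, 1]:
src_corr = np.random.rand(200, 3)                 # colours from image A in the overlap
tgt_corr = np.clip(src_corr * 0.8 + 0.1, 0, 1)    # matching colours from image B
print(nadaraya_watson_map(src_corr, tgt_corr, np.array([[0.5, 0.5, 0.5]])))
```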