Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jenny Benois-Pineau

LaBRI

Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods

Jul 29, 2024

Hyeon Yu, Jenny Benois-Pineau, Romain Bourqui, Romain Giot, Alexey Zhukov

Abstract:This paper investigates the use of Mean Opinion Score (MOS), a common image quality metric, as a user-centric evaluation metric for XAI post-hoc explainers. To measure the MOS, a user experiment is proposed, which has been conducted with explanation maps of intentionally distorted images. Three methods from the family of feature attribution methods - Gradient-weighted Class Activation Mapping (Grad-CAM), Multi-Layered Feature Explanation Method (MLFEM), and Feature Explanation Method (FEM) - are compared with this metric. Additionally, the correlation of this new user-centric metric with automatic metrics is studied via Spearman's rank correlation coefficient. MOS of MLFEM shows the highest correlation with automatic metrics of Insertion Area Under Curve (IAUC) and Deletion Area Under Curve (DAUC). However, the overall correlations are limited, which highlights the lack of consensus between automatic and user-centric metrics.

* Supported by organization Laboratoire Bordelais de Recherche en Informatique, 15 pages, 4 figures, 3 tables

Via

Access Paper or Ask Questions

Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022

Jan 31, 2023

Pierre-Etienne Martin, Jordan Calandre, Boris Mansencal, Jenny Benois-Pineau, Renaud Péteri, Laurent Mascarilla, Julien Morlier

Figure 1 for Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022

Abstract:Sports video analysis is a widespread research topic. Its applications are very diverse, like events detection during a match, video summary, or fine-grained movement analysis of athletes. As part of the MediaEval 2022 benchmarking initiative, this task aims at detecting and classifying subtle movements from sport videos. We focus on recordings of table tennis matches. Conducted since 2019, this task provides a classification challenge from untrimmed videos recorded under natural conditions with known temporal boundaries for each stroke. Since 2021, the task also provides a stroke detection challenge from unannotated, untrimmed videos. This year, the training, validation, and test sets are enhanced to ensure that all strokes are represented in each dataset. The dataset is now similar to the one used in [1, 2]. This research is intended to build tools for coaches and athletes who want to further evaluate their sport performances.

* MediaEval 2022 Workshop, Jan 2023, Bergen, Norway. arXiv admin note: substantial text overlap with arXiv:2112.11384

Via

Access Paper or Ask Questions

Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021

Dec 16, 2021

Pierre-Etienne Martin, Jordan Calandre, Boris Mansencal, Jenny Benois-Pineau, Renaud Péteri, Laurent Mascarilla, Julien Morlier

Figure 1 for Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021

Abstract:Sports video analysis is a prevalent research topic due to the variety of application areas, ranging from multimedia intelligent devices with user-tailored digests up to analysis of athletes' performance. The Sports Video task is part of the MediaEval 2021 benchmark. This task tackles fine-grained action detection and classification from videos. The focus is on recordings of table tennis games. Running since 2019, the task has offered a classification challenge from untrimmed video recorded in natural conditions with known temporal boundaries for each stroke. This year, the dataset is extended and offers, in addition, a detection challenge from untrimmed videos without annotations. This work aims at creating tools for sports coaches and players in order to analyze sports performance. Movement analysis and player profiling may be built upon such technology to enrich the training experience of athletes and improve their performance.

* MediaEval 2021, Dec 2021, Online, Germany

Via

Access Paper or Ask Questions

Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis

Sep 29, 2021

Pierre-Etienne Martin, Jenny Benois-Pineau, Renaud Péteri, Julien Morlier

Figure 1 for Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis

Figure 2 for Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis

Figure 3 for Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis

Figure 4 for Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis

Abstract:This paper proposes a fusion method of modalities extracted from video through a three-stream network with spatio-temporal and temporal convolutions for fine-grained action classification in sport. It is applied to TTStroke-21 dataset which consists of untrimmed videos of table tennis games. The goal is to detect and classify table tennis strokes in the videos, the first step of a bigger scheme aiming at giving feedback to the players for improving their performance. The three modalities are raw RGB data, the computed optical flow and the estimated pose of the player. The network consists of three branches with attention blocks. Features are fused at the latest stage of the network using bilinear layers. Compared to previous approaches, the use of three modalities allows faster convergence and better performances on both tasks: classification of strokes with known temporal boundaries and joint segmentation and classification. The pose is also further investigated in order to offer richer feedback to the athletes.

* MMSports '21, October 20, 2021, Virtual Event,, Oct 2021, Chengdu, China

Via

Access Paper or Ask Questions

White Box Methods for Explanations of Convolutional Neural Networks in Image Classification Tasks

Apr 06, 2021

Meghna P Ayyar, Jenny Benois-Pineau, Akka Zemmari

Figure 1 for White Box Methods for Explanations of Convolutional Neural Networks in Image Classification Tasks

Figure 2 for White Box Methods for Explanations of Convolutional Neural Networks in Image Classification Tasks

Figure 3 for White Box Methods for Explanations of Convolutional Neural Networks in Image Classification Tasks

Figure 4 for White Box Methods for Explanations of Convolutional Neural Networks in Image Classification Tasks

Abstract:In recent years, deep learning has become prevalent to solve applications from multiple domains. Convolutional Neural Networks (CNNs) particularly have demonstrated state of the art performance for the task of image classification. However, the decisions made by these networks are not transparent and cannot be directly interpreted by a human. Several approaches have been proposed to explain to understand the reasoning behind a prediction made by a network. In this paper, we propose a topology of grouping these methods based on their assumptions and implementations. We focus primarily on white box methods that leverage the information of the internal architecture of a network to explain its decision. Given the task of image classification and a trained CNN, this work aims to provide a comprehensive and detailed overview of a set of methods that can be used to create explanation maps for a particular image, that assign an importance score to each pixel of the image based on its contribution to the decision of the network. We also propose a further classification of the white box methods based on their implementations to enable better comparisons and help researchers find methods best suited for different scenarios.

* Submitted to Journal of Electronic Imaging (JEI)

Via

Access Paper or Ask Questions

3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks

Nov 20, 2020

Pierre-Etienne Martin, Jenny Benois-Pineau, Renaud Péteri, Julien Morlier

Figure 1 for 3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks

Figure 2 for 3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks

Figure 3 for 3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks

Figure 4 for 3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks

Abstract:The paper addresses the problem of recognition of actions in video with low inter-class variability such as Table Tennis strokes. Two stream, "twin" convolutional neural networks are used with 3D convolutions both on RGB data and optical flow. Actions are recognized by classification of temporal windows. We introduce 3D attention modules and examine their impact on classification efficiency. In the context of the study of sportsmen performances, a corpus of the particular actions of table tennis strokes is considered. The use of attention blocks in the network speeds up the training step and improves the classification scores up to 5% with our twin model. We visualize the impact on the obtained features and notice correlation between attention and player movements and position. Score comparison of state-of-the-art action classification method and proposed approach with attentional blocks is performed on the corpus. Proposed model with attention blocks outperforms previous model without them and our baseline.

* 25th International Conference on Pattern Recognition (ICPR2020), Jan 2021, Milano, Italy

Via

Access Paper or Ask Questions

Move-to-Data: A new Continual Learning approach with Deep CNNs, Application for image-class recognition

Jun 12, 2020

Miltiadis Poursanidis, Jenny Benois-Pineau, Akka Zemmari, Boris Mansenca, Aymar de Rugy

Figure 1 for Move-to-Data: A new Continual Learning approach with Deep CNNs, Application for image-class recognition

Figure 2 for Move-to-Data: A new Continual Learning approach with Deep CNNs, Application for image-class recognition

Figure 3 for Move-to-Data: A new Continual Learning approach with Deep CNNs, Application for image-class recognition

Abstract:In many real-life tasks of application of supervised learning approaches, all the training data are not available at the same time. The examples are lifelong image classification or recognition of environmental objects during interaction of instrumented persons with their environment, enrichment of an online-database with more images. It is necessary to pre-train the model at a "training recording phase" and then adjust it to the new coming data. This is the task of incremental/continual learning approaches. Amongst different problems to be solved by these approaches such as introduction of new categories in the model, refining existing categories to sub-categories and extending trained classifiers over them, ... we focus on the problem of adjusting pre-trained model with new additional training data for existing categories. We propose a fast continual learning layer at the end of the neuronal network. Obtained results are illustrated on the opensource CIFAR benchmark dataset. The proposed scheme yields similar performances as retraining but with drastically lower computational cost.

Via

Access Paper or Ask Questions

3D Inception-based CNN with sMRI and MD-DTI data fusion for Alzheimer's Disease diagnostics

Jul 17, 2018

Alexander Khvostikov, Karim Aderghal, Andrey Krylov, Gwenaelle Catheline, Jenny Benois-Pineau

Figure 1 for 3D Inception-based CNN with sMRI and MD-DTI data fusion for Alzheimer's Disease diagnostics

Figure 2 for 3D Inception-based CNN with sMRI and MD-DTI data fusion for Alzheimer's Disease diagnostics

Figure 3 for 3D Inception-based CNN with sMRI and MD-DTI data fusion for Alzheimer's Disease diagnostics

Figure 4 for 3D Inception-based CNN with sMRI and MD-DTI data fusion for Alzheimer's Disease diagnostics

Abstract:In the last decade, computer-aided early diagnostics of Alzheimer's Disease (AD) and its prodromal form, Mild Cognitive Impairment (MCI), has been the subject of extensive research. Some recent studies have shown promising results in the AD and MCI determination using structural and functional Magnetic Resonance Imaging (sMRI, fMRI), Positron Emission Tomography (PET) and Diffusion Tensor Imaging (DTI) modalities. Furthermore, fusion of imaging modalities in a supervised machine learning framework has shown promising direction of research. In this paper we first review major trends in automatic classification methods such as feature extraction based methods as well as deep learning approaches in medical image analysis applied to the field of Alzheimer's Disease diagnostics. Then we propose our own design of a 3D Inception-based Convolutional Neural Network (CNN) for Alzheimer's Disease diagnostics. The network is designed with an emphasis on the interior resource utilization and uses sMRI and DTI modalities fusion on hippocampal ROI. The comparison with the conventional AlexNet-based network using data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset (http://adni.loni.usc.edu) demonstrates significantly better performance of the proposed 3D Inception-based CNN.

* arXiv admin note: substantial text overlap with arXiv:1801.05968

Via

Access Paper or Ask Questions

3D CNN-based classification using sMRI and MD-DTI images for Alzheimer disease studies

Jan 18, 2018

Alexander Khvostikov, Karim Aderghal, Jenny Benois-Pineau, Andrey Krylov, Gwenaelle Catheline

Figure 1 for 3D CNN-based classification using sMRI and MD-DTI images for Alzheimer disease studies

Figure 2 for 3D CNN-based classification using sMRI and MD-DTI images for Alzheimer disease studies

Figure 3 for 3D CNN-based classification using sMRI and MD-DTI images for Alzheimer disease studies

Figure 4 for 3D CNN-based classification using sMRI and MD-DTI images for Alzheimer disease studies

Abstract:Computer-aided early diagnosis of Alzheimers Disease (AD) and its prodromal form, Mild Cognitive Impairment (MCI), has been the subject of extensive research in recent years. Some recent studies have shown promising results in the AD and MCI determination using structural and functional Magnetic Resonance Imaging (sMRI, fMRI), Positron Emission Tomography (PET) and Diffusion Tensor Imaging (DTI) modalities. Furthermore, fusion of imaging modalities in a supervised machine learning framework has shown promising direction of research. In this paper we first review major trends in automatic classification methods such as feature extraction based methods as well as deep learning approaches in medical image analysis applied to the field of Alzheimer's Disease diagnostics. Then we propose our own algorithm for Alzheimer's Disease diagnostics based on a convolutional neural network and sMRI and DTI modalities fusion on hippocampal ROI using data from the Alzheimers Disease Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). Comparison with a single modality approach shows promising results. We also propose our own method of data augmentation for balancing classes of different size and analyze the impact of the ROI size on the classification results as well.

Via

Access Paper or Ask Questions

Saliency Driven Object recognition in egocentric videos with deep CNN

Jun 23, 2016

Philippe Pérez de San Roman, Jenny Benois-Pineau, Jean-Philippe Domenger, Florent Paclet, Daniel Cataert, Aymar de Rugy

Figure 1 for Saliency Driven Object recognition in egocentric videos with deep CNN

Figure 2 for Saliency Driven Object recognition in egocentric videos with deep CNN

Figure 3 for Saliency Driven Object recognition in egocentric videos with deep CNN

Figure 4 for Saliency Driven Object recognition in egocentric videos with deep CNN

Abstract:The problem of object recognition in natural scenes has been recently successfully addressed with Deep Convolutional Neuronal Networks giving a significant break-through in recognition scores. The computational efficiency of Deep CNNs as a function of their depth, allows for their use in real-time applications. One of the key issues here is to reduce the number of windows selected from images to be submitted to a Deep CNN. This is usually solved by preliminary segmentation and selection of specific windows, having outstanding "objectiveness" or other value of indicators of possible location of objects. In this paper we propose a Deep CNN approach and the general framework for recognition of objects in a real-time scenario and in an egocentric perspective. Here the window of interest is built on the basis of visual attention map computed over gaze fixations measured by a glass-worn eye-tracker. The application of this set-up is an interactive user-friendly environment for upper-limb amputees. Vision has to help the subject to control his worn neuro-prosthesis in case of a small amount of remaining muscles when the EMG control becomes unefficient. The recognition results on a specifically recorded corpus of 151 videos with simple geometrical objects show the mAP of 64,6\% and the computational time at the generalization lower than a time of a visual fixation on the object-of-interest.

* 20 pages, 8 figures, 3 tables, Submitted to the Journal of Computer Vision and Image Understanding

Via

Access Paper or Ask Questions