Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ayoub Ghriss

Deep Clustering via Probabilistic Ratio-Cut Optimization

Feb 05, 2025

Ayoub Ghriss, Claire Monteleoni

Abstract:We propose a novel approach for optimizing the graph ratio-cut by modeling the binary assignments as random variables. We provide an upper bound on the expected ratio-cut, as well as an unbiased estimate of its gradient, to learn the parameters of the assignment variables in an online setting. The clustering resulting from our probabilistic approach (PRCut) outperforms the Rayleigh quotient relaxation of the combinatorial problem, its online learning extensions, and several widely used methods. We demonstrate that the PRCut clustering closely aligns with the similarity measure and can perform as well as a supervised classifier when label-based similarities are provided. This novel approach can leverage out-of-the-box self-supervised representations to achieve competitive performance and serve as an evaluation method for the quality of these representations.

* Proceedings of Machine Learning Research (2008), Volume 258
* Proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025, Mai Khao, Thailand. PMLR: Volume 258

Via

Access Paper or Ask Questions

Reinforcement Learning with Options and State Representation

Mar 25, 2024

Ayoub Ghriss, Masashi Sugiyama, Alessandro Lazaric

Figure 1 for Reinforcement Learning with Options and State Representation

Figure 2 for Reinforcement Learning with Options and State Representation

Figure 3 for Reinforcement Learning with Options and State Representation

Figure 4 for Reinforcement Learning with Options and State Representation

Abstract:The current thesis aims to explore the reinforcement learning field and build on existing methods to produce improved ones to tackle the problem of learning in high-dimensional and complex environments. It addresses such goals by decomposing learning tasks in a hierarchical fashion known as Hierarchical Reinforcement Learning. We start in the first chapter by getting familiar with the Markov Decision Process framework and presenting some of its recent techniques that the following chapters use. We then proceed to build our Hierarchical Policy learning as an answer to the limitations of a single primitive policy. The hierarchy is composed of a manager agent at the top and employee agents at the lower level. In the last chapter, which is the core of this thesis, we attempt to learn lower-level elements of the hierarchy independently of the manager level in what is known as the "Eigenoption". Based on the graph structure of the environment, Eigenoptions allow us to build agents that are aware of the geometric and dynamic properties of the environment. Their decision-making has a special property: it is invariant to symmetric transformations of the environment, allowing as a consequence to greatly reduce the complexity of the learning task.

* Master Thesis 2018, MVA ENS Paris-Saclay, Tokyo RIKEN AIP

Via

Access Paper or Ask Questions

Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition

Jan 27, 2022

Ayoub Ghriss, Bo Yang, Viktor Rozgic, Elizabeth Shriberg, Chao Wang

Figure 1 for Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition

Figure 2 for Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition

Figure 3 for Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition

Figure 4 for Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition

Abstract:We propose a novel multi-task pre-training method for Speech Emotion Recognition (SER). We pre-train SER model simultaneously on Automatic Speech Recognition (ASR) and sentiment classification tasks to make the acoustic ASR model more ``emotion aware''. We generate targets for the sentiment classification using text-to-sentiment model trained on publicly available data. Finally, we fine-tune the acoustic ASR on emotion annotated speech data. We evaluated the proposed approach on the MSP-Podcast dataset, where we achieved the best reported concordance correlation coefficient (CCC) of 0.41 for valence prediction.

* ICASSP 2022

Via

Access Paper or Ask Questions