Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yacouba Kaloga

A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport

Feb 03, 2025

Yacouba Kaloga, Shashi Kumar, Petr Motlicek, Ina Kodrasi

Figure 1 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport

Figure 2 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport

Figure 3 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport

Figure 4 for A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport

Abstract:Accurate sequence-to-sequence (seq2seq) alignment is critical for applications like medical speech analysis and language learning tools relying on automatic speech recognition (ASR). State-of-the-art end-to-end (E2E) ASR systems, such as the Connectionist Temporal Classification (CTC) and transducer-based models, suffer from peaky behavior and alignment inaccuracies. In this paper, we propose a novel differentiable alignment framework based on one-dimensional optimal transport, enabling the model to learn a single alignment and perform ASR in an E2E manner. We introduce a pseudo-metric, called Sequence Optimal Transport Distance (SOTD), over the sequence space and discuss its theoretical properties. Based on the SOTD, we propose Optimal Temporal Transport Classification (OTTC) loss for ASR and contrast its behavior with CTC. Experimental results on the TIMIT, AMI, and LibriSpeech datasets show that our method considerably improves alignment performance, though with a trade-off in ASR performance when compared to CTC. We believe this work opens new avenues for seq2seq alignment research, providing a solid foundation for further exploration and development within the community.

Via

Access Paper or Ask Questions

Graph Neural Networks for Parkinsons Disease Detection

Sep 12, 2024

Shakeel A. Sheikh, Yacouba Kaloga, Ina Kodrasi

Abstract:Despite the promising performance of state of the art approaches for Parkinsons Disease (PD) detection, these approaches often analyze individual speech segments in isolation, which can lead to suboptimal results. Dysarthric cues that characterize speech impairments from PD patients are expected to be related across segments from different speakers. Isolated segment analysis fails to exploit these inter segment relationships. Additionally, not all speech segments from PD patients exhibit clear dysarthric symptoms, introducing label noise that can negatively affect the performance and generalizability of current approaches. To address these challenges, we propose a novel PD detection framework utilizing Graph Convolutional Networks (GCNs). By representing speech segments as nodes and capturing the similarity between segments through edges, our GCN model facilitates the aggregation of dysarthric cues across the graph, effectively exploiting segment relationships and mitigating the impact of label noise. Experimental results demonstrate theadvantages of the proposed GCN model for PD detection and provide insights into its underlying mechanisms

* Submitted to ICASSP 2025

Via

Access Paper or Ask Questions

A simple way to learn metrics between attributed graphs

Sep 26, 2022

Yacouba Kaloga, Pierre Borgnat, Amaury Habrard

Figure 1 for A simple way to learn metrics between attributed graphs

Figure 2 for A simple way to learn metrics between attributed graphs

Figure 3 for A simple way to learn metrics between attributed graphs

Figure 4 for A simple way to learn metrics between attributed graphs

Abstract:The choice of good distances and similarity measures between objects is important for many machine learning methods. Therefore, many metric learning algorithms have been developed in recent years, mainly for Euclidean data in order to improve performance of classification or clustering methods. However, due to difficulties in establishing computable, efficient and differentiable distances between attributed graphs, few metric learning algorithms adapted to graphs have been developed despite the strong interest of the community. In this paper, we address this issue by proposing a new Simple Graph Metric Learning - SGML - model with few trainable parameters based on Simple Graph Convolutional Neural Networks - SGCN - and elements of Optimal Transport theory. This model allows us to build an appropriate distance from a database of labeled (attributed) graphs to improve the performance of simple classification algorithms such as $k$-NN. This distance can be quickly trained while maintaining good performances as illustrated by the experimental study presented in this paper.

Via

Access Paper or Ask Questions

Multiview Variational Graph Autoencoders for Canonical Correlation Analysis

Oct 30, 2020

Yacouba Kaloga, Pierre Borgnat, Sundeep Prabhakar Chepuri, Patrice Abry, Amaury Habrard

Figure 1 for Multiview Variational Graph Autoencoders for Canonical Correlation Analysis

Figure 2 for Multiview Variational Graph Autoencoders for Canonical Correlation Analysis

Figure 3 for Multiview Variational Graph Autoencoders for Canonical Correlation Analysis

Abstract:We present a novel multiview canonical correlation analysis model based on a variational approach. This is the first nonlinear model that takes into account the available graph-based geometric constraints while being scalable for processing large scale datasets with multiple views. It is based on an autoencoder architecture with graph convolutional neural network layers. We experiment with our approach on classification, clustering, and recommendation tasks on real datasets. The algorithm is competitive with state-of-the-art multiview representation learning techniques.

* 4 pages, 3 figures, submitted

Via

Access Paper or Ask Questions

Hierarchical and Unsupervised Graph Representation Learning with Loukas's Coarsening

Jul 07, 2020

Louis Béthune, Yacouba Kaloga, Pierre Borgnat, Aurélien Garivier, Amaury Habrard

Figure 1 for Hierarchical and Unsupervised Graph Representation Learning with Loukas's Coarsening

Figure 2 for Hierarchical and Unsupervised Graph Representation Learning with Loukas's Coarsening

Figure 3 for Hierarchical and Unsupervised Graph Representation Learning with Loukas's Coarsening

Figure 4 for Hierarchical and Unsupervised Graph Representation Learning with Loukas's Coarsening

Abstract:We propose a novel algorithm for unsupervised graph representation learning with attributed graphs. It combines three advantages addressing some current limitations of the literature: i) The model is inductive: it can embed new graphs without re-training in the presence of new data; ii) The method takes into account both micro-structures and macro-structures by looking at the attributed graphs at different scales; iii) The model is end-to-end differentiable: it is a building block that can be plugged into deep learning pipelines and allows for back-propagation. We show that combining a coarsening method having strong theoretical guarantees with mutual information maximization suffices to produce high quality embeddings. We evaluate them on classification tasks with common benchmarks of the literature. We show that our algorithm is competitive with state of the art among unsupervised graph representation learning methods.

* 17 pages, 15 figures, submitted

Via

Access Paper or Ask Questions