Picture for Leibny Paola Garcia

Leibny Paola Garcia

SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model

Add code
Nov 12, 2024
Figure 1 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 2 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 3 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 4 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Viaarxiv icon

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

Add code
Mar 09, 2024
Viaarxiv icon

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

Add code
Feb 16, 2024
Viaarxiv icon

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors

Add code
Nov 27, 2023
Viaarxiv icon

Enhancing Code-switching Speech Recognition with Interactive Language Biases

Add code
Sep 29, 2023
Figure 1 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 2 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 3 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 4 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Viaarxiv icon

Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex

Add code
Sep 26, 2023
Figure 1 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 2 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 3 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 4 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Viaarxiv icon

Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts

Add code
Jun 01, 2023
Viaarxiv icon

EURO: ESPnet Unsupervised ASR Open-source Toolkit

Add code
Dec 01, 2022
Viaarxiv icon

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

Add code
Oct 26, 2022
Figure 1 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 2 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 3 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 4 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Viaarxiv icon

Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection

Add code
Oct 06, 2022
Figure 1 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Figure 2 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Figure 3 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Figure 4 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Viaarxiv icon