Picture for Hexin Liu

Hexin Liu

SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model

Add code
Nov 12, 2024
Figure 1 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 2 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 3 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 4 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Viaarxiv icon

Selective State Space Model for Monaural Speech Enhancement

Add code
Nov 09, 2024
Figure 1 for Selective State Space Model for Monaural Speech Enhancement
Figure 2 for Selective State Space Model for Monaural Speech Enhancement
Figure 3 for Selective State Space Model for Monaural Speech Enhancement
Figure 4 for Selective State Space Model for Monaural Speech Enhancement
Viaarxiv icon

Mamba in Speech: Towards an Alternative to Self-Attention

Add code
May 22, 2024
Figure 1 for Mamba in Speech: Towards an Alternative to Self-Attention
Figure 2 for Mamba in Speech: Towards an Alternative to Self-Attention
Figure 3 for Mamba in Speech: Towards an Alternative to Self-Attention
Figure 4 for Mamba in Speech: Towards an Alternative to Self-Attention
Viaarxiv icon

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

Add code
Mar 09, 2024
Viaarxiv icon

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

Add code
Feb 17, 2024
Viaarxiv icon

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

Add code
Feb 16, 2024
Viaarxiv icon

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors

Add code
Nov 27, 2023
Viaarxiv icon

Generative error correction for code-switching speech recognition using large language models

Add code
Oct 17, 2023
Figure 1 for Generative error correction for code-switching speech recognition using large language models
Figure 2 for Generative error correction for code-switching speech recognition using large language models
Figure 3 for Generative error correction for code-switching speech recognition using large language models
Figure 4 for Generative error correction for code-switching speech recognition using large language models
Viaarxiv icon

Enhancing Code-switching Speech Recognition with Interactive Language Biases

Add code
Sep 29, 2023
Figure 1 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 2 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 3 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 4 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Viaarxiv icon

Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex

Add code
Sep 26, 2023
Figure 1 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 2 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 3 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 4 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Viaarxiv icon