Sabato Marco Siniscalchi

MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network

Nov 28, 2024

An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement

Sep 24, 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

Sep 17, 2024

Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement

Aug 08, 2024

Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions

Jun 23, 2024

Speech Analysis of Language Varieties in Italy

Jun 22, 2024

Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition

Jun 04, 2024

An Investigation of Incorporating Mamba for Speech Enhancement

May 10, 2024

Benchmarking Representations for Speech, Music, and Acoustic Events

May 02, 2024

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition

Feb 08, 2024