Picture for Samir Sadok

Samir Sadok

Residual Tokens Enhance Masked Autoencoders for Speech Modeling

Add code
Jan 27, 2026
Viaarxiv icon

Bringing Interpretability to Neural Audio Codecs

Add code
Jun 04, 2025
Viaarxiv icon

AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder

Add code
Jan 09, 2025
Viaarxiv icon

A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning

Add code
May 05, 2023
Viaarxiv icon

A vector quantized masked autoencoder for audiovisual speech emotion recognition

Add code
May 05, 2023
Figure 1 for A vector quantized masked autoencoder for audiovisual speech emotion recognition
Figure 2 for A vector quantized masked autoencoder for audiovisual speech emotion recognition
Figure 3 for A vector quantized masked autoencoder for audiovisual speech emotion recognition
Figure 4 for A vector quantized masked autoencoder for audiovisual speech emotion recognition
Viaarxiv icon

A vector quantized masked autoencoder for speech emotion recognition

Add code
Apr 21, 2023
Figure 1 for A vector quantized masked autoencoder for speech emotion recognition
Figure 2 for A vector quantized masked autoencoder for speech emotion recognition
Figure 3 for A vector quantized masked autoencoder for speech emotion recognition
Figure 4 for A vector quantized masked autoencoder for speech emotion recognition
Viaarxiv icon

Learning and controlling the source-filter representation of speech with a variational autoencoder

Add code
Apr 14, 2022
Figure 1 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Figure 2 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Figure 3 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Figure 4 for Learning and controlling the source-filter representation of speech with a variational autoencoder
Viaarxiv icon