Picture for Mathew Magimai. -Doss

Mathew Magimai. -Doss

Feature Representations for Automatic Meerkat Vocalization Classification

Add code
Aug 27, 2024
Viaarxiv icon

SSL-TTS: Leveraging Self-Supervised Embeddings and kNN Retrieval for Zero-Shot Multi-speaker TTS

Add code
Aug 20, 2024
Viaarxiv icon

Towards interfacing large language models with ASR systems using confidence measures and prompting

Add code
Jul 31, 2024
Viaarxiv icon

On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis

Add code
Jul 24, 2024
Viaarxiv icon

Predicting Heart Activity from Speech using Data-driven and Knowledge-based features

Add code
Jun 10, 2024
Viaarxiv icon

Can Self-Supervised Neural Networks Pre-Trained on Human Speech distinguish Animal Callers?

Add code
May 23, 2023
Viaarxiv icon

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

Add code
Jun 27, 2022
Figure 1 for Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Figure 2 for Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Figure 3 for Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Figure 4 for Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Viaarxiv icon

Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track

Add code
Jun 23, 2022
Figure 1 for Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track
Figure 2 for Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track
Figure 3 for Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track
Figure 4 for Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track
Viaarxiv icon

An Objective Evaluation Framework for Pathological Speech Synthesis

Add code
Jul 01, 2021
Figure 1 for An Objective Evaluation Framework for Pathological Speech Synthesis
Figure 2 for An Objective Evaluation Framework for Pathological Speech Synthesis
Figure 3 for An Objective Evaluation Framework for Pathological Speech Synthesis
Figure 4 for An Objective Evaluation Framework for Pathological Speech Synthesis
Viaarxiv icon

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks

Add code
Dec 07, 2013
Figure 1 for End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks
Figure 2 for End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks
Figure 3 for End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks
Figure 4 for End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks
Viaarxiv icon