Volker Dellwo

Two-Stream Spatial-Temporal Transformer Framework for Person Identification via Natural Conversational Keypoints

Feb 28, 2025
Via arXiv

Comparative Analysis of Modality Fusion Approaches for Audio-Visual Person Identification and Verification

Aug 31, 2024
Via arXiv

Self-Supervised Models in Automatic Whispered Speech Recognition

Jul 30, 2024
Via arXiv

Deep Neural Networks for Automatic Speaker Recognition Do Not Learn Supra-Segmental Temporal Features

Nov 02, 2023
Via arXiv

Reorganization of the auditory-perceptual space across the human vocal range

Sep 13, 2023
Via arXiv