Picture for Tim Polzehl

Tim Polzehl

DFKI-Speech System for WildSpoof Challenge: A robust framework for SASV In-the-Wild

Add code
Feb 02, 2026
Viaarxiv icon

Content Leakage in LibriSpeech and Its Impact on the Privacy Evaluation of Speaker Anonymization

Add code
Jan 19, 2026
Viaarxiv icon

Two Views, One Truth: Spectral and Self-Supervised Features Fusion for Robust Speech Deepfake Detection

Add code
Jul 27, 2025
Viaarxiv icon

Private kNN-VC: Interpretable Anonymization of Converted Speech

Add code
May 23, 2025
Viaarxiv icon

BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention

Add code
May 20, 2025
Figure 1 for BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention
Figure 2 for BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention
Figure 3 for BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention
Figure 4 for BiCrossMamba-ST: Speech Deepfake Detection with Bidirectional Mamba Spectro-Temporal Cross-Attention
Viaarxiv icon

Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection

Add code
Feb 05, 2025
Figure 1 for Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection
Figure 2 for Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection
Figure 3 for Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection
Figure 4 for Comprehensive Layer-wise Analysis of SSL Models for Audio Deepfake Detection
Viaarxiv icon

Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example

Add code
Oct 20, 2024
Figure 1 for Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example
Figure 2 for Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example
Figure 3 for Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example
Figure 4 for Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example
Viaarxiv icon

StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings

Add code
Sep 14, 2023
Figure 1 for StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings
Figure 2 for StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings
Figure 3 for StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings
Figure 4 for StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings
Viaarxiv icon

Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion

Add code
Sep 14, 2023
Figure 1 for Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion
Figure 2 for Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion
Figure 3 for Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion
Figure 4 for Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion
Viaarxiv icon

Speaker adaptation for Wav2vec2 based dysarthric ASR

Add code
Apr 02, 2022
Figure 1 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 2 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 3 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 4 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Viaarxiv icon