Picture for Björn W. Schuller

Björn W. Schuller

EIHW -- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany, GLAM -- Group on Language, Audio, and Music, Imperial College London, UK

Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning

Add code
Nov 01, 2024
Viaarxiv icon

Audio-based Kinship Verification Using Age Domain Conversion

Add code
Oct 14, 2024
Viaarxiv icon

Audio Explanation Synthesis with Generative Foundation Models

Add code
Oct 10, 2024
Figure 1 for Audio Explanation Synthesis with Generative Foundation Models
Figure 2 for Audio Explanation Synthesis with Generative Foundation Models
Figure 3 for Audio Explanation Synthesis with Generative Foundation Models
Figure 4 for Audio Explanation Synthesis with Generative Foundation Models
Viaarxiv icon

Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models

Add code
Sep 10, 2024
Viaarxiv icon

Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation

Add code
Sep 04, 2024
Viaarxiv icon

Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance

Add code
Aug 12, 2024
Viaarxiv icon

Abusive Speech Detection in Indic Languages Using Acoustic Features

Add code
Jul 30, 2024
Viaarxiv icon

Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset

Add code
Jul 03, 2024
Viaarxiv icon

This Paper Had the Smartest Reviewers -- Flattery Detection Utilising an Audio-Textual Transformer-Based Approach

Add code
Jun 25, 2024
Viaarxiv icon

Speech Emotion Recognition under Resource Constraints with Data Distillation

Add code
Jun 21, 2024
Viaarxiv icon