Picture for Björn W. Schuller

Björn W. Schuller

EIHW -- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany, GLAM -- Group on Language, Audio, and Music, Imperial College London, UK

M6: Multi-generator, Multi-domain, Multi-lingual and cultural, Multi-genres, Multi-instrument Machine-Generated Music Detection Databases

Add code
Dec 08, 2024
Viaarxiv icon

From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview

Add code
Nov 30, 2024
Viaarxiv icon

Using voice analysis as an early indicator of risk for depression in young adults

Add code
Nov 18, 2024
Viaarxiv icon

Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning

Add code
Nov 01, 2024
Figure 1 for Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning
Figure 2 for Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning
Figure 3 for Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning
Figure 4 for Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning
Viaarxiv icon

Audio-based Kinship Verification Using Age Domain Conversion

Add code
Oct 14, 2024
Viaarxiv icon

Audio Explanation Synthesis with Generative Foundation Models

Add code
Oct 10, 2024
Figure 1 for Audio Explanation Synthesis with Generative Foundation Models
Figure 2 for Audio Explanation Synthesis with Generative Foundation Models
Figure 3 for Audio Explanation Synthesis with Generative Foundation Models
Figure 4 for Audio Explanation Synthesis with Generative Foundation Models
Viaarxiv icon

Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models

Add code
Sep 10, 2024
Viaarxiv icon

Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation

Add code
Sep 04, 2024
Viaarxiv icon

Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance

Add code
Aug 12, 2024
Viaarxiv icon

Abusive Speech Detection in Indic Languages Using Acoustic Features

Add code
Jul 30, 2024
Viaarxiv icon