Picture for Björn W. Schuller

Björn W. Schuller

EIHW -- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany, GLAM -- Group on Language, Audio, and Music, Imperial College London, UK

DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset

Add code
Jan 21, 2025
Figure 1 for DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset
Figure 2 for DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset
Figure 3 for DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset
Figure 4 for DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset
Viaarxiv icon

Parameterised Quantum Circuits for Novel Representation Learning in Speech Emotion Recognition

Add code
Jan 21, 2025
Viaarxiv icon

DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids

Add code
Jan 17, 2025
Viaarxiv icon

MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge

Add code
Jan 08, 2025
Figure 1 for MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge
Figure 2 for MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge
Figure 3 for MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge
Viaarxiv icon

Gender Bias in Text-to-Video Generation Models: A case study of Sora

Add code
Dec 30, 2024
Viaarxiv icon

Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment

Add code
Dec 19, 2024
Viaarxiv icon

Detecting Machine-Generated Music with Explainability -- A Challenge and Early Benchmarks

Add code
Dec 18, 2024
Figure 1 for Detecting Machine-Generated Music with Explainability -- A Challenge and Early Benchmarks
Figure 2 for Detecting Machine-Generated Music with Explainability -- A Challenge and Early Benchmarks
Figure 3 for Detecting Machine-Generated Music with Explainability -- A Challenge and Early Benchmarks
Figure 4 for Detecting Machine-Generated Music with Explainability -- A Challenge and Early Benchmarks
Viaarxiv icon

Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features

Add code
Dec 17, 2024
Figure 1 for Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features
Figure 2 for Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features
Figure 3 for Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features
Figure 4 for Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features
Viaarxiv icon

autrainer: A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks

Add code
Dec 16, 2024
Viaarxiv icon

ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis

Add code
Dec 16, 2024
Figure 1 for ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
Figure 2 for ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
Figure 3 for ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
Figure 4 for ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
Viaarxiv icon