Dorien Herremans

Singapore University of Technology and Design

JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata

Feb 11, 2025
Via arXiv

ImprovNet: Generating Controllable Musical Improvisations with Iterative Corruption Refinement

Feb 06, 2025
Via arXiv

Towards Unified Music Emotion Recognition across Dimensional and Categorical Models

Feb 06, 2025
Via arXiv

Text2midi: Generating Symbolic Music from Captions

Dec 21, 2024
Via arXiv

MIRFLEX: Music Information Retrieval Feature Library for Extraction

Nov 01, 2024
Via arXiv

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech

Oct 17, 2024
Via arXiv

Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction

Oct 15, 2024
Via arXiv

Prevailing Research Areas for Music AI in the Era of Foundation Models

Sep 14, 2024
Via arXiv

PRESENT: Zero-Shot Text-to-Prosody Control

Aug 13, 2024
Via arXiv

BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features

Jul 15, 2024
Via arXiv