Picture for Xavier Serra

Xavier Serra

Benchmarking Music Autotagging with MGPHot Expert Annotations vs. Generic Tag Datasets

Add code
Sep 08, 2025
Viaarxiv icon

Fractional Fourier Sound Synthesis

Add code
Jun 10, 2025
Viaarxiv icon

A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis

Add code
Jun 04, 2025
Viaarxiv icon

Automatic Estimation of Singing Voice Musical Dynamics

Add code
Oct 27, 2024
Figure 1 for Automatic Estimation of Singing Voice Musical Dynamics
Figure 2 for Automatic Estimation of Singing Voice Musical Dynamics
Figure 3 for Automatic Estimation of Singing Voice Musical Dynamics
Figure 4 for Automatic Estimation of Singing Voice Musical Dynamics
Viaarxiv icon

Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata

Add code
Oct 22, 2024
Figure 1 for Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata
Figure 2 for Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata
Figure 3 for Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata
Figure 4 for Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata
Viaarxiv icon

Heterogeneous sound classification with the Broad Sound Taxonomy and Dataset

Add code
Oct 01, 2024
Figure 1 for Heterogeneous sound classification with the Broad Sound Taxonomy and Dataset
Figure 2 for Heterogeneous sound classification with the Broad Sound Taxonomy and Dataset
Figure 3 for Heterogeneous sound classification with the Broad Sound Taxonomy and Dataset
Figure 4 for Heterogeneous sound classification with the Broad Sound Taxonomy and Dataset
Viaarxiv icon

The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?

Add code
Sep 03, 2024
Figure 1 for The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?
Figure 2 for The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?
Figure 3 for The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?
Figure 4 for The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?
Viaarxiv icon

Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-efficient Approach

Add code
Aug 01, 2024
Figure 1 for Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-efficient Approach
Figure 2 for Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-efficient Approach
Figure 3 for Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-efficient Approach
Figure 4 for Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-efficient Approach
Viaarxiv icon

Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio

Add code
Jul 19, 2024
Figure 1 for Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
Figure 2 for Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
Figure 3 for Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
Figure 4 for Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
Viaarxiv icon

Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset

Add code
Mar 06, 2024
Figure 1 for Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset
Figure 2 for Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset
Figure 3 for Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset
Figure 4 for Can Audio Reveal Music Performance Difficulty? Insights from the Piano Syllabus Dataset
Viaarxiv icon