Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Generating Symbolic Music from Natural Language Prompts using an LLM-Enhanced Dataset

Oct 02, 2024

Weihan Xu, Julian McAuley, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Hao-Wen Dong

Figure 1 for Generating Symbolic Music from Natural Language Prompts using an LLM-Enhanced Dataset

Figure 2 for Generating Symbolic Music from Natural Language Prompts using an LLM-Enhanced Dataset

Figure 3 for Generating Symbolic Music from Natural Language Prompts using an LLM-Enhanced Dataset

Figure 4 for Generating Symbolic Music from Natural Language Prompts using an LLM-Enhanced Dataset

Share this with someone who'll enjoy it:

Abstract:Recent years have seen many audio-domain text-to-music generation models that rely on large amounts of text-audio pairs for training. However, symbolic-domain controllable music generation has lagged behind partly due to the lack of a large-scale symbolic music dataset with extensive metadata and captions. In this work, we present MetaScore, a new dataset consisting of 963K musical scores paired with rich metadata, including free-form user-annotated tags, collected from an online music forum. To approach text-to-music generation, we leverage a pretrained large language model (LLM) to generate pseudo natural language captions from the metadata. With the LLM-enhanced MetaScore, we train a text-conditioned music generation model that learns to generate symbolic music from the pseudo captions, allowing control of instruments, genre, composer, complexity and other free-form music descriptors. In addition, we train a tag-conditioned system that supports a predefined set of tags available in MetaScore. Our experimental results show that both the proposed text-to-music and tags-to-music models outperform a baseline text-to-music model in a listening test, while the text-based system offers a more natural interface that allows free-form natural language prompts.

View paper on

Share this with someone who'll enjoy it:

Title:Generating Symbolic Music from Natural Language Prompts using an LLM-Enhanced Dataset

Paper and Code