Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:MeloTrans: A Text to Symbolic Music Generation Model Following Human Composition Habit

Oct 17, 2024

Yutian Wang, Wanyin Yang, Zhenrong Dai, Yilong Zhang, Kun Zhao, Hui Wang

Figure 1 for MeloTrans: A Text to Symbolic Music Generation Model Following Human Composition Habit

Figure 2 for MeloTrans: A Text to Symbolic Music Generation Model Following Human Composition Habit

Figure 3 for MeloTrans: A Text to Symbolic Music Generation Model Following Human Composition Habit

Figure 4 for MeloTrans: A Text to Symbolic Music Generation Model Following Human Composition Habit

Share this with someone who'll enjoy it:

Abstract:At present, neural network models show powerful sequence prediction ability and are used in many automatic composition models. In comparison, the way humans compose music is very different from it. Composers usually start by creating musical motifs and then develop them into music through a series of rules. This process ensures that the music has a specific structure and changing pattern. However, it is difficult for neural network models to learn these composition rules from training data, which results in a lack of musicality and diversity in the generated music. This paper posits that integrating the learning capabilities of neural networks with human-derived knowledge may lead to better results. To archive this, we develop the POP909$\_$M dataset, the first to include labels for musical motifs and their variants, providing a basis for mimicking human compositional habits. Building on this, we propose MeloTrans, a text-to-music composition model that employs principles of motif development rules. Our experiments demonstrate that MeloTrans excels beyond existing music generation models and even surpasses Large Language Models (LLMs) like ChatGPT-4. This highlights the importance of merging human insights with neural network capabilities to achieve superior symbolic music generation.

View paper on

Share this with someone who'll enjoy it:

Title:MeloTrans: A Text to Symbolic Music Generation Model Following Human Composition Habit

Paper and Code