Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Diff-BGM: A Diffusion Model for Video Background Music Generation

May 20, 2024

Sizhe Li, Yiming Qin, Minghang Zheng, Xin Jin, Yang Liu

Figure 1 for Diff-BGM: A Diffusion Model for Video Background Music Generation

Figure 2 for Diff-BGM: A Diffusion Model for Video Background Music Generation

Figure 3 for Diff-BGM: A Diffusion Model for Video Background Music Generation

Figure 4 for Diff-BGM: A Diffusion Model for Video Background Music Generation

Share this with someone who'll enjoy it:

Abstract:When editing a video, a piece of attractive background music is indispensable. However, video background music generation tasks face several challenges, for example, the lack of suitable training datasets, and the difficulties in flexibly controlling the music generation process and sequentially aligning the video and music. In this work, we first propose a high-quality music-video dataset BGM909 with detailed annotation and shot detection to provide multi-modal information about the video and music. We then present evaluation metrics to assess music quality, including music diversity and alignment between music and video with retrieval precision metrics. Finally, we propose the Diff-BGM framework to automatically generate the background music for a given video, which uses different signals to control different aspects of the music during the generation process, i.e., uses dynamic video features to control music rhythm and semantic features to control the melody and atmosphere. We propose to align the video and music sequentially by introducing a segment-aware cross-attention layer. Experiments verify the effectiveness of our proposed method. The code and models are available at https://github.com/sizhelee/Diff-BGM.

* Accepted by CVPR 2024(Poster)

View paper on

Share this with someone who'll enjoy it:

Title:Diff-BGM: A Diffusion Model for Video Background Music Generation

Paper and Code