Picture for Ruibin Yuan

Ruibin Yuan

CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

Add code
Oct 17, 2024
Figure 1 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 2 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 3 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 4 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Viaarxiv icon

Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer

Add code
Oct 07, 2024
Figure 1 for Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer
Figure 2 for Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer
Figure 3 for Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer
Viaarxiv icon

You Know What I'm Saying -- Jailbreak Attack via Implicit Reference

Add code
Oct 04, 2024
Figure 1 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Figure 2 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Figure 3 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Figure 4 for You Know What I'm Saying -- Jailbreak Attack via Implicit Reference
Viaarxiv icon

HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router

Add code
Oct 03, 2024
Figure 1 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Figure 2 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Figure 3 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Figure 4 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Viaarxiv icon

SongTrans: An unified song transcription and alignment method for lyrics and notes

Add code
Sep 22, 2024
Viaarxiv icon

Foundation Models for Music: A Survey

Add code
Aug 27, 2024
Figure 1 for Foundation Models for Music: A Survey
Figure 2 for Foundation Models for Music: A Survey
Figure 3 for Foundation Models for Music: A Survey
Figure 4 for Foundation Models for Music: A Survey
Viaarxiv icon

Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation

Add code
Jul 31, 2024
Figure 1 for Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Figure 2 for Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Figure 3 for Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Figure 4 for Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Viaarxiv icon

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Add code
Jul 30, 2024
Figure 1 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 2 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 3 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 4 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Viaarxiv icon

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

Add code
Jun 06, 2024
Viaarxiv icon

LLMs Meet Multimodal Generation and Editing: A Survey

Add code
May 29, 2024
Viaarxiv icon