Picture for Juhan Nam

Juhan Nam

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models

Add code
Nov 11, 2024
Viaarxiv icon

Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval

Add code
Oct 04, 2024
Figure 1 for Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
Figure 2 for Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
Figure 3 for Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
Figure 4 for Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
Viaarxiv icon

Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound

Add code
Aug 21, 2024
Viaarxiv icon

CONMOD: Controllable Neural Frame-based Modulation Effects

Add code
Jun 20, 2024
Figure 1 for CONMOD: Controllable Neural Frame-based Modulation Effects
Figure 2 for CONMOD: Controllable Neural Frame-based Modulation Effects
Figure 3 for CONMOD: Controllable Neural Frame-based Modulation Effects
Figure 4 for CONMOD: Controllable Neural Frame-based Modulation Effects
Viaarxiv icon

Musical Word Embedding for Music Tagging and Retrieval

Add code
Apr 23, 2024
Viaarxiv icon

Towards Efficient and Real-Time Piano Transcription Using Neural Autoregressive Models

Add code
Apr 10, 2024
Viaarxiv icon

Expressive Acoustic Guitar Sound Synthesis with an Instrument-Specific Input Representation and Diffusion Outpainting

Add code
Jan 24, 2024
Viaarxiv icon

A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance

Add code
Jan 17, 2024
Viaarxiv icon

T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis

Add code
Jan 17, 2024
Viaarxiv icon

DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech

Add code
Jan 16, 2024
Figure 1 for DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Figure 2 for DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Figure 3 for DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Figure 4 for DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Viaarxiv icon