Picture for Jaeyeon Kim

Jaeyeon Kim

LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won't Fail)

Add code
Feb 13, 2025
Viaarxiv icon

Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions

Add code
Feb 10, 2025
Viaarxiv icon

Assessing the Answerability of Queries in Retrieval-Augmented Code Generation

Add code
Nov 08, 2024
Figure 1 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Figure 2 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Figure 3 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Figure 4 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Viaarxiv icon

Task Diversity Shortens the ICL Plateau

Add code
Oct 07, 2024
Figure 1 for Task Diversity Shortens the ICL Plateau
Figure 2 for Task Diversity Shortens the ICL Plateau
Figure 3 for Task Diversity Shortens the ICL Plateau
Figure 4 for Task Diversity Shortens the ICL Plateau
Viaarxiv icon

Expanding on EnCLAP with Auxiliary Retrieval Model for Automated Audio Captioning

Add code
Sep 02, 2024
Viaarxiv icon

EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance

Add code
Sep 02, 2024
Figure 1 for EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance
Figure 2 for EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance
Figure 3 for EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance
Figure 4 for EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance
Viaarxiv icon

Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations

Add code
Feb 02, 2024
Viaarxiv icon

EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning

Add code
Jan 31, 2024
Viaarxiv icon

Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates

Add code
Sep 25, 2023
Viaarxiv icon

PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS

Add code
Mar 02, 2023
Viaarxiv icon