Picture for Jaeyeon Kim

Jaeyeon Kim

Fine-Tuning Masked Diffusion for Provable Self-Correction

Add code
Oct 01, 2025
Viaarxiv icon

Selective Underfitting in Diffusion Models

Add code
Oct 01, 2025
Viaarxiv icon

ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models

Add code
Sep 26, 2025
Viaarxiv icon

WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations

Add code
Aug 28, 2025
Viaarxiv icon

ViSAGe: Video-to-Spatial Audio Generation

Add code
Jun 13, 2025
Viaarxiv icon

Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge

Add code
May 12, 2025
Viaarxiv icon

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Add code
Apr 01, 2025
Figure 1 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Figure 2 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Figure 3 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Figure 4 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Viaarxiv icon

LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won't Fail)

Add code
Feb 13, 2025
Viaarxiv icon

Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions

Add code
Feb 10, 2025
Viaarxiv icon

Assessing the Answerability of Queries in Retrieval-Augmented Code Generation

Add code
Nov 08, 2024
Figure 1 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Figure 2 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Figure 3 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Figure 4 for Assessing the Answerability of Queries in Retrieval-Augmented Code Generation
Viaarxiv icon