Hee Suk Yoon

Physics Informed Distillation for Diffusion Models

Nov 13, 2024
Via arXiv

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

Aug 12, 2024
Via arXiv

LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition

Aug 11, 2024
Via arXiv

TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback

Jul 23, 2024
Via arXiv

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion

Mar 31, 2024
Via arXiv

AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition

Mar 18, 2024
Via arXiv

HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue

Dec 15, 2023
Via arXiv

SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data Augmentation

Dec 10, 2023
Via arXiv

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

Aug 16, 2023
Via arXiv

INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition

May 25, 2023
Via arXiv