Picture for Chang D. Yoo

Chang D. Yoo

A Hidden Semantic Bottleneck in Conditional Embeddings of Diffusion Transformers

Add code
Feb 25, 2026
Viaarxiv icon

Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning

Add code
Feb 23, 2026
Viaarxiv icon

Towards Robust Dysarthric Speech Recognition: LLM-Agent Post-ASR Correction Beyond WER

Add code
Jan 29, 2026
Viaarxiv icon

The Interspeech 2025 Speech Accessibility Project Challenge

Add code
Jul 29, 2025
Viaarxiv icon

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Add code
Jun 15, 2025
Figure 1 for Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models
Figure 2 for Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models
Figure 3 for Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models
Figure 4 for Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models
Viaarxiv icon

ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization

Add code
Jun 12, 2025
Viaarxiv icon

TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis

Add code
Apr 08, 2025
Viaarxiv icon

ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On

Add code
Mar 26, 2025
Viaarxiv icon

E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization

Add code
Feb 13, 2025
Viaarxiv icon

DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving

Add code
Jan 09, 2025
Figure 1 for DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving
Figure 2 for DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving
Viaarxiv icon