Picture for Youngjae Yu

Youngjae Yu

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

Add code
Dec 02, 2024
Viaarxiv icon

TIPO: Text to Image with Text Presampling for Prompt Optimization

Add code
Nov 12, 2024
Figure 1 for TIPO: Text to Image with Text Presampling for Prompt Optimization
Figure 2 for TIPO: Text to Image with Text Presampling for Prompt Optimization
Figure 3 for TIPO: Text to Image with Text Presampling for Prompt Optimization
Figure 4 for TIPO: Text to Image with Text Presampling for Prompt Optimization
Viaarxiv icon

$C^2$: Scalable Auto-Feedback for LLM-based Chart Generation

Add code
Oct 24, 2024
Figure 1 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Figure 2 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Figure 3 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Figure 4 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Viaarxiv icon

Towards Visual Text Design Transfer Across Languages

Add code
Oct 24, 2024
Figure 1 for Towards Visual Text Design Transfer Across Languages
Figure 2 for Towards Visual Text Design Transfer Across Languages
Figure 3 for Towards Visual Text Design Transfer Across Languages
Figure 4 for Towards Visual Text Design Transfer Across Languages
Viaarxiv icon

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction

Add code
Oct 02, 2024
Figure 1 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 2 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 3 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 4 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Viaarxiv icon

Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!

Add code
Oct 01, 2024
Figure 1 for Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!
Figure 2 for Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!
Figure 3 for Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!
Figure 4 for Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!
Viaarxiv icon

DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation

Add code
Aug 12, 2024
Figure 1 for DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation
Figure 2 for DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation
Figure 3 for DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation
Figure 4 for DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation
Viaarxiv icon

ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos

Add code
Jul 17, 2024
Viaarxiv icon

Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation

Add code
Jul 13, 2024
Viaarxiv icon

Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory

Add code
Jul 03, 2024
Viaarxiv icon