Picture for Taehwan Kim

Taehwan Kim

RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals

Add code
Feb 18, 2025
Viaarxiv icon

Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation

Add code
Jan 14, 2025
Viaarxiv icon

Zero-shot Text-guided Infinite Image Synthesis with LLM guidance

Add code
Jul 17, 2024
Viaarxiv icon

Grid Diffusion Models for Text-to-Video Generation

Add code
Mar 30, 2024
Figure 1 for Grid Diffusion Models for Text-to-Video Generation
Figure 2 for Grid Diffusion Models for Text-to-Video Generation
Figure 3 for Grid Diffusion Models for Text-to-Video Generation
Figure 4 for Grid Diffusion Models for Text-to-Video Generation
Viaarxiv icon

Sound of Story: Multi-modal Storytelling with Audio

Add code
Oct 30, 2023
Figure 1 for Sound of Story: Multi-modal Storytelling with Audio
Figure 2 for Sound of Story: Multi-modal Storytelling with Audio
Figure 3 for Sound of Story: Multi-modal Storytelling with Audio
Figure 4 for Sound of Story: Multi-modal Storytelling with Audio
Viaarxiv icon

Effective Slogan Generation with Noise Perturbation

Add code
Oct 12, 2023
Figure 1 for Effective Slogan Generation with Noise Perturbation
Figure 2 for Effective Slogan Generation with Noise Perturbation
Figure 3 for Effective Slogan Generation with Noise Perturbation
Figure 4 for Effective Slogan Generation with Noise Perturbation
Viaarxiv icon

Generating Realistic Images from In-the-wild Sounds

Add code
Sep 05, 2023
Viaarxiv icon

Technical Report for CVPR 2022 LOVEU AQTC Challenge

Add code
Jun 29, 2022
Figure 1 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Figure 2 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Figure 3 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Figure 4 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Viaarxiv icon

Understanding Beauty via Deep Facial Features

Add code
Apr 17, 2019
Figure 1 for Understanding Beauty via Deep Facial Features
Figure 2 for Understanding Beauty via Deep Facial Features
Figure 3 for Understanding Beauty via Deep Facial Features
Figure 4 for Understanding Beauty via Deep Facial Features
Viaarxiv icon

Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention

Add code
Jun 16, 2018
Figure 1 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Figure 2 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Figure 3 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Figure 4 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Viaarxiv icon