Picture for Taehwan Kim

Taehwan Kim

Zero-shot Text-guided Infinite Image Synthesis with LLM guidance

Add code
Jul 17, 2024
Viaarxiv icon

Grid Diffusion Models for Text-to-Video Generation

Add code
Mar 30, 2024
Figure 1 for Grid Diffusion Models for Text-to-Video Generation
Figure 2 for Grid Diffusion Models for Text-to-Video Generation
Figure 3 for Grid Diffusion Models for Text-to-Video Generation
Figure 4 for Grid Diffusion Models for Text-to-Video Generation
Viaarxiv icon

Sound of Story: Multi-modal Storytelling with Audio

Add code
Oct 30, 2023
Figure 1 for Sound of Story: Multi-modal Storytelling with Audio
Figure 2 for Sound of Story: Multi-modal Storytelling with Audio
Figure 3 for Sound of Story: Multi-modal Storytelling with Audio
Figure 4 for Sound of Story: Multi-modal Storytelling with Audio
Viaarxiv icon

Effective Slogan Generation with Noise Perturbation

Add code
Oct 12, 2023
Figure 1 for Effective Slogan Generation with Noise Perturbation
Figure 2 for Effective Slogan Generation with Noise Perturbation
Figure 3 for Effective Slogan Generation with Noise Perturbation
Figure 4 for Effective Slogan Generation with Noise Perturbation
Viaarxiv icon

Generating Realistic Images from In-the-wild Sounds

Add code
Sep 05, 2023
Viaarxiv icon

Technical Report for CVPR 2022 LOVEU AQTC Challenge

Add code
Jun 29, 2022
Figure 1 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Figure 2 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Figure 3 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Figure 4 for Technical Report for CVPR 2022 LOVEU AQTC Challenge
Viaarxiv icon

Understanding Beauty via Deep Facial Features

Add code
Apr 17, 2019
Figure 1 for Understanding Beauty via Deep Facial Features
Figure 2 for Understanding Beauty via Deep Facial Features
Figure 3 for Understanding Beauty via Deep Facial Features
Figure 4 for Understanding Beauty via Deep Facial Features
Viaarxiv icon

Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention

Add code
Jun 16, 2018
Figure 1 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Figure 2 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Figure 3 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Figure 4 for Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
Viaarxiv icon

Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation

Add code
Sep 26, 2016
Figure 1 for Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation
Figure 2 for Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation
Figure 3 for Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation
Figure 4 for Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation
Viaarxiv icon

American Sign Language fingerspelling recognition from video: Methods for unrestricted recognition and signer-independence

Add code
Aug 30, 2016
Figure 1 for American Sign Language fingerspelling recognition from video: Methods for unrestricted recognition and signer-independence
Figure 2 for American Sign Language fingerspelling recognition from video: Methods for unrestricted recognition and signer-independence
Figure 3 for American Sign Language fingerspelling recognition from video: Methods for unrestricted recognition and signer-independence
Figure 4 for American Sign Language fingerspelling recognition from video: Methods for unrestricted recognition and signer-independence
Viaarxiv icon