Sungroh Yoon

Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens

Dec 06, 2024

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Nov 23, 2024

Style-Friendly SNR Sampler for Style-Driven Generation

Nov 22, 2024

Unsupervised Homography Estimation on Multimodal Image Pair via Alternating Optimization

Nov 20, 2024

Interpretable Language Modeling via Induction-head Ngram Models

Oct 31, 2024

Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP

Oct 11, 2024

Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context

Oct 09, 2024

Textual Training for the Hassle-Free Removal of Unwanted Visual Data

Sep 30, 2024

NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers

Sep 24, 2024

VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance

Sep 24, 2024