Picture for Sungroh Yoon

Sungroh Yoon

EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models

Add code
Feb 27, 2025
Viaarxiv icon

Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models

Add code
Feb 27, 2025
Viaarxiv icon

Visual Attention Never Fades: Selective Progressive Attention ReCalibration for Detailed Image Captioning in Multimodal Large Language Models

Add code
Feb 03, 2025
Viaarxiv icon

CNN-based TEM image denoising from first principles

Add code
Jan 20, 2025
Figure 1 for CNN-based TEM image denoising from first principles
Figure 2 for CNN-based TEM image denoising from first principles
Figure 3 for CNN-based TEM image denoising from first principles
Figure 4 for CNN-based TEM image denoising from first principles
Viaarxiv icon

Know "No" Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP

Add code
Jan 19, 2025
Viaarxiv icon

Battling the Non-stationarity in Time Series Forecasting via Test-time Adaptation

Add code
Jan 09, 2025
Figure 1 for Battling the Non-stationarity in Time Series Forecasting via Test-time Adaptation
Figure 2 for Battling the Non-stationarity in Time Series Forecasting via Test-time Adaptation
Figure 3 for Battling the Non-stationarity in Time Series Forecasting via Test-time Adaptation
Figure 4 for Battling the Non-stationarity in Time Series Forecasting via Test-time Adaptation
Viaarxiv icon

Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage

Add code
Dec 24, 2024
Viaarxiv icon

Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation

Add code
Dec 19, 2024
Viaarxiv icon

Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens

Add code
Dec 06, 2024
Figure 1 for Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
Figure 2 for Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
Figure 3 for Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
Figure 4 for Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
Viaarxiv icon

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Add code
Nov 23, 2024
Viaarxiv icon