Picture for Suha Kwak

Suha Kwak

GENIUS: A Generative Framework for Universal Multimodal Search

Add code
Mar 25, 2025
Viaarxiv icon

Enhancing Cost Efficiency in Active Learning with Candidate Set Query

Add code
Feb 10, 2025
Viaarxiv icon

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Add code
Jan 13, 2025
Figure 1 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 2 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 3 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 4 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Viaarxiv icon

Improving Text-based Person Search via Part-level Cross-modal Correspondence

Add code
Dec 31, 2024
Figure 1 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Figure 2 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Figure 3 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Figure 4 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Viaarxiv icon

ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation

Add code
Dec 05, 2024
Viaarxiv icon

Bootstrapping Top-down Information for Self-modulating Slot Attention

Add code
Nov 04, 2024
Viaarxiv icon

Improving Robustness to Multiple Spurious Correlations by Multi-Objective Optimization

Add code
Sep 05, 2024
Viaarxiv icon

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Add code
Aug 11, 2024
Figure 1 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Figure 2 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Figure 3 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Figure 4 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Viaarxiv icon

Online Temporal Action Localization with Memory-Augmented Transformer

Add code
Aug 06, 2024
Figure 1 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 2 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 3 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 4 for Online Temporal Action Localization with Memory-Augmented Transformer
Viaarxiv icon

Classification Matters: Improving Video Action Detection with Class-Specific Attention

Add code
Jul 29, 2024
Viaarxiv icon