Picture for Suha Kwak

Suha Kwak

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Add code
Jan 13, 2025
Figure 1 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 2 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 3 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 4 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Viaarxiv icon

Improving Text-based Person Search via Part-level Cross-modal Correspondence

Add code
Dec 31, 2024
Figure 1 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Figure 2 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Figure 3 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Figure 4 for Improving Text-based Person Search via Part-level Cross-modal Correspondence
Viaarxiv icon

ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation

Add code
Dec 05, 2024
Viaarxiv icon

Bootstrapping Top-down Information for Self-modulating Slot Attention

Add code
Nov 04, 2024
Viaarxiv icon

Improving Robustness to Multiple Spurious Correlations by Multi-Objective Optimization

Add code
Sep 05, 2024
Viaarxiv icon

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Add code
Aug 11, 2024
Figure 1 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Figure 2 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Figure 3 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Figure 4 for Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Viaarxiv icon

Online Temporal Action Localization with Memory-Augmented Transformer

Add code
Aug 06, 2024
Figure 1 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 2 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 3 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 4 for Online Temporal Action Localization with Memory-Augmented Transformer
Viaarxiv icon

Classification Matters: Improving Video Action Detection with Class-Specific Attention

Add code
Jul 29, 2024
Viaarxiv icon

FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions

Add code
Jul 18, 2024
Viaarxiv icon

Extreme Point Supervised Instance Segmentation

Add code
Jun 04, 2024
Viaarxiv icon