Picture for Kyle Min

Kyle Min

Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation

Add code
Aug 12, 2024
Figure 1 for Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Figure 2 for Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Figure 3 for Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Figure 4 for Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
Viaarxiv icon

Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation

Add code
Jul 28, 2024
Figure 1 for Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Figure 2 for Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Figure 3 for Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Figure 4 for Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Viaarxiv icon

SViTT-Ego: A Sparse Video-Text Transformer for Egocentric Video

Add code
Jun 13, 2024
Viaarxiv icon

Contrastive Language Video Time Pre-training

Add code
Jun 04, 2024
Viaarxiv icon

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

Add code
May 25, 2024
Figure 1 for R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Figure 2 for R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Figure 3 for R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Figure 4 for R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Viaarxiv icon

Action Scene Graphs for Long-Form Understanding of Egocentric Videos

Add code
Dec 06, 2023
Viaarxiv icon

STHG: Spatial-Temporal Heterogeneous Graph Learning for Advanced Audio-Visual Diarization

Add code
Jun 18, 2023
Viaarxiv icon

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models

Add code
Jun 07, 2023
Figure 1 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 2 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 3 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 4 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Viaarxiv icon

SViTT: Temporal Learning of Sparse Video-Text Transformers

Add code
Apr 18, 2023
Viaarxiv icon

Unbiased Scene Graph Generation in Videos

Add code
Apr 06, 2023
Viaarxiv icon