Picture for Minsu Cho

Minsu Cho

ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation

Add code
Dec 05, 2024
Viaarxiv icon

RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos

Add code
Dec 04, 2024
Viaarxiv icon

MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers

Add code
Nov 28, 2024
Figure 1 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 2 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 3 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 4 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Viaarxiv icon

3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction

Add code
Nov 04, 2024
Viaarxiv icon

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Add code
Aug 09, 2024
Viaarxiv icon

Online Temporal Action Localization with Memory-Augmented Transformer

Add code
Aug 06, 2024
Viaarxiv icon

Classification Matters: Improving Video Action Detection with Class-Specific Attention

Add code
Jul 29, 2024
Viaarxiv icon

3D Geometric Shape Assembly via Efficient Point Cloud Matching

Add code
Jul 15, 2024
Figure 1 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Figure 2 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Figure 3 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Figure 4 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Viaarxiv icon

Burst Image Super-Resolution with Base Frame Selection

Add code
Jun 25, 2024
Viaarxiv icon

Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation

Add code
Apr 26, 2024
Viaarxiv icon