Picture for Seon Joo Kim

Seon Joo Kim

Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models

Add code
Mar 11, 2025
Viaarxiv icon

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Add code
Jan 14, 2025
Viaarxiv icon

IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation

Add code
Dec 05, 2024
Figure 1 for IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation
Figure 2 for IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation
Figure 3 for IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation
Figure 4 for IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation
Viaarxiv icon

4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction

Add code
Nov 26, 2024
Figure 1 for 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction
Figure 2 for 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction
Figure 3 for 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction
Figure 4 for 4D Scaffold Gaussian Splatting for Memory Efficient Dynamic Scene Reconstruction
Viaarxiv icon

Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos

Add code
Aug 01, 2024
Figure 1 for Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos
Figure 2 for Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos
Figure 3 for Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos
Figure 4 for Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos
Viaarxiv icon

Accelerating Image Super-Resolution Networks with Pixel-Level Classification

Add code
Jul 31, 2024
Viaarxiv icon

Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging

Add code
Jul 29, 2024
Viaarxiv icon

ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos

Add code
Jul 17, 2024
Viaarxiv icon

Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization

Add code
Jul 09, 2024
Viaarxiv icon

Object Aware Egocentric Online Action Detection

Add code
Jun 03, 2024
Viaarxiv icon