Taeoh Kim

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Jul 10, 2025

Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval

Apr 17, 2025

CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images

Mar 07, 2025

CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images

Dec 20, 2024

A Simple Baseline with Single-encoder for Referring Image Segmentation

Aug 28, 2024

Classification Matters: Improving Video Action Detection with Class-Specific Attention

Jul 29, 2024

Masked Autoencoder for Unsupervised Video Summarization

Jun 02, 2023

Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection

Mar 30, 2023

Exploring Temporally Dynamic Data Augmentation for Video Recognition

Jun 30, 2022

Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning

Apr 08, 2022