Picture for Jinhyung Kim

Jinhyung Kim

MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations

Add code
Mar 20, 2025
Viaarxiv icon

Enhancing Whole Slide Pathology Foundation Models through Stain Normalization

Add code
Aug 05, 2024
Figure 1 for Enhancing Whole Slide Pathology Foundation Models through Stain Normalization
Figure 2 for Enhancing Whole Slide Pathology Foundation Models through Stain Normalization
Figure 3 for Enhancing Whole Slide Pathology Foundation Models through Stain Normalization
Figure 4 for Enhancing Whole Slide Pathology Foundation Models through Stain Normalization
Viaarxiv icon

Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition

Add code
Jun 13, 2024
Figure 1 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Figure 2 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Figure 3 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Figure 4 for Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition
Viaarxiv icon

Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pretraining

Add code
Dec 19, 2023
Viaarxiv icon

Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders

Add code
Dec 19, 2023
Viaarxiv icon

Masked Autoencoder for Unsupervised Video Summarization

Add code
Jun 02, 2023
Figure 1 for Masked Autoencoder for Unsupervised Video Summarization
Figure 2 for Masked Autoencoder for Unsupervised Video Summarization
Figure 3 for Masked Autoencoder for Unsupervised Video Summarization
Figure 4 for Masked Autoencoder for Unsupervised Video Summarization
Viaarxiv icon

Exploring Temporally Dynamic Data Augmentation for Video Recognition

Add code
Jun 30, 2022
Figure 1 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Figure 2 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Figure 3 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Figure 4 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Viaarxiv icon

Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning

Add code
Apr 08, 2022
Figure 1 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Figure 2 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Figure 3 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Figure 4 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Viaarxiv icon

VideoMix: Rethinking Data Augmentation for Video Classification

Add code
Dec 07, 2020
Figure 1 for VideoMix: Rethinking Data Augmentation for Video Classification
Figure 2 for VideoMix: Rethinking Data Augmentation for Video Classification
Figure 3 for VideoMix: Rethinking Data Augmentation for Video Classification
Figure 4 for VideoMix: Rethinking Data Augmentation for Video Classification
Viaarxiv icon

Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning

Add code
Jun 08, 2017
Figure 1 for Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning
Figure 2 for Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning
Figure 3 for Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning
Figure 4 for Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning
Viaarxiv icon