Picture for Heeseung Yun

Heeseung Yun

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos

Add code
Aug 09, 2024
Figure 1 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Figure 2 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Figure 3 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Figure 4 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Viaarxiv icon

Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation

Add code
Sep 20, 2023
Figure 1 for Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation
Figure 2 for Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation
Figure 3 for Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation
Viaarxiv icon

Panoramic Vision Transformer for Saliency Detection in 360° Videos

Add code
Sep 19, 2022
Figure 1 for Panoramic Vision Transformer for Saliency Detection in 360° Videos
Figure 2 for Panoramic Vision Transformer for Saliency Detection in 360° Videos
Figure 3 for Panoramic Vision Transformer for Saliency Detection in 360° Videos
Figure 4 for Panoramic Vision Transformer for Saliency Detection in 360° Videos
Viaarxiv icon

Multimodal Knowledge Alignment with Reinforcement Learning

Add code
May 25, 2022
Figure 1 for Multimodal Knowledge Alignment with Reinforcement Learning
Figure 2 for Multimodal Knowledge Alignment with Reinforcement Learning
Figure 3 for Multimodal Knowledge Alignment with Reinforcement Learning
Figure 4 for Multimodal Knowledge Alignment with Reinforcement Learning
Viaarxiv icon

Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$ Videos

Add code
Oct 11, 2021
Figure 1 for Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$ Videos
Figure 2 for Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$ Videos
Figure 3 for Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$ Videos
Figure 4 for Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$ Videos
Viaarxiv icon

Video Summarization through Human Detection on a Social Robot

Add code
Jan 30, 2019
Figure 1 for Video Summarization through Human Detection on a Social Robot
Figure 2 for Video Summarization through Human Detection on a Social Robot
Figure 3 for Video Summarization through Human Detection on a Social Robot
Figure 4 for Video Summarization through Human Detection on a Social Robot
Viaarxiv icon