Picture for Eun-Sol Kim

Eun-Sol Kim

Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024

Add code
Jun 10, 2024
Viaarxiv icon

Bridging the Domain Gap by Clustering-based Image-Text Graph Matching

Add code
Oct 04, 2023
Viaarxiv icon

Dense but Efficient VideoQA for Intricate Compositional Reasoning

Add code
Oct 19, 2022
Figure 1 for Dense but Efficient VideoQA for Intricate Compositional Reasoning
Figure 2 for Dense but Efficient VideoQA for Intricate Compositional Reasoning
Figure 3 for Dense but Efficient VideoQA for Intricate Compositional Reasoning
Figure 4 for Dense but Efficient VideoQA for Intricate Compositional Reasoning
Viaarxiv icon

Selective Token Generation for Few-shot Natural Language Generation

Add code
Sep 17, 2022
Figure 1 for Selective Token Generation for Few-shot Natural Language Generation
Figure 2 for Selective Token Generation for Few-shot Natural Language Generation
Figure 3 for Selective Token Generation for Few-shot Natural Language Generation
Figure 4 for Selective Token Generation for Few-shot Natural Language Generation
Viaarxiv icon

Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering

Add code
Apr 22, 2022
Figure 1 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Figure 2 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Figure 3 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Figure 4 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Viaarxiv icon

Video-Text Representation Learning via Differentiable Weak Temporal Alignment

Add code
Mar 31, 2022
Figure 1 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Figure 2 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Figure 3 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Figure 4 for Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Viaarxiv icon

MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection

Add code
Mar 28, 2022
Figure 1 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Figure 2 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Figure 3 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Figure 4 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Viaarxiv icon

Boundary-aware Self-supervised Learning for Video Scene Segmentation

Add code
Jan 14, 2022
Figure 1 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 2 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 3 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 4 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Viaarxiv icon

Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts

Add code
Oct 13, 2021
Figure 1 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Figure 2 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Figure 3 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Figure 4 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Viaarxiv icon

HOTR: End-to-End Human-Object Interaction Detection with Transformers

Add code
Apr 28, 2021
Figure 1 for HOTR: End-to-End Human-Object Interaction Detection with Transformers
Figure 2 for HOTR: End-to-End Human-Object Interaction Detection with Transformers
Figure 3 for HOTR: End-to-End Human-Object Interaction Detection with Transformers
Figure 4 for HOTR: End-to-End Human-Object Interaction Detection with Transformers
Viaarxiv icon