Picture for Yu-Jung Heo

Yu-Jung Heo

Biointelligence Laboratory, Department of Computer Science and Engineering, Seoul National University, Seoul, South Korea

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

Add code
Aug 12, 2024
Viaarxiv icon

Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration

Add code
Jun 24, 2024
Viaarxiv icon

Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024

Add code
Jun 10, 2024
Viaarxiv icon

Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering

Add code
Jun 04, 2024
Figure 1 for Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Figure 2 for Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Figure 3 for Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Figure 4 for Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Viaarxiv icon

SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation

Add code
Oct 17, 2022
Figure 1 for SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation
Figure 2 for SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation
Figure 3 for SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation
Figure 4 for SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation
Viaarxiv icon

Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering

Add code
Apr 22, 2022
Figure 1 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Figure 2 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Figure 3 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Figure 4 for Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Viaarxiv icon

Toward a Human-Level Video Understanding Intelligence

Add code
Oct 18, 2021
Figure 1 for Toward a Human-Level Video Understanding Intelligence
Figure 2 for Toward a Human-Level Video Understanding Intelligence
Figure 3 for Toward a Human-Level Video Understanding Intelligence
Viaarxiv icon

CogME: A Novel Evaluation Metric for Video Understanding Intelligence

Add code
Jul 21, 2021
Figure 1 for CogME: A Novel Evaluation Metric for Video Understanding Intelligence
Figure 2 for CogME: A Novel Evaluation Metric for Video Understanding Intelligence
Figure 3 for CogME: A Novel Evaluation Metric for Video Understanding Intelligence
Figure 4 for CogME: A Novel Evaluation Metric for Video Understanding Intelligence
Viaarxiv icon

DramaQA: Character-Centered Video Story Understanding with Hierarchical QA

Add code
May 07, 2020
Figure 1 for DramaQA: Character-Centered Video Story Understanding with Hierarchical QA
Figure 2 for DramaQA: Character-Centered Video Story Understanding with Hierarchical QA
Figure 3 for DramaQA: Character-Centered Video Story Understanding with Hierarchical QA
Figure 4 for DramaQA: Character-Centered Video Story Understanding with Hierarchical QA
Viaarxiv icon

Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data

Add code
Jan 17, 2020
Figure 1 for Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
Figure 2 for Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
Figure 3 for Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
Figure 4 for Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
Viaarxiv icon