Picture for Andrew Shin

Andrew Shin

Large Language Models Lack Understanding of Character Composition of Words

Add code
May 18, 2024
Viaarxiv icon

The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective

Add code
May 13, 2024
Viaarxiv icon

LADER: Log-Augmented DEnse Retrieval for Biomedical Literature Search

Add code
Apr 10, 2023
Viaarxiv icon

Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision

Add code
Mar 06, 2021
Figure 1 for Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Figure 2 for Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Figure 3 for Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Figure 4 for Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Viaarxiv icon

Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives

Add code
Feb 12, 2021
Figure 1 for Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives
Figure 2 for Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives
Figure 3 for Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives
Figure 4 for Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives
Viaarxiv icon

Reference-Based Video Colorization with Spatiotemporal Correspondence

Add code
Nov 25, 2020
Figure 1 for Reference-Based Video Colorization with Spatiotemporal Correspondence
Figure 2 for Reference-Based Video Colorization with Spatiotemporal Correspondence
Figure 3 for Reference-Based Video Colorization with Spatiotemporal Correspondence
Figure 4 for Reference-Based Video Colorization with Spatiotemporal Correspondence
Viaarxiv icon

Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Add code
Apr 27, 2018
Figure 1 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Figure 2 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Figure 3 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Figure 4 for Customized Image Narrative Generation via Interactive Visual Question Generation and Answering
Viaarxiv icon

DualNet: Domain-Invariant Network for Visual Question Answering

Add code
May 04, 2017
Figure 1 for DualNet: Domain-Invariant Network for Visual Question Answering
Figure 2 for DualNet: Domain-Invariant Network for Visual Question Answering
Figure 3 for DualNet: Domain-Invariant Network for Visual Question Answering
Figure 4 for DualNet: Domain-Invariant Network for Visual Question Answering
Viaarxiv icon

The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering

Add code
Sep 21, 2016
Figure 1 for The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering
Figure 2 for The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering
Figure 3 for The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering
Figure 4 for The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering
Viaarxiv icon

Beyond Caption To Narrative: Video Captioning With Multiple Sentences

Add code
May 18, 2016
Figure 1 for Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Figure 2 for Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Figure 3 for Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Figure 4 for Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Viaarxiv icon