Picture for Guan-Lin Chao

Guan-Lin Chao

DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks

Add code
Aug 28, 2019
Figure 1 for DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks
Figure 2 for DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks
Figure 3 for DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks
Figure 4 for DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks
Viaarxiv icon

Learning Question-Guided Video Representation for Multi-Turn Video Question Answering

Add code
Jul 31, 2019
Figure 1 for Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Figure 2 for Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Figure 3 for Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Figure 4 for Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Viaarxiv icon

BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer

Add code
Jul 05, 2019
Figure 1 for BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
Figure 2 for BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
Figure 3 for BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
Figure 4 for BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
Viaarxiv icon

Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments

Add code
Jun 13, 2019
Figure 1 for Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments
Figure 2 for Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments
Figure 3 for Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments
Figure 4 for Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments
Viaarxiv icon

City-Identification of Flickr Videos Using Semantic Acoustic Features

Add code
Jul 12, 2016
Figure 1 for City-Identification of Flickr Videos Using Semantic Acoustic Features
Figure 2 for City-Identification of Flickr Videos Using Semantic Acoustic Features
Figure 3 for City-Identification of Flickr Videos Using Semantic Acoustic Features
Figure 4 for City-Identification of Flickr Videos Using Semantic Acoustic Features
Viaarxiv icon