Picture for Josiah Wang

Josiah Wang

MultiSubs: A Large-scale Multimodal and Multilingual Dataset

Add code
Mar 02, 2021
Figure 1 for MultiSubs: A Large-scale Multimodal and Multilingual Dataset
Figure 2 for MultiSubs: A Large-scale Multimodal and Multilingual Dataset
Figure 3 for MultiSubs: A Large-scale Multimodal and Multilingual Dataset
Figure 4 for MultiSubs: A Large-scale Multimodal and Multilingual Dataset
Viaarxiv icon

Transformer-based Cascaded Multimodal Speech Translation

Add code
Nov 08, 2019
Figure 1 for Transformer-based Cascaded Multimodal Speech Translation
Figure 2 for Transformer-based Cascaded Multimodal Speech Translation
Figure 3 for Transformer-based Cascaded Multimodal Speech Translation
Figure 4 for Transformer-based Cascaded Multimodal Speech Translation
Viaarxiv icon

Imperial College London Submission to VATEX Video Captioning Task

Add code
Oct 16, 2019
Figure 1 for Imperial College London Submission to VATEX Video Captioning Task
Figure 2 for Imperial College London Submission to VATEX Video Captioning Task
Figure 3 for Imperial College London Submission to VATEX Video Captioning Task
Figure 4 for Imperial College London Submission to VATEX Video Captioning Task
Viaarxiv icon

Phrase Localization Without Paired Training Examples

Add code
Aug 20, 2019
Figure 1 for Phrase Localization Without Paired Training Examples
Figure 2 for Phrase Localization Without Paired Training Examples
Figure 3 for Phrase Localization Without Paired Training Examples
Figure 4 for Phrase Localization Without Paired Training Examples
Viaarxiv icon

Predicting Actions to Help Predict Translations

Add code
Aug 18, 2019
Figure 1 for Predicting Actions to Help Predict Translations
Figure 2 for Predicting Actions to Help Predict Translations
Figure 3 for Predicting Actions to Help Predict Translations
Figure 4 for Predicting Actions to Help Predict Translations
Viaarxiv icon

VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions

Add code
Jul 22, 2019
Figure 1 for VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Figure 2 for VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Figure 3 for VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Figure 4 for VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Viaarxiv icon

End-to-end Image Captioning Exploits Multimodal Distributional Similarity

Add code
Sep 11, 2018
Figure 1 for End-to-end Image Captioning Exploits Multimodal Distributional Similarity
Viaarxiv icon

Defoiling Foiled Image Captions

Add code
May 16, 2018
Figure 1 for Defoiling Foiled Image Captions
Figure 2 for Defoiling Foiled Image Captions
Figure 3 for Defoiling Foiled Image Captions
Figure 4 for Defoiling Foiled Image Captions
Viaarxiv icon

Object Counts! Bringing Explicit Detections Back into Image Captioning

Add code
Apr 23, 2018
Figure 1 for Object Counts! Bringing Explicit Detections Back into Image Captioning
Figure 2 for Object Counts! Bringing Explicit Detections Back into Image Captioning
Figure 3 for Object Counts! Bringing Explicit Detections Back into Image Captioning
Figure 4 for Object Counts! Bringing Explicit Detections Back into Image Captioning
Viaarxiv icon

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Add code
Mar 13, 2018
Figure 1 for Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
Figure 2 for Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
Figure 3 for Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
Figure 4 for Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
Viaarxiv icon