Picture for Christopher Thomas

Christopher Thomas

Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

Add code
Nov 23, 2024
Viaarxiv icon

Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs

Add code
May 29, 2023
Viaarxiv icon

Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World

Add code
Jun 14, 2022
Figure 1 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Figure 2 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Figure 3 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Figure 4 for Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World
Viaarxiv icon

Fine-Grained Visual Entailment

Add code
Mar 29, 2022
Figure 1 for Fine-Grained Visual Entailment
Figure 2 for Fine-Grained Visual Entailment
Figure 3 for Fine-Grained Visual Entailment
Viaarxiv icon

Joint Multimedia Event Extraction from Video and Article

Add code
Sep 27, 2021
Figure 1 for Joint Multimedia Event Extraction from Video and Article
Figure 2 for Joint Multimedia Event Extraction from Video and Article
Figure 3 for Joint Multimedia Event Extraction from Video and Article
Figure 4 for Joint Multimedia Event Extraction from Video and Article
Viaarxiv icon

Learning to Transfer Visual Effects from Videos to Images

Add code
Dec 17, 2020
Figure 1 for Learning to Transfer Visual Effects from Videos to Images
Figure 2 for Learning to Transfer Visual Effects from Videos to Images
Figure 3 for Learning to Transfer Visual Effects from Videos to Images
Figure 4 for Learning to Transfer Visual Effects from Videos to Images
Viaarxiv icon

Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval

Add code
Jul 16, 2020
Figure 1 for Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval
Figure 2 for Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval
Figure 3 for Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval
Figure 4 for Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval
Viaarxiv icon

Predicting the Politics of an Image Using Webly Supervised Data

Add code
Oct 31, 2019
Figure 1 for Predicting the Politics of an Image Using Webly Supervised Data
Figure 2 for Predicting the Politics of an Image Using Webly Supervised Data
Figure 3 for Predicting the Politics of an Image Using Webly Supervised Data
Figure 4 for Predicting the Politics of an Image Using Webly Supervised Data
Viaarxiv icon

Artistic Object Recognition by Unsupervised Style Adaptation

Add code
Dec 28, 2018
Figure 1 for Artistic Object Recognition by Unsupervised Style Adaptation
Figure 2 for Artistic Object Recognition by Unsupervised Style Adaptation
Figure 3 for Artistic Object Recognition by Unsupervised Style Adaptation
Figure 4 for Artistic Object Recognition by Unsupervised Style Adaptation
Viaarxiv icon

Persuasive Faces: Generating Faces in Advertisements

Add code
Jul 25, 2018
Figure 1 for Persuasive Faces: Generating Faces in Advertisements
Figure 2 for Persuasive Faces: Generating Faces in Advertisements
Figure 3 for Persuasive Faces: Generating Faces in Advertisements
Figure 4 for Persuasive Faces: Generating Faces in Advertisements
Viaarxiv icon