Tan Jiang

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Dec 06, 2024
Via arXiv

DeVLBert: Learning Deconfounded Visio-Linguistic Representations

Aug 16, 2020
Via arXiv

Comprehensive Information Integration Modeling Framework for Video Titling

Jun 24, 2020
Via arXiv

Grounded and Controllable Image Completion by Incorporating Lexical Semantics

Feb 29, 2020
Via arXiv