Picture for Yuhao Cui

Yuhao Cui

CASEIN: Cascading Explicit and Implicit Control for Fine-grained Emotion Intensity Regulation

Add code
Jun 27, 2023
Viaarxiv icon

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

Add code
Aug 16, 2021
Figure 1 for ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
Figure 2 for ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
Figure 3 for ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
Figure 4 for ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
Viaarxiv icon

Deep Multimodal Neural Architecture Search

Add code
Apr 25, 2020
Figure 1 for Deep Multimodal Neural Architecture Search
Figure 2 for Deep Multimodal Neural Architecture Search
Figure 3 for Deep Multimodal Neural Architecture Search
Figure 4 for Deep Multimodal Neural Architecture Search
Viaarxiv icon

Multimodal Unified Attention Networks for Vision-and-Language Interactions

Add code
Aug 19, 2019
Figure 1 for Multimodal Unified Attention Networks for Vision-and-Language Interactions
Figure 2 for Multimodal Unified Attention Networks for Vision-and-Language Interactions
Figure 3 for Multimodal Unified Attention Networks for Vision-and-Language Interactions
Figure 4 for Multimodal Unified Attention Networks for Vision-and-Language Interactions
Viaarxiv icon

Deep Modular Co-Attention Networks for Visual Question Answering

Add code
Jun 25, 2019
Figure 1 for Deep Modular Co-Attention Networks for Visual Question Answering
Figure 2 for Deep Modular Co-Attention Networks for Visual Question Answering
Figure 3 for Deep Modular Co-Attention Networks for Visual Question Answering
Figure 4 for Deep Modular Co-Attention Networks for Visual Question Answering
Viaarxiv icon