Fuhai Chen

Multi-View Incremental Learning with Structured Hebbian Plasticity for Enhanced Fusion Efficiency

Dec 17, 2024

Multimodal Sentiment Analysis Based on Causal Reasoning

Dec 10, 2024

Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning

Aug 01, 2024

3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

Apr 26, 2024

Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Oct 17, 2022

Global2Local: A Joint-Hierarchical Attention for Video Captioning

Mar 13, 2022

Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation

Mar 12, 2022

Differentiated Relevances Embedding for Group-based Referring Expression Comprehension

Mar 12, 2022

Weakly-Supervised Dense Action Anticipation

Nov 15, 2021

Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval

Aug 05, 2021