Rui-Wei Zhao

MOSMOS: Multi-organ segmentation facilitated by medical report supervision

Sep 04, 2024
Via arXiv

Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images

Dec 01, 2022
Via arXiv

Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation

Jun 21, 2022
Via arXiv

CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping

May 27, 2022
Via arXiv

Self-Supervised Video Representation Learning with Motion-Contrastive Perception

Apr 10, 2022
Via arXiv

MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing

Nov 24, 2021
Via arXiv

Evaluating Two-Stream CNN for Video Classification

Apr 08, 2015
Via arXiv