Picture for Daoxin Zhang

Daoxin Zhang

VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models

Add code
Mar 10, 2025
Viaarxiv icon

Bridging the Emotional Semantic Gap via Multimodal Relevance Estimation

Add code
Feb 03, 2023
Viaarxiv icon

Salient Object Ranking with Position-Preserved Attention

Add code
Jun 10, 2021
Figure 1 for Salient Object Ranking with Position-Preserved Attention
Figure 2 for Salient Object Ranking with Position-Preserved Attention
Figure 3 for Salient Object Ranking with Position-Preserved Attention
Figure 4 for Salient Object Ranking with Position-Preserved Attention
Viaarxiv icon

Horizontal-to-Vertical Video Conversion

Add code
Jan 11, 2021
Figure 1 for Horizontal-to-Vertical Video Conversion
Figure 2 for Horizontal-to-Vertical Video Conversion
Figure 3 for Horizontal-to-Vertical Video Conversion
Figure 4 for Horizontal-to-Vertical Video Conversion
Viaarxiv icon

Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection

Add code
Mar 22, 2018
Figure 1 for Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection
Figure 2 for Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection
Figure 3 for Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection
Figure 4 for Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection
Viaarxiv icon