Shutao Li

Fellow, IEEE

DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection

Oct 23, 2024

Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble

Jun 18, 2024

DrVideo: Document Retrieval Based Long Video Understanding

Jun 18, 2024

Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation

Mar 20, 2024

GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering

Feb 04, 2024

Hyperspectral Image Fusion via Logarithmic Low-rank Tensor Ring Decomposition

Oct 16, 2023

VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation

Jun 11, 2023

AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation

May 07, 2023

LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition

May 05, 2023

Learning to Locate Visual Answer in Video Corpus Using Question

Oct 11, 2022