Yonghui Wang

AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding

Aug 30, 2024

LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation

Aug 25, 2024

SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection

Aug 07, 2024

TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding

Apr 15, 2024

Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs

Nov 22, 2023

Progressive Recurrent Network for Shadow Removal

Nov 01, 2023

Detect Any Shadow: Segment Anything for Video Shadow Detection

May 26, 2023

UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior

Oct 15, 2022