Picture for Wei Shen

Wei Shen

Dereflection Any Image with Diffusion Priors and Diversified Data

Add code
Mar 21, 2025
Viaarxiv icon

Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding

Add code
Mar 18, 2025
Viaarxiv icon

A Token-level Text Image Foundation Model for Document Understanding

Add code
Mar 04, 2025
Viaarxiv icon

MDN: Mamba-Driven Dualstream Network For Medical Hyperspectral Image Segmentation

Add code
Feb 24, 2025
Viaarxiv icon

AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence

Add code
Feb 19, 2025
Viaarxiv icon

Unveiling the Mystery of Weight in Large Foundation Models: Gaussian Distribution Never Fades

Add code
Jan 18, 2025
Viaarxiv icon

Enhancing Visual Representation for Text-based Person Searching

Add code
Dec 30, 2024
Figure 1 for Enhancing Visual Representation for Text-based Person Searching
Figure 2 for Enhancing Visual Representation for Text-based Person Searching
Figure 3 for Enhancing Visual Representation for Text-based Person Searching
Figure 4 for Enhancing Visual Representation for Text-based Person Searching
Viaarxiv icon

LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors

Add code
Dec 12, 2024
Viaarxiv icon

Realistic Surgical Simulation from Monocular Videos

Add code
Dec 03, 2024
Figure 1 for Realistic Surgical Simulation from Monocular Videos
Figure 2 for Realistic Surgical Simulation from Monocular Videos
Figure 3 for Realistic Surgical Simulation from Monocular Videos
Figure 4 for Realistic Surgical Simulation from Monocular Videos
Viaarxiv icon

Technical Report for Soccernet 2023 -- Dense Video Captioning

Add code
Oct 31, 2024
Figure 1 for Technical Report for Soccernet 2023 -- Dense Video Captioning
Figure 2 for Technical Report for Soccernet 2023 -- Dense Video Captioning
Figure 3 for Technical Report for Soccernet 2023 -- Dense Video Captioning
Viaarxiv icon