Lanyun Zhu

HD-VGGT: High-Resolution Visual Geometry Transformer

Mar 28, 2026
Via arXiv

PreSight: Preoperative Outcome Prediction for Parkinson's Disease via Region-Prior Morphometry and Patient-Specific Weighting

Mar 02, 2026
Via arXiv

StreamSense: Streaming Social Task Detection with Selective Vision-Language Model Routing

Jan 30, 2026
Via arXiv

Multi-Agent VLMs Guided Self-Training with PNU Loss for Low-Resource Offensive Content Detection

Nov 14, 2025
Via arXiv

SID: Multi-LLM Debate Driven by Self Signals

Oct 08, 2025
Via arXiv

Unlocking the Power of SAM 2 for Few-Shot Segmentation

May 21, 2025
Via arXiv

From Air to Wear: Personalized 3D Digital Fashion with AR/VR Immersive 3D Sketching

May 15, 2025
Via arXiv

POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation

Apr 01, 2025
Via arXiv

Breaking the Box: Enhancing Remote Sensing Image Segmentation with Freehand Sketches

Mar 15, 2025
Via arXiv

Syllables to Scenes: Literary-Guided Free-Viewpoint 3D Scene Synthesis from Japanese Haiku

Feb 17, 2025
Via arXiv