Picture for Zhanyu Ma

Zhanyu Ma

PolGS++: Physically-Guided Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction

Add code
Mar 11, 2026
Viaarxiv icon

EvalMVX: A Unified Benchmarking for Neural 3D Reconstruction under Diverse Multiview Setups

Add code
Mar 04, 2026
Viaarxiv icon

Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding

Add code
Mar 04, 2026
Viaarxiv icon

Generative Visual Chain-of-Thought for Image Editing

Add code
Mar 02, 2026
Viaarxiv icon

Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing

Add code
Mar 02, 2026
Viaarxiv icon

Hepato-LLaVA: An Expert MLLM with Sparse Topo-Pack Attention for Hepatocellular Pathology Analysis on Whole Slide Images

Add code
Feb 26, 2026
Viaarxiv icon

Geometric Image Editing via Effects-Sensitive In-Context Inpainting with Diffusion Transformers

Add code
Feb 09, 2026
Viaarxiv icon

State Rank Dynamics in Linear Attention LLMs

Add code
Feb 02, 2026
Viaarxiv icon

LTS-VoiceAgent: A Listen-Think-Speak Framework for Efficient Streaming Voice Interaction via Semantic Triggering and Incremental Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

Near-Light Color Photometric Stereo for mono-Chromaticity non-lambertian surface

Add code
Jan 19, 2026
Viaarxiv icon