Picture for Wenyu Liu

Wenyu Liu

OmniTrack: General Motion Tracking via Physics-Consistent Reference

Add code
Feb 27, 2026
Viaarxiv icon

Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning

Add code
Feb 24, 2026
Viaarxiv icon

Constructing Industrial-Scale Optimization Modeling Benchmark

Add code
Feb 11, 2026
Viaarxiv icon

TriC-Motion: Tri-Domain Causal Modeling Grounded Text-to-Motion Generation

Add code
Feb 09, 2026
Viaarxiv icon

GO-MLVTON: Garment Occlusion-Aware Multi-Layer Virtual Try-On with Diffusion Models

Add code
Jan 20, 2026
Viaarxiv icon

Cross-Layer Attentive Feature Upsampling for Low-latency Semantic Segmentation

Add code
Jan 03, 2026
Viaarxiv icon

DriveLaW:Unifying Planning and Video Generation in a Latent Driving World

Add code
Dec 31, 2025
Viaarxiv icon

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

Add code
Dec 24, 2025
Viaarxiv icon

DeltaMIL: Gated Memory Integration for Efficient and Discriminative Whole Slide Image Analysis

Add code
Dec 22, 2025
Figure 1 for DeltaMIL: Gated Memory Integration for Efficient and Discriminative Whole Slide Image Analysis
Figure 2 for DeltaMIL: Gated Memory Integration for Efficient and Discriminative Whole Slide Image Analysis
Figure 3 for DeltaMIL: Gated Memory Integration for Efficient and Discriminative Whole Slide Image Analysis
Figure 4 for DeltaMIL: Gated Memory Integration for Efficient and Discriminative Whole Slide Image Analysis
Viaarxiv icon

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Add code
Dec 09, 2025
Viaarxiv icon