Picture for Zirui Wang

Zirui Wang

Towards Adaptable Humanoid Control via Adaptive Motion Tracking

Add code
Oct 16, 2025
Viaarxiv icon

COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization

Add code
Oct 08, 2025
Viaarxiv icon

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Add code
Sep 19, 2025
Viaarxiv icon

YOLO-Count: Differentiable Object Counting for Text-to-Image Generation

Add code
Aug 01, 2025
Viaarxiv icon

UniTracker: Learning Universal Whole-Body Motion Tracker for Humanoid Robots

Add code
Jul 10, 2025
Viaarxiv icon

Active View Selector: Fast and Accurate Active View Selection with Cross Reference Image Quality Assessment

Add code
Jun 24, 2025
Viaarxiv icon

Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset

Add code
Jun 04, 2025
Viaarxiv icon

VEAttack: Downstream-agnostic Vision Encoder Attack against Large Vision Language Models

Add code
May 23, 2025
Viaarxiv icon

DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models

Add code
May 13, 2025
Viaarxiv icon

DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution

Add code
Mar 03, 2025
Figure 1 for DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution
Figure 2 for DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution
Figure 3 for DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution
Figure 4 for DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution
Viaarxiv icon