Picture for Zhe Wu

Zhe Wu

1.x-Distill: Breaking the Diversity, Quality, and Efficiency Barrier in Distribution Matching Distillation

Add code
Apr 05, 2026
Viaarxiv icon

Improving Search Suggestions for Alphanumeric Queries

Add code
Apr 01, 2026
Viaarxiv icon

K^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control

Add code
Feb 28, 2026
Viaarxiv icon

Boosting Instance Awareness via Cross-View Correlation with 4D Radar and Camera for 3D Object Detection

Add code
Feb 24, 2026
Viaarxiv icon

Towards Scalable Meta-Learning of near-optimal Interpretable Models via Synthetic Model Generations

Add code
Nov 06, 2025
Viaarxiv icon

Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control

Add code
Oct 16, 2025
Viaarxiv icon

Recurrent Cross-View Object Geo-Localization

Add code
Sep 16, 2025
Viaarxiv icon

Bridging Modality Gaps in e-Commerce Products via Vision-Language Alignment

Add code
Aug 13, 2025
Viaarxiv icon

NEAR$^2$: A Nested Embedding Approach to Efficient Product Retrieval and Ranking

Add code
Jun 24, 2025
Viaarxiv icon

CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion

Add code
May 02, 2025
Figure 1 for CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
Figure 2 for CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
Figure 3 for CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
Figure 4 for CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
Viaarxiv icon