Picture for Zijie Zhou

Zijie Zhou

PolarMem: A Training-Free Polarized Latent Graph Memory for Verifiable Multimodal Agents

Add code
Jan 31, 2026
Viaarxiv icon

Theoretically Optimal Attention/FFN Ratios in Disaggregated LLM Serving

Add code
Jan 29, 2026
Viaarxiv icon

A Universal Load Balancing Principle and Its Application to Large Language Model Serving

Add code
Jan 25, 2026
Viaarxiv icon

ICPO: Illocution-Calibrated Policy Optimization for Multi-Turn Conversation

Add code
Jan 20, 2026
Viaarxiv icon

Adaptively Robust LLM Inference Optimization under Prediction Uncertainty

Add code
Aug 20, 2025
Figure 1 for Adaptively Robust LLM Inference Optimization under Prediction Uncertainty
Figure 2 for Adaptively Robust LLM Inference Optimization under Prediction Uncertainty
Figure 3 for Adaptively Robust LLM Inference Optimization under Prediction Uncertainty
Figure 4 for Adaptively Robust LLM Inference Optimization under Prediction Uncertainty
Viaarxiv icon

LLM Serving Optimization with Variable Prefill and Decode Lengths

Add code
Aug 08, 2025
Viaarxiv icon

LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place Recognition

Add code
Apr 27, 2025
Viaarxiv icon

Industrial Internet Robot Collaboration System and Edge Computing Optimization

Add code
Apr 03, 2025
Viaarxiv icon

SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View Segmentation

Add code
Feb 28, 2025
Viaarxiv icon

Online Scheduling for LLM Inference with KV Cache Constraints

Add code
Feb 10, 2025
Figure 1 for Online Scheduling for LLM Inference with KV Cache Constraints
Figure 2 for Online Scheduling for LLM Inference with KV Cache Constraints
Figure 3 for Online Scheduling for LLM Inference with KV Cache Constraints
Figure 4 for Online Scheduling for LLM Inference with KV Cache Constraints
Viaarxiv icon