Xun Zhou

Real-time Indexing for Large-scale Recommendation by Streaming Vector Quantization Retriever

Jan 15, 2025
Via arXiv

Towards Simple and Provable Parameter-Free Adaptive Gradient Methods

Dec 27, 2024
Via arXiv

WebLLM: A High-Performance In-Browser LLM Inference Engine

Dec 20, 2024
Via arXiv

LISA: Learning-Integrated Space Partitioning Framework for Traffic Accident Forecasting on Heterogeneous Spatiotemporal Data

Dec 19, 2024
Via arXiv

GeoPro-Net: Learning Interpretable Spatiotemporal Prediction Models through Statistically-Guided Geo-Prototyping

Dec 19, 2024
Via arXiv

Centaur: Bridging the Impossible Trinity of Privacy, Efficiency, and Performance in Privacy-Preserving Transformer Inference

Dec 14, 2024
Via arXiv

Ultra-Sparse Memory Network

Nov 19, 2024
Via arXiv

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Nov 15, 2024
Via arXiv

Local deployment of large-scale music AI models on commodity hardware

Nov 14, 2024
Via arXiv

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Nov 06, 2024
Via arXiv