Picture for Zhanpeng Zeng

Zhanpeng Zeng

Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation

Add code
Mar 10, 2026
Viaarxiv icon

RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction

Add code
Feb 13, 2026
Viaarxiv icon

Distribution-Aware End-to-End Embedding for Streaming Numerical Features in Click-Through Rate Prediction

Add code
Feb 03, 2026
Viaarxiv icon

Speculative Decoding Reimagined for Multimodal Large Language Models

Add code
May 20, 2025
Viaarxiv icon

LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation

Add code
Mar 11, 2025
Viaarxiv icon

Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers

Add code
Dec 21, 2024
Viaarxiv icon

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models

Add code
Jun 13, 2024
Viaarxiv icon

IM-Unpack: Training and Inference with Arbitrarily Low Precision Integers

Add code
Mar 12, 2024
Viaarxiv icon

LookupFFN: Making Transformers Compute-lite for CPU inference

Add code
Mar 12, 2024
Viaarxiv icon

FrameQuant: Flexible Low-Bit Quantization for Transformers

Add code
Mar 10, 2024
Figure 1 for FrameQuant: Flexible Low-Bit Quantization for Transformers
Figure 2 for FrameQuant: Flexible Low-Bit Quantization for Transformers
Figure 3 for FrameQuant: Flexible Low-Bit Quantization for Transformers
Figure 4 for FrameQuant: Flexible Low-Bit Quantization for Transformers
Viaarxiv icon