Picture for Longwei Zou

Longwei Zou

CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers

Add code
Apr 10, 2024
Viaarxiv icon

A Multi-Level Framework for Accelerating Training Transformer Models

Add code
Apr 07, 2024
Viaarxiv icon