David Wentzlaff

Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference

Aug 14, 2024
Via arXiv