Picture for Michael R. Metel

Michael R. Metel

Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression

Add code
Dec 07, 2024
Viaarxiv icon

Draft on the Fly: Adaptive Self-Speculative Decoding using Cosine Similarity

Add code
Oct 01, 2024
Viaarxiv icon

Mathematical Challenges in Deep Learning

Add code
Mar 24, 2023
Figure 1 for Mathematical Challenges in Deep Learning
Figure 2 for Mathematical Challenges in Deep Learning
Figure 3 for Mathematical Challenges in Deep Learning
Figure 4 for Mathematical Challenges in Deep Learning
Viaarxiv icon

Variants of SGD for Lipschitz Continuous Loss Functions in Low-Precision Environments

Add code
Nov 09, 2022
Viaarxiv icon