Michael R. Metel

Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression

Dec 07, 2024
Via arXiv

Draft on the Fly: Adaptive Self-Speculative Decoding using Cosine Similarity

Oct 01, 2024
Via arXiv

Mathematical Challenges in Deep Learning

Mar 24, 2023
Via arXiv

Variants of SGD for Lipschitz Continuous Loss Functions in Low-Precision Environments

Nov 09, 2022
Via arXiv