Picture for David Brooks

David Brooks

FlexQuant: Elastic Quantization Framework for Locally Hosted LLM on Edge Devices

Add code
Jan 13, 2025
Viaarxiv icon

Nanoscaling Floating-Point (NxFP): NanoMantissa, Adaptive Microexponents, and Code Recycling for Direct-Cast Compression of Large Language Models

Add code
Dec 15, 2024
Viaarxiv icon

Carbon Connect: An Ecosystem for Sustainable Computing

Add code
May 22, 2024
Figure 1 for Carbon Connect: An Ecosystem for Sustainable Computing
Figure 2 for Carbon Connect: An Ecosystem for Sustainable Computing
Figure 3 for Carbon Connect: An Ecosystem for Sustainable Computing
Figure 4 for Carbon Connect: An Ecosystem for Sustainable Computing
Viaarxiv icon

Is Flash Attention Stable?

Add code
May 05, 2024
Figure 1 for Is Flash Attention Stable?
Figure 2 for Is Flash Attention Stable?
Figure 3 for Is Flash Attention Stable?
Figure 4 for Is Flash Attention Stable?
Viaarxiv icon

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

Add code
Dec 22, 2023
Figure 1 for Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Figure 2 for Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Figure 3 for Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Figure 4 for Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Viaarxiv icon

Hardware Resilience Properties of Text-Guided Image Classifiers

Add code
Dec 05, 2023
Viaarxiv icon

MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems

Add code
Oct 18, 2023
Viaarxiv icon

Guess & Sketch: Language Model Guided Transpilation

Add code
Sep 25, 2023
Viaarxiv icon

Local primordial non-Gaussianity from the large-scale clustering of photometric DESI luminous red galaxies

Add code
Jul 04, 2023
Viaarxiv icon

INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

Add code
Jun 13, 2023
Viaarxiv icon