Christina Giannoula

Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models

Aug 13, 2024
Via arXiv

Proteus: Preserving Model Confidentiality during Graph Optimizations

Apr 18, 2024
Via arXiv

Accelerating Graph Neural Networks on Real Processing-In-Memory Systems

Feb 26, 2024
Via arXiv

The Synergy of Speculative Decoding and Batching in Serving Large Language Models

Oct 28, 2023
Via arXiv