Torsten Hoefler

EfQAT: An Efficient Framework for Quantization-Aware Training

Nov 17, 2024
Via arXiv

All models are wrong, some are useful: Model Selection with Limited Labels

Oct 17, 2024
Via arXiv

Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud

Oct 08, 2024
Via arXiv

Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects

Aug 26, 2024
Via arXiv

Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments

Aug 22, 2024
Via arXiv

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models

Aug 21, 2024
Via arXiv

Demystifying Higher-Order Graph Neural Networks

Jun 18, 2024
Via arXiv

Multi-Head RAG: Solving Multi-Aspect Problems with LLMs

Jun 07, 2024
Via arXiv

CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks

Jun 04, 2024
Via arXiv

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Mar 30, 2024
Via arXiv