Torsten Hoefler

HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning

Jan 05, 2025
Via arXiv

EfQAT: An Efficient Framework for Quantization-Aware Training

Nov 17, 2024
Via arXiv

All models are wrong, some are useful: Model Selection with Limited Labels

Oct 17, 2024
Via arXiv

Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud

Oct 08, 2024
Via arXiv

Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects

Aug 26, 2024
Via arXiv

Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments

Aug 22, 2024
Via arXiv

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models

Aug 21, 2024
Via arXiv

Demystifying Higher-Order Graph Neural Networks

Jun 18, 2024
Via arXiv

Multi-Head RAG: Solving Multi-Aspect Problems with LLMs

Jun 07, 2024
Via arXiv

CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks

Jun 04, 2024
Via arXiv