Kurt Keutzer

Squeezed Attention: Accelerating Long Context Length LLM Inference

Nov 14, 2024

Stochastic Communication Avoidance for Recommendation Systems

Nov 03, 2024

DQRM: Deep Quantized Recommendation Models

Oct 26, 2024

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Oct 25, 2024

PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views

Oct 24, 2024

UniDrive: Towards Universal Driving Perception Across Camera Configurations

Oct 17, 2024

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference

Oct 06, 2024

One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks

Sep 20, 2024

RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning

Sep 05, 2024

Efficient and Scalable Estimation of Tool Representations in Vector Space

Sep 02, 2024