Picture for Jiaming Xu

Jiaming Xu

SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting

Add code
Apr 11, 2025
Viaarxiv icon

Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models

Add code
Mar 30, 2025
Viaarxiv icon

Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective

Add code
Oct 06, 2024
Figure 1 for Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Figure 2 for Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Figure 3 for Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Figure 4 for Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Viaarxiv icon

MARCA: Mamba Accelerator with ReConfigurable Architecture

Add code
Sep 16, 2024
Figure 1 for MARCA: Mamba Accelerator with ReConfigurable Architecture
Figure 2 for MARCA: Mamba Accelerator with ReConfigurable Architecture
Figure 3 for MARCA: Mamba Accelerator with ReConfigurable Architecture
Figure 4 for MARCA: Mamba Accelerator with ReConfigurable Architecture
Viaarxiv icon

Collaborative Learning with Shared Linear Representations: Statistical Rates and Optimal Algorithms

Add code
Sep 07, 2024
Viaarxiv icon

A Survey on Efficient Inference for Large Language Models

Add code
Apr 22, 2024
Viaarxiv icon

Enabling Fast 2-bit LLM on GPUs: Memory Alignment, Sparse Outlier, and Asynchronous Dequantization

Add code
Nov 28, 2023
Viaarxiv icon

FlashDecoding++: Faster Large Language Model Inference on GPUs

Add code
Nov 10, 2023
Figure 1 for FlashDecoding++: Faster Large Language Model Inference on GPUs
Figure 2 for FlashDecoding++: Faster Large Language Model Inference on GPUs
Figure 3 for FlashDecoding++: Faster Large Language Model Inference on GPUs
Figure 4 for FlashDecoding++: Faster Large Language Model Inference on GPUs
Viaarxiv icon

Federated Learning in the Presence of Adversarial Client Unavailability

Add code
May 31, 2023
Viaarxiv icon

Random graph matching at Otter's threshold via counting chandeliers

Add code
Sep 25, 2022
Figure 1 for Random graph matching at Otter's threshold via counting chandeliers
Figure 2 for Random graph matching at Otter's threshold via counting chandeliers
Figure 3 for Random graph matching at Otter's threshold via counting chandeliers
Figure 4 for Random graph matching at Otter's threshold via counting chandeliers
Viaarxiv icon