Deming Chen

Transforming the Hybrid Cloud for Emerging AI Workloads

Nov 20, 2024

New Solutions on LLM Acceleration, Optimization, and Application

Jun 16, 2024

Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context

Jun 10, 2024

SnapKV: LLM Knows What You are Looking for Before Generation

Apr 22, 2024

On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models

Apr 04, 2024

FedCore: Straggler-Free Federated Learning with Distributed Coresets

Jan 31, 2024

Subgraph Extraction-based Feedback-guided Iterative Scheduling for HLS

Jan 22, 2024

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Jan 19, 2024

What Makes Convolutional Models Great on Long Sequence Modeling?

Oct 17, 2022

Extensible Proxy for Efficient NAS

Oct 17, 2022