Picture for Charlie F. Ruan

Charlie F. Ruan

WebLLM: A High-Performance In-Browser LLM Inference Engine

Add code
Dec 20, 2024
Figure 1 for WebLLM: A High-Performance In-Browser LLM Inference Engine
Figure 2 for WebLLM: A High-Performance In-Browser LLM Inference Engine
Viaarxiv icon

XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models

Add code
Nov 22, 2024
Figure 1 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Figure 2 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Figure 3 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Figure 4 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Viaarxiv icon

Emerging Platforms Meet Emerging LLMs: A Year-Long Journey of Top-Down Development

Add code
Apr 14, 2024
Viaarxiv icon

Scale up with Order: Finding Good Data Permutations for Distributed Training

Add code
Feb 02, 2023
Figure 1 for Scale up with Order: Finding Good Data Permutations for Distributed Training
Figure 2 for Scale up with Order: Finding Good Data Permutations for Distributed Training
Figure 3 for Scale up with Order: Finding Good Data Permutations for Distributed Training
Figure 4 for Scale up with Order: Finding Good Data Permutations for Distributed Training
Viaarxiv icon