Picture for Yixin Dong

Yixin Dong

XGrammar 2: Dynamic and Efficient Structured Generation Engine for Agentic LLMs

Add code
Jan 07, 2026
Viaarxiv icon

FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems

Add code
Jan 01, 2026
Viaarxiv icon

Mirage Persistent Kernel: A Compiler and Runtime for Mega-Kernelizing Tensor Programs

Add code
Dec 22, 2025
Viaarxiv icon

WebLLM: A High-Performance In-Browser LLM Inference Engine

Add code
Dec 20, 2024
Figure 1 for WebLLM: A High-Performance In-Browser LLM Inference Engine
Figure 2 for WebLLM: A High-Performance In-Browser LLM Inference Engine
Viaarxiv icon

XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models

Add code
Nov 22, 2024
Figure 1 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Figure 2 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Figure 3 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Figure 4 for XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
Viaarxiv icon