Siyuan Zhuang

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Oct 16, 2024

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Sep 30, 2023

Efficient Memory Management for Large Language Model Serving with PagedAttention

Sep 12, 2023

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Jun 09, 2023

TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models

Feb 16, 2021

Hoplite: Efficient Collective Communication for Task-Based Distributed Systems

Feb 13, 2020