Picture for Ben Athiwaratkun

Ben Athiwaratkun

Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping

Add code
Jan 11, 2025
Viaarxiv icon

RedPajama: an Open Dataset for Training Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Training-Free Activation Sparsity in Large Language Models

Add code
Aug 26, 2024
Viaarxiv icon

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies

Add code
Jun 11, 2024
Figure 1 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Figure 2 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Figure 3 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Figure 4 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Viaarxiv icon

Mixture-of-Agents Enhances Large Language Model Capabilities

Add code
Jun 07, 2024
Figure 1 for Mixture-of-Agents Enhances Large Language Model Capabilities
Figure 2 for Mixture-of-Agents Enhances Large Language Model Capabilities
Figure 3 for Mixture-of-Agents Enhances Large Language Model Capabilities
Figure 4 for Mixture-of-Agents Enhances Large Language Model Capabilities
Viaarxiv icon

Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model

Add code
Jun 03, 2024
Figure 1 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 2 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 3 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 4 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Viaarxiv icon

Bifurcated Attention for Single-Context Large-Batch Sampling

Add code
Mar 13, 2024
Viaarxiv icon

Token Alignment via Character Matching for Subword Completion

Add code
Mar 13, 2024
Figure 1 for Token Alignment via Character Matching for Subword Completion
Figure 2 for Token Alignment via Character Matching for Subword Completion
Figure 3 for Token Alignment via Character Matching for Subword Completion
Figure 4 for Token Alignment via Character Matching for Subword Completion
Viaarxiv icon

Greener yet Powerful: Taming Large Code Generation Models with Quantization

Add code
Mar 09, 2023
Figure 1 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Figure 2 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Figure 3 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Figure 4 for Greener yet Powerful: Taming Large Code Generation Models with Quantization
Viaarxiv icon

Multi-lingual Evaluation of Code Generation Models

Add code
Oct 26, 2022
Figure 1 for Multi-lingual Evaluation of Code Generation Models
Figure 2 for Multi-lingual Evaluation of Code Generation Models
Figure 3 for Multi-lingual Evaluation of Code Generation Models
Figure 4 for Multi-lingual Evaluation of Code Generation Models
Viaarxiv icon