Surin Ahn

SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Dec 13, 2024
Via arXiv

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Jul 02, 2024
Via arXiv

Local Model Explanations and Uncertainty Without Model Access

Jan 24, 2023
Via arXiv

Global Multiclass Classification from Heterogeneous Local Models

May 25, 2020
Via arXiv