Picture for Keshav Santhanam

Keshav Santhanam

ALTO: An Efficient Network Orchestrator for Compound AI Systems

Add code
Mar 07, 2024
Viaarxiv icon

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Add code
Oct 05, 2023
Viaarxiv icon

Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs

Add code
May 03, 2023
Viaarxiv icon

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

Add code
Mar 01, 2023
Viaarxiv icon

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

Add code
Dec 28, 2022
Viaarxiv icon

Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

Add code
Dec 02, 2022
Viaarxiv icon

Holistic Evaluation of Language Models

Add code
Nov 16, 2022
Viaarxiv icon

PLAID: An Efficient Engine for Late Interaction Retrieval

Add code
May 19, 2022
Figure 1 for PLAID: An Efficient Engine for Late Interaction Retrieval
Figure 2 for PLAID: An Efficient Engine for Late Interaction Retrieval
Figure 3 for PLAID: An Efficient Engine for Late Interaction Retrieval
Figure 4 for PLAID: An Efficient Engine for Late Interaction Retrieval
Viaarxiv icon

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction

Add code
Dec 16, 2021
Figure 1 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Figure 2 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Figure 3 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Figure 4 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Viaarxiv icon

DistIR: An Intermediate Representation and Simulator for Efficient Neural Network Distribution

Add code
Nov 09, 2021
Figure 1 for DistIR: An Intermediate Representation and Simulator for Efficient Neural Network Distribution
Figure 2 for DistIR: An Intermediate Representation and Simulator for Efficient Neural Network Distribution
Figure 3 for DistIR: An Intermediate Representation and Simulator for Efficient Neural Network Distribution
Figure 4 for DistIR: An Intermediate Representation and Simulator for Efficient Neural Network Distribution
Viaarxiv icon