Harikrishna Narasimhan

Universal Model Routing for Efficient LLM Inference

Feb 12, 2025 (via arXiv)

Cascade-Aware Training of Language Models

May 29, 2024 (via arXiv)

Faster Cascades via Speculative Decoding

May 29, 2024 (via arXiv)

Language Model Cascades: Token-level uncertainty and beyond

Apr 15, 2024 (via arXiv)

Metric-aware LLM inference

Mar 07, 2024 (via arXiv)

Distributionally Robust Post-hoc Classifiers under Prior Shifts

Sep 16, 2023 (via arXiv)

When Does Confidence-Based Cascade Deferral Suffice?

Jul 06, 2023 (via arXiv)

Learning to reject meets OOD detection: Are all abstentions created equal?

Jan 31, 2023 (via arXiv)

Consistent Multiclass Algorithms for Complex Metrics and Constraints

Oct 19, 2022 (via arXiv)

Robust Distillation for Worst-class Performance

Jun 13, 2022 (via arXiv)