Wittawat Jitkrittum

I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning

Feb 26, 2025
Via arXiv

Universal Model Routing for Efficient LLM Inference

Feb 12, 2025
Via arXiv

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs

Oct 24, 2024
Via arXiv

Faster Cascades via Speculative Decoding

May 29, 2024
Via arXiv

Cascade-Aware Training of Language Models

May 29, 2024
Via arXiv

Language Model Cascades: Token-level uncertainty and beyond

Apr 15, 2024
Via arXiv

It's an Alignment, Not a Trade-off: Revisiting Bias and Variance in Deep Models

Oct 13, 2023
Via arXiv

When Does Confidence-Based Cascade Deferral Suffice?

Jul 06, 2023
Via arXiv

Learning to reject meets OOD detection: Are all abstentions created equal?

Jan 31, 2023
Via arXiv

EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval

Jan 27, 2023
Via arXiv