Picture for Khyathi Raghavi Chandu

Khyathi Raghavi Chandu

Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness

Add code
Jul 02, 2024
Viaarxiv icon

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning

Add code
Feb 23, 2024
Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Feb 07, 2024
Figure 1 for OLMo: Accelerating the Science of Language Models
Figure 2 for OLMo: Accelerating the Science of Language Models
Figure 3 for OLMo: Accelerating the Science of Language Models
Figure 4 for OLMo: Accelerating the Science of Language Models
Viaarxiv icon

Deal, or no deal ? Forecasting Uncertainty in Conversations using Large Language Models

Add code
Feb 05, 2024
Figure 1 for Deal, or no deal ? Forecasting Uncertainty in Conversations using Large Language Models
Figure 2 for Deal, or no deal ? Forecasting Uncertainty in Conversations using Large Language Models
Figure 3 for Deal, or no deal ? Forecasting Uncertainty in Conversations using Large Language Models
Figure 4 for Deal, or no deal ? Forecasting Uncertainty in Conversations using Large Language Models
Viaarxiv icon

Localized Symbolic Knowledge Distillation for Visual Commonsense Models

Add code
Dec 12, 2023
Viaarxiv icon

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources

Add code
Jun 07, 2023
Viaarxiv icon

Continual Dialogue State Tracking via Example-Guided Question Answering

Add code
May 23, 2023
Viaarxiv icon

Curriculum Script Distillation for Multilingual Visual Question Answering

Add code
Jan 17, 2023
Viaarxiv icon

Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities

Add code
Oct 30, 2022
Viaarxiv icon

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Add code
Jun 24, 2022
Figure 1 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Figure 2 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Figure 3 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Figure 4 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Viaarxiv icon