Picture for Sai Munikoti

Sai Munikoti

Surprisingly Fragile: Assessing and Addressing Prompt Instability in Multimodal Foundation Models

Add code
Aug 26, 2024
Viaarxiv icon

PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain

Add code
Aug 21, 2024
Figure 1 for PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain
Figure 2 for PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain
Figure 3 for PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain
Figure 4 for PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain
Viaarxiv icon

RAG vs. Long Context: Examining Frontier Large Language Models for Environmental Review Document Comprehension

Add code
Jul 10, 2024
Viaarxiv icon

Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities

Add code
Jun 08, 2024
Viaarxiv icon

ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science

Add code
Nov 21, 2023
Viaarxiv icon

Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science

Add code
Nov 15, 2023
Figure 1 for Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science
Figure 2 for Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science
Figure 3 for Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science
Figure 4 for Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science
Viaarxiv icon

Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning

Add code
Nov 07, 2023
Viaarxiv icon

NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain

Add code
Oct 17, 2023
Figure 1 for NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain
Figure 2 for NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain
Figure 3 for NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain
Figure 4 for NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain
Viaarxiv icon

SCITUNE: Aligning Large Language Models with Scientific Multimodal Instructions

Add code
Jul 03, 2023
Viaarxiv icon

A General Framework for Uncertainty Quantification via Neural SDE-RNN

Add code
Jun 01, 2023
Viaarxiv icon