Prasanna Sattigeri

Granite Guardian

Dec 10, 2024

Graph-based Uncertainty Metrics for Long-form Language Model Outputs

Oct 28, 2024

Value Alignment from Unstructured Text

Aug 19, 2024

When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails

Jul 08, 2024

WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia

Jun 19, 2024

Interventional Causal Discovery in a Mixture of DAGs

Jun 12, 2024

Large Language Model Confidence Estimation via Black-Box Access

Jun 01, 2024

Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

May 30, 2024

The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers

Apr 03, 2024

Language Models in Dialogue: Conversational Maxims for Human-AI Interactions

Mar 22, 2024