Picture for Werner Geyer

Werner Geyer

Granite Guardian

Add code
Dec 10, 2024
Viaarxiv icon

LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs

Add code
Oct 18, 2024
Figure 1 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Figure 2 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Figure 3 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Figure 4 for LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Viaarxiv icon

Black-box Uncertainty Quantification Method for LLM-as-a-Judge

Add code
Oct 15, 2024
Figure 1 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Figure 2 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Figure 3 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Figure 4 for Black-box Uncertainty Quantification Method for LLM-as-a-Judge
Viaarxiv icon

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

Add code
Oct 03, 2024
Figure 1 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Figure 2 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Figure 3 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Figure 4 for Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge
Viaarxiv icon

Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

Add code
May 30, 2024
Viaarxiv icon

Multi-Level Explanations for Generative Language Models

Add code
Mar 21, 2024
Figure 1 for Multi-Level Explanations for Generative Language Models
Figure 2 for Multi-Level Explanations for Generative Language Models
Figure 3 for Multi-Level Explanations for Generative Language Models
Figure 4 for Multi-Level Explanations for Generative Language Models
Viaarxiv icon

Design Principles for Generative AI Applications

Add code
Jan 25, 2024
Viaarxiv icon

Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback

Add code
May 15, 2023
Viaarxiv icon

Fairness Evaluation in Text Classification: Machine Learning Practitioner Perspectives of Individual and Group Fairness

Add code
Mar 01, 2023
Viaarxiv icon

AutoDOViz: Human-Centered Automation for Decision Optimization

Add code
Feb 19, 2023
Viaarxiv icon