Anand Kannappan

Lynx: An Open Source Hallucination Evaluation Model

Jul 11, 2024
Via arXiv

FinanceBench: A New Benchmark for Financial Question Answering

Nov 20, 2023
Via arXiv

SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models

Nov 14, 2023
Via arXiv