Nazneen Rajani

VERITAS: A Unified Approach to Reliability Evaluation

Nov 05, 2024

Self-rationalization improves LLM as a fine-grained judge

Oct 07, 2024

What's documented in AI? Systematic Analysis of 32K AI Model Cards

Feb 07, 2024

Zephyr: Direct Distillation of LM Alignment

Oct 25, 2023

Measuring Data

Dec 09, 2022

Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated Explanations

Nov 14, 2022

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022

Conformal Predictor for Improving Zero-shot Text Classification Efficiency

Oct 23, 2022

SEAL : Interactive Tool for Systematic Error Analysis and Labeling

Oct 11, 2022

Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements

Oct 06, 2022