Saurav Sahay

Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models

Aug 07, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv

Zero-shot Conversational Summarization Evaluations with small Large Language Models

Nov 29, 2023
Via arXiv

Learning from Red Teaming: Gender Bias Provocation and Mitigation in Large Language Models

Oct 17, 2023
Via arXiv

Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home

Jun 01, 2023
Via arXiv

Sample Efficient Multimodal Semantic Augmentation for Incremental Summarization

Mar 08, 2023
Via arXiv

Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue

Feb 12, 2023
Via arXiv

General Framework for Self-Supervised Model Priming for Parameter-Efficient Fine-tuning

Dec 02, 2022
Via arXiv

End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics

Nov 07, 2022
Via arXiv

Human in the loop approaches in multi-modal conversational task guidance system development

Nov 03, 2022
Via arXiv