Saurav Sahay

QA-TOOLBOX: Conversational Question-Answering for process task guidance in manufacturing

Dec 03, 2024

Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models

Aug 07, 2024

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024

Zero-shot Conversational Summarization Evaluations with small Large Language Models

Nov 29, 2023

Learning from Red Teaming: Gender Bias Provocation and Mitigation in Large Language Models

Oct 17, 2023

Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home

Jun 01, 2023

Sample Efficient Multimodal Semantic Augmentation for Incremental Summarization

Mar 08, 2023

Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue

Feb 12, 2023

General Framework for Self-Supervised Model Priming for Parameter-Efficient Fine-tuning

Dec 02, 2022

End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics

Nov 07, 2022