Adriano Soares Koshiyama

Auditing Large Language Models for Enhanced Text-Based Stereotype Detection and Probing-Based Bias Evaluation

Apr 02, 2024
Via arXiv

Eliciting Personality Traits in Large Language Models

Feb 15, 2024
Via arXiv

Towards Auditing Large Language Models: Improving Text-based Stereotype Detection

Nov 23, 2023
Via arXiv