Lavinia Dunagan

You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments

Nov 16, 2023

Exploring Linguistic Style Matching in Online Communities: The Role of Social Context and Conversation Dynamics

Jul 06, 2023

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand

Dec 08, 2021

Transparent Human Evaluation for Image Captioning

Nov 17, 2021