Picture for Anna Korhonen

Anna Korhonen

SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists

Add code
Aug 30, 2024
Viaarxiv icon

Can Rule-Based Insights Enhance LLMs for Radiology Report Classification? Introducing the RadPrompt Methodology

Add code
Aug 07, 2024
Viaarxiv icon

TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish

Add code
Jul 17, 2024
Figure 1 for TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish
Figure 2 for TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish
Figure 3 for TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish
Figure 4 for TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish
Viaarxiv icon

"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?

Add code
Jun 25, 2024
Viaarxiv icon

Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments

Add code
Jun 17, 2024
Viaarxiv icon

Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art

Add code
Jun 06, 2024
Viaarxiv icon

TopViewRS: Vision-Language Models as Top-View Spatial Reasoners

Add code
Jun 04, 2024
Viaarxiv icon

Spectral Editing of Activations for Large Language Model Alignment

Add code
May 15, 2024
Viaarxiv icon

CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation

Add code
May 03, 2024
Viaarxiv icon

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

Add code
Mar 26, 2024
Viaarxiv icon