Owen Henkel

Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy

Sep 26, 2024
Via arXiv

Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability to Mark Short Answer Questions in K-12 Education

May 05, 2024
Via arXiv

Can LLMs Grade Short-answer Reading Comprehension Questions: Foundational Literacy Assessment in LMICs

Oct 26, 2023
Via arXiv

Using State-of-the-Art Speech Models to Evaluate Oral Reading Fluency in Ghana

Oct 26, 2023
Via arXiv

Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference

Oct 04, 2023
Via arXiv