Picture for Owen Henkel

Owen Henkel

Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy

Add code
Sep 26, 2024
Figure 1 for Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy
Figure 2 for Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy
Figure 3 for Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy
Figure 4 for Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy
Viaarxiv icon

Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability to Mark Short Answer Questions in K-12 Education

Add code
May 05, 2024
Figure 1 for Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability to Mark Short Answer Questions in K-12 Education
Figure 2 for Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability to Mark Short Answer Questions in K-12 Education
Viaarxiv icon

Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs

Add code
Oct 26, 2023
Figure 1 for Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs
Figure 2 for Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs
Figure 3 for Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs
Figure 4 for Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs
Viaarxiv icon

Using State-of-the-Art Speech Models to Evaluate Oral Reading Fluency in Ghana

Add code
Oct 26, 2023
Viaarxiv icon

Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference

Add code
Oct 04, 2023
Figure 1 for Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference
Figure 2 for Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference
Figure 3 for Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference
Figure 4 for Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference
Viaarxiv icon