Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ethan Prihar

MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education

Jun 02, 2021

Jia Tracy Shen, Michiharu Yamashita, Ethan Prihar, Neil Heffernan, Xintao Wu, Dongwon Lee

Figure 1 for MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education

Figure 2 for MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education

Figure 3 for MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education

Figure 4 for MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education

Abstract:Due to the transfer learning nature of BERT model, researchers have achieved better performance than base BERT by further pre-training the original BERT on a huge domain-specific corpus. Due to the special nature of mathematical texts which often contain math equations and symbols, the original BERT model pre-trained on general English context will not fit Natural Language Processing (NLP) tasks in mathematical education well. Therefore, we propose MathBERT, a BERT pre-trained on large mathematical corpus including pre-k to graduate level mathematical content to tackle math-specific tasks. In addition, We generate a customized mathematical vocabulary to pre-train with MathBERT and compare the performance to the MathBERT pre-trained with the original BERT vocabulary. We select three important tasks in mathematical education such as knowledge component, auto-grading, and knowledge tracing prediction to evaluate the performance of MathBERT. Our experiments show that MathBERT outperforms the base BERT by 2-9\% margin. In some cases, MathBERT pre-trained with mathematical vocabulary is better than MathBERT trained with original vocabulary.To our best knowledge, MathBERT is the first pre-trained model for general purpose mathematics education tasks.

Via

Access Paper or Ask Questions

Classifying Math KCs via Task-Adaptive Pre-Trained BERT

May 24, 2021

Jia Tracy Shen, Michiharu Yamashita, Ethan Prihar, Neil Heffernan, Xintao Wu, Sean McGrew, Dongwon Lee

Figure 1 for Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Figure 2 for Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Figure 3 for Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Figure 4 for Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Abstract:Educational content labeled with proper knowledge components (KCs) are particularly useful to teachers or content organizers. However, manually labeling educational content is labor intensive and error-prone. To address this challenge, prior research proposed machine learning based solutions to auto-label educational content with limited success. In this work, we significantly improve prior research by (1) expanding the input types to include KC descriptions, instructional video titles, and problem descriptions (i.e., three types of prediction task), (2) doubling the granularity of the prediction from 198 to 385 KC labels (i.e., more practical setting but much harder multinomial classification problem), (3) improving the prediction accuracies by 0.5-2.3% using Task-adaptive Pre-trained BERT, outperforming six baselines, and (4) proposing a simple evaluation measure by which we can recover 56-73% of mispredicted KC labels. All codes and data sets in the experiments are available at:https://github.com/tbs17/TAPT-BERT

Via

Access Paper or Ask Questions