Picture for Sanskruti Mishra

Sanskruti Mishra

Building pre-train LLM Dataset for the INDIC Languages: a case study on Hindi

Add code
Jul 13, 2024
Figure 1 for Building pre-train LLM Dataset for the INDIC Languages: a case study on Hindi
Figure 2 for Building pre-train LLM Dataset for the INDIC Languages: a case study on Hindi
Figure 3 for Building pre-train LLM Dataset for the INDIC Languages: a case study on Hindi
Figure 4 for Building pre-train LLM Dataset for the INDIC Languages: a case study on Hindi
Viaarxiv icon