Picture for Mykola Khandoga

Mykola Khandoga

From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages

Add code
Oct 24, 2024
Viaarxiv icon

From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language Representation

Add code
Apr 14, 2024
Viaarxiv icon