Picture for Pavel Stepachev

Pavel Stepachev

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Add code
Jan 25, 2026
Viaarxiv icon

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 13, 2025
Viaarxiv icon

Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models

Add code
Oct 04, 2024
Viaarxiv icon

Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation

Add code
Aug 23, 2024
Figure 1 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 2 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 3 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 4 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Viaarxiv icon