Picture for Pavel Stepachev

Pavel Stepachev

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Add code
Jan 25, 2026
Viaarxiv icon

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 13, 2025
Figure 1 for An Expanded Massive Multilingual Dataset for High-Performance Language Technologies
Figure 2 for An Expanded Massive Multilingual Dataset for High-Performance Language Technologies
Figure 3 for An Expanded Massive Multilingual Dataset for High-Performance Language Technologies
Figure 4 for An Expanded Massive Multilingual Dataset for High-Performance Language Technologies
Viaarxiv icon

Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models

Add code
Oct 04, 2024
Figure 1 for Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models
Figure 2 for Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models
Figure 3 for Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models
Figure 4 for Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models
Viaarxiv icon

Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation

Add code
Aug 23, 2024
Figure 1 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 2 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 3 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 4 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Viaarxiv icon