Picture for Mikko Aulamo

Mikko Aulamo

A New Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 20, 2024
Viaarxiv icon

OpusCleaner and OpusTrainer, open source toolkits for training Machine Translation and Large language models

Add code
Nov 24, 2023
Viaarxiv icon

Democratizing Machine Translation with OPUS-MT

Add code
Dec 04, 2022
Viaarxiv icon

Paraphrase Detection on Noisy Subtitles in Six Languages

Add code
Sep 21, 2018
Figure 1 for Paraphrase Detection on Noisy Subtitles in Six Languages
Figure 2 for Paraphrase Detection on Noisy Subtitles in Six Languages
Figure 3 for Paraphrase Detection on Noisy Subtitles in Six Languages
Figure 4 for Paraphrase Detection on Noisy Subtitles in Six Languages
Viaarxiv icon