Borislav Kozlovskii

Multilingual End to End Entity Linking

Jun 15, 2023

Mikhail Plekhanov, Nora Kassner, Kashyap Popat, Louis Martin, Simone Merello, Borislav Kozlovskii, Frédéric A. Dreyer, Nicola Cancedda

Abstract: Entity Linking is one of the most common Natural Language Processing tasks in practical applications, but so far efficient end-to-end solutions with multilingual coverage have been lacking, leading to complex model stacks. To fill this gap, we release and open source BELA, the first fully end-to-end multilingual entity linking model that efficiently detects and links entities in texts in any of 97 languages. We provide here a detailed description of the model and report BELA's performance on four entity linking datasets covering high- and low-resource languages.

Fine-Tuning Transformers: Vocabulary Transfer

Dec 29, 2021

Igor Samenko, Alexey Tikhonov, Borislav Kozlovskii, Ivan P. Yamshchikov

Abstract: Transformers are responsible for the vast majority of recent advances in natural language processing. Most practical natural language processing applications of these models are enabled through transfer learning. This paper studies whether corpus-specific tokenization used for fine-tuning improves the resulting performance of the model. Through a series of experiments, we demonstrate that such tokenization, combined with the initialization and fine-tuning strategy for the vocabulary tokens, speeds up the transfer and boosts the performance of the fine-tuned model. We call this aspect of transfer facilitation vocabulary transfer.
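The abstract does not spell out the initialization strategy, so the following is only an illustrative sketch of one plausible way to initialize embeddings for a new, corpus-specific vocabulary: copy the vector when the token already exists in the old vocabulary, otherwise average the vectors of the old sub-tokens it splits into. The helper names (`greedy_tokenize`, `transfer_embeddings`), the toy vocabulary, and the zero-vector fallback are all assumptions made for this example, not the authors' method.

```python
from typing import Dict, List

Vector = List[float]


def greedy_tokenize(word: str, vocab: Dict[str, Vector]) -> List[str]:
    """Toy stand-in for the old tokenizer: greedy longest-match over `vocab`.
    Characters that match nothing are emitted as single-character pieces."""
    pieces: List[str] = []
    i = 0
    while i < len(word):
        for j in range(len(word), i, -1):
            if word[i:j] in vocab:
                pieces.append(word[i:j])
                i = j
                break
        else:
            pieces.append(word[i])
            i += 1
    return pieces


def mean_vector(vectors: List[Vector]) -> Vector:
    """Element-wise mean of equally sized vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def transfer_embeddings(
    new_vocab: List[str], old_embeddings: Dict[str, Vector], dim: int
) -> Dict[str, Vector]:
    """Build an embedding table for `new_vocab` from the old table (assumed heuristic)."""
    new_embeddings: Dict[str, Vector] = {}
    for token in new_vocab:
        if token in old_embeddings:
            # Token exists in both vocabularies: reuse its vector unchanged.
            new_embeddings[token] = list(old_embeddings[token])
            continue
        # Otherwise split it with the old tokenizer and keep the known pieces.
        known = [p for p in greedy_tokenize(token, old_embeddings) if p in old_embeddings]
        if known:
            new_embeddings[token] = mean_vector([old_embeddings[p] for p in known])
        else:
            # Assumed fallback: zeros here; random initialization is another option.
            new_embeddings[token] = [0.0] * dim
    return new_embeddings


# Hypothetical two-dimensional toy data, for illustration only.
old_embeddings = {"token": [1.0, 0.0], "ize": [0.0, 1.0], "r": [2.0, 2.0]}
new_vocab = ["token", "tokenize", "tokenizer", "xyz"]
new_embeddings = transfer_embeddings(new_vocab, old_embeddings, dim=2)
```

In a real setup the resulting table would replace the model's input embedding matrix before fine-tuning on the target corpus; how the remaining weights are then trained is a separate choice that this sketch does not cover.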
