Sinan Kurtyigit

Lexical Semantic Change Discovery

Jun 06, 2021

Sinan Kurtyigit, Maike Park, Dominik Schlechtweg, Jonas Kuhn, Sabine Schulte im Walde

Abstract: While there is a large amount of research in the field of Lexical Semantic Change Detection, only a few approaches go beyond a standard benchmark evaluation of existing models. In this paper, we propose a shift of focus from change detection to change discovery, i.e., discovering novel word senses over time from the full corpus vocabulary. By heavily fine-tuning a type-based and a token-based approach on recently published German data, we demonstrate that both models can successfully be applied to discover new words undergoing meaning change. Furthermore, we provide an almost fully automated framework for both evaluation and discovery.

* ACL 2021, 9 pages

Explaining and Improving BERT Performance on Lexical Semantic Change Detection

Mar 12, 2021

Severin Laicher, Sinan Kurtyigit, Dominik Schlechtweg, Jonas Kuhn, Sabine Schulte im Walde

Abstract: Type- and token-based embedding architectures are still competing in lexical semantic change detection. The recent success of type-based models in SemEval-2020 Task 1 has raised the question of why the success of token-based models on a variety of other NLP tasks does not translate to our field. We investigate the influence of a range of variables on clusterings of BERT vectors and show that its low performance is largely due to orthographic information on the target word, which is encoded even in the higher layers of BERT representations. By reducing the influence of orthography, we considerably improve BERT's performance.

* EACL SRW, 6 pages

Effects of Pre- and Post-Processing on type-based Embeddings in Lexical Semantic Change Detection

Jan 26, 2021

Jens Kaiser, Sinan Kurtyigit, Serge Kotchourko, Dominik Schlechtweg

Abstract: Lexical semantic change detection is a new and innovative research field. The optimal fine-tuning of models, including pre- and post-processing, is largely unclear. We optimize existing models by (i) pre-training on large corpora and refining on diachronic target corpora to tackle the notorious small-data problem, and (ii) applying post-processing transformations that have been shown to improve performance on synchronic tasks. Our results provide a guide for the application and optimization of lexical semantic change detection models across various learning scenarios.

* 9 pages, 16 figures, 3 tables
