Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marta Kiatkowska

Emergent Linguistic Structures in Neural Networks are Fragile

Nov 15, 2022

Emanuele La Malfa, Matthew Wicker, Marta Kiatkowska

Figure 1 for Emergent Linguistic Structures in Neural Networks are Fragile

Figure 2 for Emergent Linguistic Structures in Neural Networks are Fragile

Figure 3 for Emergent Linguistic Structures in Neural Networks are Fragile

Figure 4 for Emergent Linguistic Structures in Neural Networks are Fragile

Abstract:Large language models (LLMs) have been reported to have strong performance on natural language processing tasks. However, performance metrics such as accuracy do not measure the quality of the model in terms of its ability to robustly represent complex linguistic structure. In this work, we propose a framework to evaluate the robustness of linguistic representations using probing tasks. We leverage recent advances in extracting emergent linguistic constructs from LLMs and apply syntax-preserving perturbations to test the stability of these constructs in order to better understand the representations learned by LLMs. Empirically, we study the performance of four LLMs across six different corpora on the proposed robustness measures. We provide evidence that context-free representation (e.g., GloVe) are in some cases competitive with context-dependent representations from modern LLMs (e.g., BERT), yet equally brittle to syntax-preserving manipulations. Emergent syntactic representations in neural networks are brittle, thus our work poses the attention on the risk of comparing such structures to those that are object of a long lasting debate in linguistics.

Via

Access Paper or Ask Questions