Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:An Overview on Language Models: Recent Developments and Outlook

Mar 10, 2023

Chengwei Wei, Yun-Cheng Wang, Bin Wang, C. -C. Jay Kuo

Figure 1 for An Overview on Language Models: Recent Developments and Outlook

Figure 2 for An Overview on Language Models: Recent Developments and Outlook

Figure 3 for An Overview on Language Models: Recent Developments and Outlook

Figure 4 for An Overview on Language Models: Recent Developments and Outlook

Share this with someone who'll enjoy it:

Abstract:Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine translation, etc. Conventional language models (CLMs) aim to predict the probability of linguistic sequences in a causal manner. In contrast, pre-trained language models (PLMs) cover broader concepts and can be used in both causal sequential modeling and fine-tuning for downstream applications. PLMs have their own training paradigms (usually self-supervised) and serve as foundation models in modern NLP systems. This overview paper provides an introduction to both CLMs and PLMs from five aspects, i.e., linguistic units, structures, training methods, evaluation methods, and applications. Furthermore, we discuss the relationship between CLMs and PLMs and shed light on the future directions of language modeling in the pre-trained era.

View paper on

Share this with someone who'll enjoy it:

Title:An Overview on Language Models: Recent Developments and Outlook

Paper and Code