Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stella Douka

JuriBERT: A Masked-Language Model Adaptation for French Legal Text

Oct 04, 2021

Stella Douka, Hadi Abdine, Michalis Vazirgiannis, Rajaa El Hamdani, David Restrepo Amariles

Figure 1 for JuriBERT: A Masked-Language Model Adaptation for French Legal Text

Figure 2 for JuriBERT: A Masked-Language Model Adaptation for French Legal Text

Figure 3 for JuriBERT: A Masked-Language Model Adaptation for French Legal Text

Figure 4 for JuriBERT: A Masked-Language Model Adaptation for French Legal Text

Abstract:Language models have proven to be very useful when adapted to specific domains. Nonetheless, little research has been done on the adaptation of domain-specific BERT models in the French language. In this paper, we focus on creating a language model adapted to French legal text with the goal of helping law professionals. We conclude that some specific tasks do not benefit from generic language models pre-trained on large amounts of data. We explore the use of smaller architectures in domain-specific sub-languages and their benefits for French legal text. We prove that domain-specific pre-trained models can perform better than their equivalent generalised ones in the legal domain. Finally, we release JuriBERT, a new set of BERT models adapted to the French legal domain.

* 7 pages

Via

Access Paper or Ask Questions