Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

May 30, 2022

Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Roberto Lotufo, Rodrigo Nogueira

Figure 1 for Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

Figure 2 for Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

Figure 3 for Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

Share this with someone who'll enjoy it:

Abstract:Recent work has shown that language models scaled to billions of parameters, such as GPT-3, perform remarkably well in zero-shot and few-shot scenarios. In this work, we experiment with zero-shot models in the legal case entailment task of the COLIEE 2022 competition. Our experiments show that scaling the number of parameters in a language model improves the F1 score of our previous zero-shot result by more than 6 points, suggesting that stronger zero-shot capability may be a characteristic of larger models, at least for this task. Our 3B-parameter zero-shot model outperforms all models, including ensembles, in the COLIEE 2021 test set and also achieves the best performance of a single model in the COLIEE 2022 competition, second only to the ensemble composed of the 3B model itself and a smaller version of the same model. Despite the challenges posed by large language models, mainly due to latency constraints in real-time applications, we provide a demonstration of our zero-shot monoT5-3b model being used in production as a search engine, including for legal documents. The code for our submission and the demo of our system are available at https://github.com/neuralmind-ai/coliee and https://neuralsearchx.neuralmind.ai, respectively.

View paper on

Share this with someone who'll enjoy it:

Title:Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

Paper and Code