Abstract: In NLP, Event Coreference Resolution (ECR) is the task of connecting event clusters that refer to the same underlying real-life event, usually via neural systems. In this work, we investigate using abductive free-text rationales (FTRs) generated by modern autoregressive LLMs as distant supervision for smaller student models for cross-document coreference (CDCR) of events. We implement novel rationale-oriented event clustering and knowledge distillation methods for event coreference scoring that leverage the enriched information in the FTRs for improved CDCR without additional annotation or expensive document clustering. Our model using coreference-specific knowledge distillation achieves state-of-the-art (SOTA) B^3 F1 on the ECB+ and GVC corpora, and we establish a new baseline on the AIDA Phase 1 corpus. Our code can be found at https://github.com/csu-signal/llama_cdcr.
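As an illustration of the kind of coreference-specific knowledge distillation this abstract describes, the sketch below trains a small pairwise student scorer against soft teacher probabilities (e.g., confidences derived from LLM rationales). This is a minimal sketch under assumed design choices: the PairwiseScorer architecture, the distillation_loss blend, and the random toy tensors are hypothetical stand-ins, not the paper's actual model or the llama_cdcr code.

import torch
import torch.nn as nn
import torch.nn.functional as F

class PairwiseScorer(nn.Module):
    """Small student model: scores whether two event mentions corefer."""
    def __init__(self, embed_dim: int = 768, hidden_dim: int = 256):
        super().__init__()
        # Input: concatenated embeddings of the two mentions plus their
        # element-wise product, a common pairwise feature scheme.
        self.mlp = nn.Sequential(
            nn.Linear(embed_dim * 3, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, m1: torch.Tensor, m2: torch.Tensor) -> torch.Tensor:
        features = torch.cat([m1, m2, m1 * m2], dim=-1)
        return self.mlp(features).squeeze(-1)  # one raw logit per pair

def distillation_loss(student_logits, teacher_probs, hard_labels, alpha=0.5):
    """Blend soft-target distillation with the gold pairwise labels."""
    soft = F.binary_cross_entropy_with_logits(student_logits, teacher_probs)
    hard = F.binary_cross_entropy_with_logits(student_logits, hard_labels)
    return alpha * soft + (1 - alpha) * hard

# Toy usage with random mention embeddings and teacher confidences.
student = PairwiseScorer()
m1, m2 = torch.randn(8, 768), torch.randn(8, 768)
teacher_probs = torch.rand(8)               # assumed LLM-derived coreference confidence
hard_labels = (teacher_probs > 0.5).float()  # placeholder gold labels for the demo
loss = distillation_loss(student(m1, m2), teacher_probs, hard_labels)
loss.backward()

The pairwise scores produced by such a student would then feed a clustering step; the alpha weight simply trades off how much the student imitates the teacher versus the gold annotation.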
Abstract: Multilingual Language Models (MLLMs) exhibit robust cross-lingual transfer capabilities, that is, the ability to leverage information acquired in a source language and apply it to a target language. These capabilities find practical applications in well-established Natural Language Processing (NLP) tasks such as Named Entity Recognition (NER). This study investigates how effectively knowledge from a source language transfers to a target language, particularly when the input test set is perturbed. We evaluate 13 pairs of languages, each comprising one high-resource language (HRL) and one low-resource language (LRL) with a geographic, genetic, or borrowing relationship. We evaluate two well-known MLLMs, mBERT and XLM-R, on these pairs, in native LRL and cross-lingual transfer settings, on two tasks, under a set of different perturbations. Our findings indicate that NER cross-lingual transfer depends largely on the overlap of entity chunks: if a source and target language have more entities in common, transfer is stronger. Models using cross-lingual transfer also appear somewhat more robust to certain input perturbations, perhaps indicating an ability to leverage stronger representations derived from the HRL. Our research provides valuable insights into cross-lingual transfer and its implications for NLP applications, and underscores the need to consider linguistic nuances and potential limitations when employing MLLMs across distinct languages.
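To make the entity-overlap finding concrete, here is a minimal sketch of one way such overlap could be measured: extract entity chunks from BIO-tagged data and compute what share of the distinct target-language (LRL) test entities also occur in the source-language (HRL) training data. The extract_entities and entity_overlap helpers and the toy token sequences are illustrative assumptions, not the study's exact measurement protocol.

from typing import Iterable, List, Tuple

def extract_entities(bio_tagged: Iterable[Tuple[str, str]]) -> List[str]:
    """Collect entity chunk strings from a BIO-tagged token sequence."""
    entities, current = [], []
    for token, tag in bio_tagged:
        if tag.startswith("B-"):
            if current:
                entities.append(" ".join(current))
            current = [token]
        elif tag.startswith("I-") and current:
            current.append(token)
        else:
            if current:
                entities.append(" ".join(current))
            current = []
    if current:
        entities.append(" ".join(current))
    return entities

def entity_overlap(source_entities: List[str], target_entities: List[str]) -> float:
    """Coverage: share of distinct target entities also seen in the source data."""
    src, tgt = set(source_entities), set(target_entities)
    return len(src & tgt) / len(tgt) if tgt else 0.0

# Toy example: a named entity shared by a related HRL/LRL pair.
hrl_train = [("Berlin", "B-LOC"), ("ist", "O"), ("gross", "O")]
lrl_test = [("Berlin", "B-LOC"), ("is", "O"), ("grut", "O")]
print(entity_overlap(extract_entities(hrl_train), extract_entities(lrl_test)))  # 1.0

Under the abstract's finding, higher values of this kind of coverage statistic would be expected to correlate with stronger NER transfer from the HRL to the LRL.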