Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jill Lehman

Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Nov 02, 2021

Aakanksha Naik, Jill Lehman, Carolyn Rose

Figure 1 for Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Figure 2 for Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Figure 3 for Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Figure 4 for Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Abstract:Natural language understanding (NLU) has made massive progress driven by large benchmarks, paired with research on transfer learning to broaden its impact. Benchmarks are dominated by a small set of frequent phenomena, leaving a long tail of infrequent phenomena underrepresented. In this work, we reflect on the question: have transfer learning methods sufficiently addressed performance of benchmark-trained models on the long tail? Since benchmarks do not list included/excluded phenomena, we conceptualize the long tail using macro-level dimensions such as underrepresented genres, topics, etc. We assess trends in transfer learning research through a qualitative meta-analysis of 100 representative papers on transfer learning for NLU. Our analysis asks three questions: (i) Which long tail dimensions do transfer learning studies target? (ii) Which properties help adaptation methods improve performance on the long tail? (iii) Which methodological gaps have greatest negative impact on long tail performance? Our answers to these questions highlight major avenues for future research in transfer learning for the long tail. Lastly, we present a case study comparing the performance of various adaptation methods on clinical narratives to show how systematically conducted meta-experiments can provide insights that enable us to make progress along these future avenues.

* 14 pages

Via

Access Paper or Ask Questions

Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

Aug 21, 2020

Aakanksha Naik, Jill Lehman, Carolyn Rose

Figure 1 for Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

Figure 2 for Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

Figure 3 for Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

Figure 4 for Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

Abstract:We tackle the task of adapting event extractors to new domains without labeled data, by aligning the marginal distributions of source and target domains. As a testbed, we create two new event extraction datasets using English texts from two medical domains: (i) clinical notes, and (ii) doctor-patient conversations. We test the efficacy of three marginal alignment techniques: (i) adversarial domain adaptation (ADA), (ii) domain adaptive fine-tuning (DAFT), and (iii) a novel instance weighting technique based on language model likelihood scores (LIW). LIW and DAFT improve over a no-transfer BERT baseline on both domains, but ADA only improves on clinical notes. Deeper analysis of performance under different types of shifts (e.g., lexical shift, semantic shift) reveals interesting variations among models. Our best-performing models reach F1 scores of 70.0 and 72.9 on notes and conversations respectively, using no labeled data from target domains.

Via

Access Paper or Ask Questions