Paul McNamee

On the Evaluation of Machine-Generated Reports

May 02, 2024

James Mayfield, Eugene Yang, Dawn Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Kayi (+3 more)

Abstract:Large Language Models (LLMs) have enabled new ways to satisfy information needs. Although great strides have been made in applying them to settings like document ranking and short-form text generation, they still struggle to compose complete, accurate, and verifiable long-form reports. Reports with these qualities are necessary to satisfy the complex, nuanced, or multi-faceted information needs of users. In this perspective paper, we draw together opinions from industry and academia, and from a variety of related research areas, to present our vision for automatic report generation, and -- critically -- a flexible framework by which such reports can be evaluated. In contrast with other summarization tasks, automatic report generation starts with a detailed description of an information need, stating the necessary background, requirements, and scope of the report. Further, the generated reports should be complete, accurate, and verifiable. These qualities, which are desirable -- if not required -- in many analytic report-writing settings, require rethinking how to build and evaluate systems that exhibit these qualities. To foster new efforts in building these systems, we present an evaluation framework that draws on ideas found in various evaluations. To test completeness and accuracy, the framework uses nuggets of information, expressed as questions and answers, that need to be part of any high-quality generated report. Additionally, evaluation of citations that map claims made in the report to their source documents ensures verifiability.

* 12 pages, 4 figures, accepted at SIGIR 2024 as perspective paper
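
The nugget and citation checks described in the abstract above can be illustrated with a minimal sketch. This is not the paper's framework: the `Nugget` class, the substring-matching rule, and both scoring functions are hypothetical simplifications, assuming each nugget is a question with a short answer string and each citation is a (claim, document ID) pair that an assessor has judged as supported or not.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Nugget:
    """Hypothetical nugget: a question and the short answer a good report should contain."""
    question: str
    answer: str


def nugget_recall(report: str, nuggets: list[Nugget]) -> float:
    """Fraction of nuggets whose answer string occurs in the report (case-insensitive).

    Substring matching is an assumption made for this sketch; a real
    evaluation would rely on human or model judgments instead.
    """
    if not nuggets:
        return 0.0
    text = report.lower()
    found = sum(1 for n in nuggets if n.answer.lower() in text)
    return found / len(nuggets)


def citation_support(citations: list[tuple[str, str]],
                     supported: set[tuple[str, str]]) -> float:
    """Fraction of (claim, doc_id) citations that appear in the judged-supported set."""
    if not citations:
        return 0.0
    return sum(1 for c in citations if c in supported) / len(citations)
```

Under these assumptions, `nugget_recall` stands in for completeness and accuracy, while `citation_support` stands in for verifiability.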

Overview of the TREC 2023 NeuCLIR Track

Apr 11, 2024

Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang

Abstract:The principal goal of the TREC Neural Cross-Language Information Retrieval (NeuCLIR) track is to study the impact of neural approaches to cross-language information retrieval. The track has created four collections, large collections of Chinese, Persian, and Russian newswire and a smaller collection of Chinese scientific abstracts. The principal tasks are ranked retrieval of news in one of the three languages, using English topics. Results for a multilingual task, also with English topics but with documents from all three newswire collections, are also reported. New in this second year of the track is a pilot technical documents CLIR task for ranked retrieval of Chinese technical documents using English topics. A total of 220 runs across all tasks were submitted by six participating teams and, as baselines, by track coordinators. Task descriptions and results are presented.

* 27 pages, 17 figures. Part of the TREC 2023 Proceedings

Extending Translate-Train for ColBERT-X to African Language CLIR

Apr 11, 2024

Eugene Yang, Dawn J. Lawrie, Paul McNamee, James Mayfield

Abstract:This paper describes the submission runs from the HLTCOE team at the CIRAL CLIR tasks for African languages at FIRE 2023. Our submissions use machine translation models to translate the documents and the training passages, and ColBERT-X as the retrieval model. Additionally, we present a set of unofficial runs that use an alternative training procedure with a similar training setting.

* 10 pages, 2 figures. System description paper for HLTCOE's participation in CIRAL@FIRE 2023

Overview of the TREC 2022 NeuCLIR Track

Apr 24, 2023

Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang

Abstract:This is the first year of the TREC Neural CLIR (NeuCLIR) track, which aims to study the impact of neural approaches to cross-language information retrieval. The main task in this year's track was ad hoc ranked retrieval of Chinese, Persian, or Russian newswire documents using queries expressed in English. Topics were developed using standard TREC processes, except that topics developed by an annotator for one language were assessed by a different annotator when evaluating that topic on a different language. There were 172 total runs submitted by twelve teams.

* 22 pages, 13 figures, 10 tables. Part of the Thirty-First Text REtrieval Conference (TREC 2022) Proceedings

Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models

Jan 20, 2022

Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee, Kenton Murray, James Mayfield, Douglas W. Oard

Abstract:The advent of transformer-based models such as BERT has led to the rise of neural ranking models. These models have improved the effectiveness of retrieval systems well beyond that of lexical term matching models such as BM25. While monolingual retrieval tasks have benefited from large-scale training collections such as MS MARCO and advances in neural architectures, cross-language retrieval tasks have fallen behind these advancements. This paper introduces ColBERT-X, a generalization of the ColBERT multi-representation dense retrieval model that uses the XLM-RoBERTa (XLM-R) encoder to support cross-language information retrieval (CLIR). ColBERT-X can be trained in two ways. In zero-shot training, the system is trained on the English MS MARCO collection, relying on the XLM-R encoder for cross-language mappings. In translate-train, the system is trained on the MS MARCO English queries coupled with machine translations of the associated MS MARCO passages. Results on ad hoc document ranking tasks in several languages demonstrate substantial and statistically significant improvements of these trained dense retrieval models over traditional lexical CLIR baselines.

* Accepted at ECIR 2022 (Full paper)

Curriculum Learning for Domain Adaptation in Neural Machine Translation

May 14, 2019

Xuan Zhang, Pamela Shapiro, Gaurav Kumar, Paul McNamee, Marine Carpuat, Kevin Duh

Abstract:We introduce a curriculum learning approach to adapt generic neural machine translation models to a specific domain. Samples are grouped by their similarities to the domain of interest and each group is fed to the training algorithm with a particular schedule. This approach is simple to implement on top of any neural framework or architecture, and consistently outperforms both unadapted and adapted baselines in experiments with two distinct domains and two language pairs.
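
The grouping-and-scheduling idea in the abstract above can be sketched as follows. The abstract does not specify the schedule, so this example assumes one plausible choice: samples are sorted by a precomputed domain-similarity score, split into shards, and each training phase adds the next most similar shard to the pool. The function names and the cumulative schedule are hypothetical, not the authors' implementation.

```python
def make_shards(samples, scores, num_shards):
    """Sort samples by descending similarity score and split them into
    at most num_shards groups of near-equal size."""
    if num_shards <= 0:
        raise ValueError("num_shards must be positive")
    ranked = [s for s, _ in sorted(zip(samples, scores),
                                   key=lambda pair: pair[1], reverse=True)]
    if not ranked:
        return []
    size = -(-len(ranked) // num_shards)  # ceiling division
    return [ranked[i:i + size] for i in range(0, len(ranked), size)]


def curriculum_phases(shards):
    """Yield the training pool for each phase: phase k contains shards 0..k,
    so the most domain-similar data is seen first (an assumed schedule)."""
    pool = []
    for shard in shards:
        pool = pool + shard
        yield list(pool)
```

A training loop would iterate over `curriculum_phases(...)` and run some number of updates on each pool before moving to the next phase.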

An Empirical Exploration of Curriculum Learning for Neural Machine Translation

Nov 02, 2018

Xuan Zhang, Gaurav Kumar, Huda Khayrallah, Kenton Murray, Jeremy Gwinnup, Marianna J Martindale, Paul McNamee, Kevin Duh, Marine Carpuat

Abstract:Machine translation systems based on deep neural networks are expensive to train. Curriculum learning aims to address this issue by choosing the order in which samples are presented during training to help train better models faster. We adopt a probabilistic view of curriculum learning, which lets us flexibly evaluate the impact of curricula design, and perform an extensive exploration on a German-English translation task. Results show that it is possible to improve convergence time at no loss in translation quality. However, results are highly sensitive to the choice of sample difficulty criteria, curriculum schedule and other hyperparameters.

Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation

Sep 14, 2018

Brian Thompson, Huda Khayrallah, Antonios Anastasopoulos, Arya McCarthy, Kevin Duh, Rebecca Marvin, Paul McNamee, Jeremy Gwinnup, Tim Anderson, Philipp Koehn

Abstract:To better understand the effectiveness of continued training, we analyze the major components of a neural machine translation system (the encoder, decoder, and each embedding space) and consider each component's contribution to, and capacity for, domain adaptation. We find that freezing any single component during continued training has minimal impact on performance, and that performance is surprisingly good when a single component is adapted while holding the rest of the model fixed. We also find that continued training does not move the model very far from the out-of-domain model, compared to a sensitivity analysis metric, suggesting that the out-of-domain model can provide a good generic initialization for the new domain.

* to be presented at WMT 2018

Using of heterogeneous corpora for training of an ASR system

Jun 01, 2017

Jan Trmal, Gaurav Kumar, Vimal Manohar, Sanjeev Khudanpur, Matt Post, Paul McNamee

Abstract:The paper summarizes the development of the LVCSR system built as a part of the Pashto speech-translation system at the SCALE (Summer Camp for Applied Language Exploration) 2015 workshop on "Speech-to-text-translation for low-resource languages". The Pashto language was chosen as a good "proxy" low-resource language, exhibiting multiple phenomena which make the development of speech-recognition and speech-to-text-translation systems hard. Even when the amount of data is seemingly sufficient, given the fact that the data originates from multiple sources, the preliminary experiments reveal that there is little to no benefit in merging (concatenating) the corpora and more elaborate ways of making use of all of the data must be worked out. This paper concentrates only on the LVCSR part and presents a range of different techniques that were found to be useful in order to benefit from multiple different corpora.

Interactive Knowledge Base Population

May 31, 2015

Travis Wolfe, Mark Dredze, James Mayfield, Paul McNamee, Craig Harman, Tim Finin, Benjamin Van Durme

Abstract:Most work on building knowledge bases has focused on collecting entities and facts from as large a collection of documents as possible. We argue for and describe a new paradigm where the focus is on a high-recall extraction over a small collection of documents under the supervision of a human expert, that we call Interactive Knowledge Base Population (IKBP).
