Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Aug 16, 2022

Ye Ma, Lu Zong

Figure 1 for Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Figure 2 for Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Figure 3 for Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Figure 4 for Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Share this with someone who'll enjoy it:

Abstract:In comparison to single-document summarization, abstractive Multi-Document Summarization (MDS) brings challenges on the representation and coverage of its lengthy and linked sources. This study develops a Parallel Hierarchical Transformer (PHT) with attention alignment for MDS. By incorporating word- and paragraph-level multi-head attentions, the hierarchical architecture of PHT allows better processing of dependencies at both token and document levels. To guide the decoding towards a better coverage of the source documents, the attention-alignment mechanism is then introduced to calibrate beam search with predicted optimal attention distributions. Based on the WikiSum data, a comprehensive evaluation is conducted to test improvements on MDS by the proposed architecture. By better handling the inner- and cross-document information, results in both ROUGE and human evaluation suggest that our hierarchical model generates summaries of higher quality relative to other Transformer-based baselines at relatively low computational cost.

* A work in 2020. arXiv admin note: substantial text overlap with arXiv:2009.06891

View paper on

Share this with someone who'll enjoy it:

Title:Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Paper and Code