Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction

Oct 25, 2023

Xuming Hu, Junzhe Chen, Aiwei Liu, Shiao Meng, Lijie Wen, Philip S. Yu

Figure 1 for Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction

Figure 2 for Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction

Figure 3 for Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction

Figure 4 for Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction

Share this with someone who'll enjoy it:

Abstract:How can we better extract entities and relations from text? Using multimodal extraction with images and text obtains more signals for entities and relations, and aligns them through graphs or hierarchical fusion, aiding in extraction. Despite attempts at various fusions, previous works have overlooked many unlabeled image-caption pairs, such as NewsCLIPing. This paper proposes innovative pre-training objectives for entity-object and relation-image alignment, extracting objects from images and aligning them with entity and relation prompts for soft pseudo-labels. These labels are used as self-supervised signals for pre-training, enhancing the ability to extract entities and relations. Experiments on three datasets show an average 3.41% F1 improvement over prior SOTA. Additionally, our method is orthogonal to previous multimodal fusions, and using it on prior SOTA fusions further improves 5.47% F1.

* Accepted to ACM Multimedia 2023

View paper on

Share this with someone who'll enjoy it:

Title:Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction

Paper and Code