Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xin Cui

JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs

Jun 19, 2021

Pei Ke, Haozhe Ji, Yu Ran, Xin Cui, Liwei Wang, Linfeng Song, Xiaoyan Zhu, Minlie Huang

Figure 1 for JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs

Figure 2 for JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs

Figure 3 for JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs

Figure 4 for JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs

Abstract:Existing pre-trained models for knowledge-graph-to-text (KG-to-text) generation simply fine-tune text-to-text pre-trained models such as BART or T5 on KG-to-text datasets, which largely ignore the graph structure during encoding and lack elaborate pre-training tasks to explicitly model graph-text alignments. To tackle these problems, we propose a graph-text joint representation learning model called JointGT. During encoding, we devise a structure-aware semantic aggregation module which is plugged into each Transformer layer to preserve the graph structure. Furthermore, we propose three new pre-training tasks to explicitly enhance the graph-text alignment including respective text / graph reconstruction, and graph-text alignment in the embedding space via Optimal Transport. Experiments show that JointGT obtains new state-of-the-art performance on various KG-to-text datasets.

* ACL 2021 (Findings)

Via

Access Paper or Ask Questions

Shapley Interpretation and Activation in Neural Networks

Sep 17, 2019

Yadong Li, Xin Cui

Figure 1 for Shapley Interpretation and Activation in Neural Networks

Figure 2 for Shapley Interpretation and Activation in Neural Networks

Figure 3 for Shapley Interpretation and Activation in Neural Networks

Figure 4 for Shapley Interpretation and Activation in Neural Networks

Abstract:We propose a novel Shapley value approach to help address neural networks' interpretability and "vanishing gradient" problems. Our method is based on an accurate analytical approximation to the Shapley value of a neuron with ReLU activation. This analytical approximation admits a linear propagation of relevance across neural network layers, resulting in a simple, fast and sensible interpretation of neural networks' decision making process. We then derived a globally continuous and non-vanishing Shapley gradient, which can replace the conventional gradient in training neural network layers with ReLU activation, and leading to better training performance. We further derived a Shapley Activation (SA) function, which is a close approximation to ReLU but features the Shapley gradient. The SA is easy to implement in existing machine learning frameworks. Numerical tests show that SA consistently outperforms ReLU in training convergence, accuracy and stability.

* 11 pages, 7 figures

Via

Access Paper or Ask Questions