Abstract: Learning hidden topics from data streams has received a great deal of attention from researchers, and many methods have been proposed. However, these methods rarely take adequate account of prior knowledge in general, and knowledge graphs in particular. Prior knowledge derived from human knowledge (e.g., WordNet) or from a pre-trained model (e.g., Word2vec) is valuable and can help topic models perform better, especially on short texts. Previous work often ignores this resource, or can exploit only vector-form prior knowledge in a simple way. In this paper, we propose a novel graph convolutional topic model (GCTM) that integrates graph convolutional networks (GCN) into a topic model, together with a learning method that trains the network and the topic model simultaneously on data streams. In each minibatch, our method can not only exploit an external knowledge graph but also balance the external and old knowledge so as to perform well on new data. We conduct extensive experiments to evaluate our method with both a human knowledge graph (WordNet) and a graph built from pre-trained word embeddings (Word2vec). The experimental results show that our method significantly outperforms state-of-the-art baselines in terms of probabilistic predictive measure and topic coherence. In particular, our method works well on short texts and under concept drift. The implementation of GCTM is available at https://github.com/bachtranxuan/GCTM.git.
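To make the mechanism concrete, below is a minimal, hypothetical PyTorch sketch of the idea the abstract describes: a two-layer GCN over a word knowledge graph produces a prior over topic-word distributions, which is then mixed with the posterior carried over from earlier minibatches. All names here (WordGraphGCN, balanced_prior, the mixing weight pi) are illustrative assumptions, not the authors' actual implementation.

    # Hypothetical sketch of a GCN-derived topic-word prior for streaming data.
    import torch
    import torch.nn.functional as F

    class WordGraphGCN(torch.nn.Module):
        """Two-layer GCN: word features + normalized adjacency -> topic-word prior."""
        def __init__(self, n_words, hidden_dim, n_topics):
            super().__init__()
            self.fc1 = torch.nn.Linear(n_words, hidden_dim)
            self.fc2 = torch.nn.Linear(hidden_dim, n_topics)

        def forward(self, adj_norm, features):
            # adj_norm: (n_words, n_words) symmetrically normalized adjacency
            h = F.relu(adj_norm @ self.fc1(features))
            logits = adj_norm @ self.fc2(h)           # (n_words, n_topics)
            return F.softmax(logits, dim=0).T         # (n_topics, n_words) prior

    def balanced_prior(gcn_prior, old_posterior, pi):
        """Convex combination balancing external and old knowledge per minibatch."""
        return pi * gcn_prior + (1.0 - pi) * old_posterior

    # Toy usage on a random word graph (e.g., edges from WordNet or Word2vec similarity)
    n_words, n_topics = 100, 10
    adj = torch.rand(n_words, n_words)
    adj = (adj + adj.T) / 2 + torch.eye(n_words)      # symmetrize, add self-loops
    deg = adj.sum(1)
    adj_norm = adj / torch.sqrt(deg[:, None] * deg[None, :])
    gcn = WordGraphGCN(n_words, hidden_dim=32, n_topics=n_topics)
    prior = gcn(adj_norm, torch.eye(n_words))
    old = torch.full((n_topics, n_words), 1.0 / n_words)
    beta_prior = balanced_prior(prior, old, pi=0.7)

In the actual model, the GCN weights and the topic model would be trained jointly on each minibatch, so the graph-derived prior adapts to the stream rather than staying fixed.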
Abstract: We consider how to effectively use prior knowledge when learning a Bayesian model from streaming environments, where data arrive endlessly and sequentially. This problem is highly important in the era of data explosion, with abundant sources of valuable external knowledge such as pre-trained models, ontologies, and Wikipedia. We show that some existing approaches can forget the provided knowledge very quickly. We then propose a novel framework that enables the incorporation of prior knowledge of different forms into a base Bayesian model for data streams. Our framework subsumes several popular existing models for time-series/dynamic data. Extensive experiments show that our framework outperforms existing methods by a large margin. In particular, it can help Bayesian models generalize well on extremely short texts, where other methods overfit. The implementation of our framework is available at https://github.com/bachtranxuan/TPS.git.
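As a hedged illustration of the framework's core idea, the NumPy sketch below (all names hypothetical, not the authors' code) re-injects a prior transformed from external knowledge at every minibatch and balances it with the old knowledge held in the posterior, so the knowledge is not gradually overwritten as the stream grows.

    # Generic sketch: streaming updates with a re-transformed external prior.
    import numpy as np

    def transform_knowledge(embedding, weight):
        # embedding: (n_words, d) external word vectors; weight: (d, n_topics).
        # Column-wise softmax yields a (n_topics, n_words) topic-word prior.
        logits = embedding @ weight
        e = np.exp(logits - logits.max(axis=0, keepdims=True))
        return (e / e.sum(axis=0, keepdims=True)).T

    def stream_update(posterior, counts, embedding, weight, pi=0.5, lr=0.1):
        # Rebuild the prior from external knowledge at every minibatch,
        # then balance it against the old knowledge in the posterior.
        prior = transform_knowledge(embedding, weight)
        blended = pi * prior + (1.0 - pi) * posterior
        # Toy likelihood step: nudge toward the minibatch's empirical word distribution.
        empirical = counts / max(counts.sum(), 1.0)
        updated = (1.0 - lr) * blended + lr * empirical[None, :]
        return updated / updated.sum(axis=1, keepdims=True)

    # Toy usage over three minibatches of word counts
    rng = np.random.default_rng(0)
    n_words, n_topics, d = 50, 5, 16
    embedding = rng.normal(size=(n_words, d))
    weight = rng.normal(size=(d, n_topics))
    posterior = np.full((n_topics, n_words), 1.0 / n_words)
    for _ in range(3):
        counts = rng.integers(0, 5, size=n_words).astype(float)
        posterior = stream_update(posterior, counts, embedding, weight)

In the full framework, the transformation of the external knowledge would itself be learned jointly with the base Bayesian model rather than fixed as above.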