Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sungju Hwang

Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data

Dec 11, 2018

Hyunwoo Jung, Moonsu Han, Minki Kang, Sungju Hwang

Figure 1 for Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data

Figure 2 for Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data

Figure 3 for Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data

Figure 4 for Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data

Abstract:Current generation of memory-augmented neural networks has limited scalability as they cannot efficiently process data that are too large to fit in the external memory storage. One example of this is lifelong learning scenario where the model receives unlimited length of data stream as an input which contains vast majority of uninformative entries. We tackle this problem by proposing a memory network fit for long-term lifelong learning scenario, which we refer to as Long-term Episodic Memory Networks (LEMN), that features a RNN-based retention agent that learns to replace less important memory entries based on the retention probability generated on each entry that is learned to identify data instances of generic importance relative to other memory entries, as well as its historical importance. Such learning of retention agent allows our long-term episodic memory network to retain memory entries of generic importance for a given task. We validate our model on a path-finding task as well as synthetic and real question answering tasks, on which our model achieves significant improvements over the memory augmented networks with rule-based memory scheduling as well as an RL-based baseline that does not consider relative or historical importance of the memory.

Via

Access Paper or Ask Questions

Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Oct 02, 2018

Yanbin Liu, Juho Lee, Minseop Park, Saehoon Kim, Eunho Yang, Sungju Hwang, Yi Yang

Figure 1 for Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Figure 2 for Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Figure 3 for Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Figure 4 for Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

Abstract:The goal of few-shot learning is to learn a classifier that generalizes well even when trained with a limited number of training instances per class. The recently introduced meta-learning approaches tackle this problem by learning a generic classifier across a large number of multiclass classification tasks and generalizing the model to a new task. Yet, even with such meta-learning, the low-data problem in the novel classification task still remains. In this paper, we propose Transductive Propagation Network (TPN), a novel meta-learning framework for transductive inference that classifies the entire test set at once to alleviate the low-data problem. Specifically, we propose to learn to propagate labels from labeled instances to unlabeled test instances, by learning a graph construction module that exploits the manifold structure in the data. TPN jointly learns both the parameters of feature embedding and the graph construction in an end-to-end manner. We validate TPN on multiple benchmark datasets, on which it largely outperforms existing few-shot learning approaches and achieves the state-of-the-art results.

* 11 pages, 5 figures. We propose to learn to propagate labels and achieved the state-of-the-art on miniImagenet and tieredImagenet dataset

Via

Access Paper or Ask Questions