Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Structure Optimization for Deep Multimodal Fusion Networks using Graph-Induced Kernels

Jul 03, 2017

Dhanesh Ramachandram, Michal Lisicki, Timothy J. Shields, Mohamed R. Amer, Graham W. Taylor

Figure 1 for Structure Optimization for Deep Multimodal Fusion Networks using Graph-Induced Kernels

Figure 2 for Structure Optimization for Deep Multimodal Fusion Networks using Graph-Induced Kernels

Figure 3 for Structure Optimization for Deep Multimodal Fusion Networks using Graph-Induced Kernels

Share this with someone who'll enjoy it:

Abstract:A popular testbed for deep learning has been multimodal recognition of human activity or gesture involving diverse inputs such as video, audio, skeletal pose and depth images. Deep learning architectures have excelled on such problems due to their ability to combine modality representations at different levels of nonlinear feature extraction. However, designing an optimal architecture in which to fuse such learned representations has largely been a non-trivial human engineering effort. We treat fusion structure optimization as a hyper-parameter search and cast it as a discrete optimization problem under the Bayesian optimization framework. We propose a novel graph-induced kernel to compute structural similarities in the search space of tree-structured multimodal architectures and demonstrate its effectiveness using two challenging multimodal human activity recognition datasets.

* Proceedings of the 25th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, April 2017, Bruges, Belgium

View paper on

Share this with someone who'll enjoy it:

Title:Structure Optimization for Deep Multimodal Fusion Networks using Graph-Induced Kernels

Paper and Code