Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances

Add code
Sep 12, 2018
Figure 1 for Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances
Figure 2 for Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances
Figure 3 for Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances
Figure 4 for Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: