Picture for Xinpeng Chen

Xinpeng Chen

Real-Time Referring Expression Comprehension by Single-Stage Grounding Network

Add code
Dec 09, 2018
Figure 1 for Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Figure 2 for Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Figure 3 for Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Figure 4 for Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Viaarxiv icon

Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present

Add code
Apr 07, 2018
Figure 1 for Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Figure 2 for Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Figure 3 for Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Figure 4 for Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Viaarxiv icon

Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset

Add code
Apr 07, 2018
Figure 1 for Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset
Figure 2 for Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset
Figure 3 for Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset
Figure 4 for Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset
Viaarxiv icon

Learning to Guide Decoding for Image Captioning

Add code
Apr 03, 2018
Figure 1 for Learning to Guide Decoding for Image Captioning
Figure 2 for Learning to Guide Decoding for Image Captioning
Figure 3 for Learning to Guide Decoding for Image Captioning
Figure 4 for Learning to Guide Decoding for Image Captioning
Viaarxiv icon

Aggregating Frame-level Features for Large-Scale Video Classification

Add code
Jul 04, 2017
Figure 1 for Aggregating Frame-level Features for Large-Scale Video Classification
Figure 2 for Aggregating Frame-level Features for Large-Scale Video Classification
Figure 3 for Aggregating Frame-level Features for Large-Scale Video Classification
Figure 4 for Aggregating Frame-level Features for Large-Scale Video Classification
Viaarxiv icon