Picture for Jonathan C. Stroud

Jonathan C. Stroud

Learning Video Representations from Textual Web Supervision

Add code
Jul 29, 2020
Figure 1 for Learning Video Representations from Textual Web Supervision
Figure 2 for Learning Video Representations from Textual Web Supervision
Figure 3 for Learning Video Representations from Textual Web Supervision
Figure 4 for Learning Video Representations from Textual Web Supervision
Viaarxiv icon

Compositional Temporal Visual Grounding of Natural Language Event Descriptions

Add code
Dec 04, 2019
Figure 1 for Compositional Temporal Visual Grounding of Natural Language Event Descriptions
Figure 2 for Compositional Temporal Visual Grounding of Natural Language Event Descriptions
Figure 3 for Compositional Temporal Visual Grounding of Natural Language Event Descriptions
Figure 4 for Compositional Temporal Visual Grounding of Natural Language Event Descriptions
Viaarxiv icon

D3D: Distilled 3D Networks for Video Action Recognition

Add code
Dec 19, 2018
Figure 1 for D3D: Distilled 3D Networks for Video Action Recognition
Figure 2 for D3D: Distilled 3D Networks for Video Action Recognition
Figure 3 for D3D: Distilled 3D Networks for Video Action Recognition
Figure 4 for D3D: Distilled 3D Networks for Video Action Recognition
Viaarxiv icon