Richard P. Wildes

Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models

Apr 10, 2024
Via arXiv

Selective, Interpretable, and Motion Consistent Privacy Attribute Obfuscation for Action Recognition

Mar 19, 2024
Via arXiv

Understanding Video Transformers for Segmentation: A Survey of Application and Interpretability

Oct 18, 2023
Via arXiv

StepFormer: Self-supervised Step Discovery and Localization in Instructional Videos

Apr 26, 2023
Via arXiv

MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation

Apr 12, 2023
Via arXiv

Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks

Nov 03, 2022
Via arXiv

Sports Video Analysis on Large-Scale Data

Aug 09, 2022
Via arXiv

Is Appearance Free Action Recognition Possible?

Jul 13, 2022
Via arXiv

A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information

Jun 06, 2022
Via arXiv

P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision

May 04, 2022
Via arXiv