Victor Escorcia

Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning

Dec 09, 2024

SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition

Apr 10, 2022

OWL (Observe, Watch, Listen): Localizing Actions in Egocentric Video via Audiovisual Temporal Context

Feb 14, 2022

vCLIMB: A Novel Video Class Incremental Learning Benchmark

Jan 23, 2022

TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification

Jun 21, 2021

Boundary-sensitive Pre-training for Temporal Localization in Videos

Nov 24, 2020

Egocentric Action Recognition by Video Attention and Temporal Context

Jul 03, 2020

Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention

Apr 02, 2020

Temporal Localization of Moments in Video Collections with Natural Language

Jul 30, 2019

The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary

Aug 23, 2018