Picture for Tim K. Marks

Tim K. Marks

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

Apr 25, 2024
Viaarxiv icon

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Sep 30, 2023
Figure 1 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 2 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 3 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Figure 4 for Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
Viaarxiv icon

H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions

Add code
Oct 22, 2022
Figure 1 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Figure 2 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Figure 3 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Figure 4 for H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions
Viaarxiv icon

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering

Feb 18, 2022
Figure 1 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 2 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 3 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Figure 4 for (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
Viaarxiv icon

MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation

Nov 01, 2021
Figure 1 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Figure 2 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Figure 3 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Figure 4 for MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation
Viaarxiv icon

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning

Add code
Oct 13, 2021
Figure 1 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 2 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 3 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Figure 4 for Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning
Viaarxiv icon

InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

Aug 31, 2021
Figure 1 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Figure 2 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Figure 3 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Figure 4 for InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images
Viaarxiv icon

LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood

Add code
Apr 06, 2020
Figure 1 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Figure 2 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Figure 3 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Figure 4 for LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood
Viaarxiv icon

Spatio-Temporal Ranked-Attention Networks for Video Captioning

Jan 17, 2020
Figure 1 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Figure 2 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Figure 3 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Figure 4 for Spatio-Temporal Ranked-Attention Networks for Video Captioning
Viaarxiv icon

The Eighth Dialog System Technology Challenge

Add code
Nov 14, 2019
Figure 1 for The Eighth Dialog System Technology Challenge
Figure 2 for The Eighth Dialog System Technology Challenge
Figure 3 for The Eighth Dialog System Technology Challenge
Figure 4 for The Eighth Dialog System Technology Challenge
Viaarxiv icon