Omid Poursaeed

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Sep 26, 2024
Via arXiv

Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs

Apr 11, 2024
Via arXiv

Universal Pyramid Adversarial Training for Improved ViT Performance

Dec 26, 2023
Via arXiv

Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding

Sep 20, 2023
Via arXiv

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Jun 01, 2023
Via arXiv

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

Dec 09, 2022
Via arXiv

A Unified Model for Tracking and Image-Video Detection Has More Power

Nov 20, 2022
Via arXiv

Robustness and Generalization via Generative Adversarial Training

Sep 06, 2021
Via arXiv

Augmentation-Interpolative AutoEncoders for Unsupervised Few-Shot Image Generation

Nov 25, 2020
Via arXiv

Self-supervised Learning of Point Clouds via Orientation Estimation

Aug 01, 2020
Via arXiv