Picture for Spyros Gidaris

Spyros Gidaris

EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Add code
Feb 13, 2025
Viaarxiv icon

Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers

Add code
Jan 14, 2025
Viaarxiv icon

DINO-Foresight Looking into the Future with DINO

Add code
Dec 16, 2024
Viaarxiv icon

No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations

Add code
Jul 15, 2024
Figure 1 for No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations
Figure 2 for No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations
Figure 3 for No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations
Figure 4 for No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations
Viaarxiv icon

Valeo4Cast: A Modular Approach to End-to-End Forecasting

Add code
Jun 12, 2024
Figure 1 for Valeo4Cast: A Modular Approach to End-to-End Forecasting
Figure 2 for Valeo4Cast: A Modular Approach to End-to-End Forecasting
Figure 3 for Valeo4Cast: A Modular Approach to End-to-End Forecasting
Figure 4 for Valeo4Cast: A Modular Approach to End-to-End Forecasting
Viaarxiv icon

OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Add code
Apr 22, 2024
Figure 1 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Figure 2 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Figure 3 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Figure 4 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Viaarxiv icon

POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images

Add code
Jan 17, 2024
Figure 1 for POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Figure 2 for POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Figure 3 for POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Figure 4 for POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Viaarxiv icon

SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Add code
Dec 01, 2023
Viaarxiv icon

Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving

Add code
Oct 26, 2023
Figure 1 for Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving
Figure 2 for Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving
Figure 3 for Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving
Figure 4 for Revisiting the Distillation of Image Representations into Point Clouds for Autonomous Driving
Viaarxiv icon

Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey

Add code
Oct 19, 2023
Viaarxiv icon