Mathieu Salzmann

CVLab, EPFL, Switzerland

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

Dec 15, 2024

Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes

Dec 07, 2024

Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds

Dec 02, 2024

Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts

Nov 06, 2024

Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis

Oct 31, 2024

Unlocking Comics: The AI4VA Dataset for Visual Understanding

Oct 27, 2024

QT-DoG: Quantization-aware Training for Domain Generalization

Oct 08, 2024

Data Augmentation via Latent Diffusion for Saliency Prediction

Sep 11, 2024

Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule

Sep 08, 2024

Hybrid diffusion models: combining supervised and generative pretraining for label-efficient fine-tuning of segmentation models

Aug 06, 2024