Picture for Mathieu Salzmann

Mathieu Salzmann

CVLab EPFL Switzerland

MotionMap: Representing Multimodality in Human Pose Forecasting

Add code
Dec 25, 2024
Viaarxiv icon

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

Add code
Dec 15, 2024
Figure 1 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Figure 2 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Figure 3 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Figure 4 for GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Viaarxiv icon

Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes

Add code
Dec 07, 2024
Viaarxiv icon

Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds

Add code
Dec 02, 2024
Viaarxiv icon

Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts

Add code
Nov 06, 2024
Figure 1 for Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Figure 2 for Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Figure 3 for Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Figure 4 for Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Viaarxiv icon

Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis

Add code
Oct 31, 2024
Figure 1 for Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis
Figure 2 for Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis
Figure 3 for Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis
Figure 4 for Self-Ensembling Gaussian Splatting for Few-shot Novel View Synthesis
Viaarxiv icon

Unlocking Comics: The AI4VA Dataset for Visual Understanding

Add code
Oct 27, 2024
Figure 1 for Unlocking Comics: The AI4VA Dataset for Visual Understanding
Figure 2 for Unlocking Comics: The AI4VA Dataset for Visual Understanding
Figure 3 for Unlocking Comics: The AI4VA Dataset for Visual Understanding
Figure 4 for Unlocking Comics: The AI4VA Dataset for Visual Understanding
Viaarxiv icon

QT-DoG: Quantization-aware Training for Domain Generalization

Add code
Oct 08, 2024
Viaarxiv icon

Data Augmentation via Latent Diffusion for Saliency Prediction

Add code
Sep 11, 2024
Figure 1 for Data Augmentation via Latent Diffusion for Saliency Prediction
Figure 2 for Data Augmentation via Latent Diffusion for Saliency Prediction
Figure 3 for Data Augmentation via Latent Diffusion for Saliency Prediction
Figure 4 for Data Augmentation via Latent Diffusion for Saliency Prediction
Viaarxiv icon

Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule

Add code
Sep 08, 2024
Viaarxiv icon