Alexander Schwing

ETH Zurich

On Inductive Biases That Enable Generalization of Diffusion Transformers

Oct 28, 2024

Pixel-Aligned Multi-View Generation with Depth Guided Decoder

Aug 26, 2024

NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows

Jun 15, 2024

Virtual Pets: Animatable Animal Generation in 3D Scenes

Dec 21, 2023

Putting the Object Back into Video Object Segmentation

Oct 19, 2023

Tracking Anything with Decoupled Video Segmentation

Sep 07, 2023

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation

Dec 08, 2022

On the Importance of Gradient Norm in PAC-Bayesian Bounds

Oct 12, 2022

Joint Forecasting of Panoptic Segmentations with Difference Attention

Apr 14, 2022

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding

Dec 20, 2021