Picture for Michael Felsberg

Michael Felsberg

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon

Affine steerers for structured keypoint description

Add code
Aug 26, 2024
Viaarxiv icon

Prior Learning in Introspective VAEs

Add code
Aug 25, 2024
Viaarxiv icon

Sim-to-real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning

Add code
Jun 07, 2024
Viaarxiv icon

NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving

Add code
Apr 12, 2024
Viaarxiv icon

Composed Video Retrieval via Enriched Context and Discriminative Embeddings

Add code
Mar 25, 2024
Viaarxiv icon

DiffSF: Diffusion Models for Scene Flow Estimation

Add code
Mar 14, 2024
Figure 1 for DiffSF: Diffusion Models for Scene Flow Estimation
Figure 2 for DiffSF: Diffusion Models for Scene Flow Estimation
Figure 3 for DiffSF: Diffusion Models for Scene Flow Estimation
Figure 4 for DiffSF: Diffusion Models for Scene Flow Estimation
Viaarxiv icon

PALO: A Polyglot Large Multimodal Model for 5B People

Add code
Mar 05, 2024
Figure 1 for PALO: A Polyglot Large Multimodal Model for 5B People
Figure 2 for PALO: A Polyglot Large Multimodal Model for 5B People
Figure 3 for PALO: A Polyglot Large Multimodal Model for 5B People
Figure 4 for PALO: A Polyglot Large Multimodal Model for 5B People
Viaarxiv icon

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Add code
Feb 26, 2024
Viaarxiv icon

SeTformer is What You Need for Vision and Language

Add code
Jan 07, 2024
Figure 1 for SeTformer is What You Need for Vision and Language
Figure 2 for SeTformer is What You Need for Vision and Language
Figure 3 for SeTformer is What You Need for Vision and Language
Figure 4 for SeTformer is What You Need for Vision and Language
Viaarxiv icon