Picture for Sven Behnke

Sven Behnke

University of Bonn

VideoPCDNet: Video Parsing and Prediction with Phase Correlation Networks

Add code
Jun 24, 2025
Viaarxiv icon

OC-SOP: Enhancing Vision-Based 3D Semantic Occupancy Prediction by Object-Centric Awareness

Add code
Jun 23, 2025
Viaarxiv icon

SWA-SOP: Spatially-aware Window Attention for Semantic Occupancy Prediction in Autonomous Driving

Add code
Jun 23, 2025
Viaarxiv icon

Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning

Add code
May 04, 2025
Viaarxiv icon

Feature-Preserving Mesh Decimation for Normal Integration

Add code
Apr 01, 2025
Viaarxiv icon

Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking

Add code
Mar 18, 2025
Viaarxiv icon

LIAM: Multimodal Transformer for Language Instructions, Images, Actions and Semantic Maps

Add code
Mar 15, 2025
Viaarxiv icon

Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models

Add code
Mar 12, 2025
Viaarxiv icon

Object-Centric Image to Video Generation with Language Guidance

Add code
Feb 17, 2025
Viaarxiv icon

PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning

Add code
Feb 11, 2025
Viaarxiv icon