Tommaso Galliena

Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

Apr 11, 2025
Via arXiv