Picture for Zsolt Kira

Zsolt Kira

Grounding Multimodal LLMs to Embodied Agents that Ask for Help with Reinforcement Learning

Add code
Apr 02, 2025
Viaarxiv icon

When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach

Add code
Mar 21, 2025
Viaarxiv icon

Directional Gradient Projection for Robust Fine-Tuning of Foundation Models

Add code
Feb 21, 2025
Viaarxiv icon

Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding

Add code
Jan 28, 2025
Viaarxiv icon

From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Add code
Dec 11, 2024
Viaarxiv icon

Grounding Descriptions in Images informs Zero-Shot Visual Recognition

Add code
Dec 05, 2024
Figure 1 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 2 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 3 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 4 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Viaarxiv icon

Adversarial Attacks Using Differentiable Rendering: A Survey

Add code
Nov 14, 2024
Viaarxiv icon

Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models

Add code
Nov 03, 2024
Figure 1 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 2 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 3 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 4 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Viaarxiv icon

Neural Fields in Robotics: A Survey

Add code
Oct 26, 2024
Viaarxiv icon

ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI

Add code
Oct 03, 2024
Viaarxiv icon