Picture for Zsolt Kira

Zsolt Kira

Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding

Add code
Jan 28, 2025
Viaarxiv icon

From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Add code
Dec 11, 2024
Viaarxiv icon

Grounding Descriptions in Images informs Zero-Shot Visual Recognition

Add code
Dec 05, 2024
Figure 1 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 2 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 3 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 4 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Viaarxiv icon

Adversarial Attacks Using Differentiable Rendering: A Survey

Add code
Nov 14, 2024
Viaarxiv icon

Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models

Add code
Nov 03, 2024
Figure 1 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 2 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 3 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 4 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Viaarxiv icon

Neural Fields in Robotics: A Survey

Add code
Oct 26, 2024
Viaarxiv icon

ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI

Add code
Oct 03, 2024
Viaarxiv icon

Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

Add code
Jul 09, 2024
Figure 1 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 2 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 3 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 4 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Viaarxiv icon

Reinforcement Learning via Auxiliary Task Distillation

Add code
Jun 24, 2024
Viaarxiv icon

Grounding Multimodal Large Language Models in Actions

Add code
Jun 12, 2024
Figure 1 for Grounding Multimodal Large Language Models in Actions
Figure 2 for Grounding Multimodal Large Language Models in Actions
Figure 3 for Grounding Multimodal Large Language Models in Actions
Figure 4 for Grounding Multimodal Large Language Models in Actions
Viaarxiv icon