Picture for Zsolt Kira

Zsolt Kira

From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Add code
Dec 11, 2024
Viaarxiv icon

Grounding Descriptions in Images informs Zero-Shot Visual Recognition

Add code
Dec 05, 2024
Figure 1 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 2 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 3 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Figure 4 for Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Viaarxiv icon

Adversarial Attacks Using Differentiable Rendering: A Survey

Add code
Nov 14, 2024
Viaarxiv icon

Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models

Add code
Nov 03, 2024
Figure 1 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 2 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 3 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Figure 4 for Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models
Viaarxiv icon

Neural Fields in Robotics: A Survey

Add code
Oct 26, 2024
Viaarxiv icon

ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI

Add code
Oct 03, 2024
Viaarxiv icon

Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

Add code
Jul 09, 2024
Figure 1 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 2 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 3 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 4 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Viaarxiv icon

Reinforcement Learning via Auxiliary Task Distillation

Add code
Jun 24, 2024
Viaarxiv icon

ICE-G: Image Conditional Editing of 3D Gaussian Splats

Add code
Jun 12, 2024
Viaarxiv icon

Grounding Multimodal Large Language Models in Actions

Add code
Jun 12, 2024
Figure 1 for Grounding Multimodal Large Language Models in Actions
Figure 2 for Grounding Multimodal Large Language Models in Actions
Figure 3 for Grounding Multimodal Large Language Models in Actions
Figure 4 for Grounding Multimodal Large Language Models in Actions
Viaarxiv icon