Picture for Ruslan Salakhutdinov

Ruslan Salakhutdinov

Shammie

The BrowserGym Ecosystem for Web Agent Research

Add code
Dec 10, 2024
Viaarxiv icon

Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion

Add code
Nov 30, 2024
Viaarxiv icon

Local Policies Enable Zero-shot Long-horizon Manipulation

Add code
Oct 29, 2024
Viaarxiv icon

Embodied-RAG: General non-parametric Embodied Memory for Retrieval and Generation

Add code
Sep 26, 2024
Viaarxiv icon

Neural MP: A Generalist Neural Motion Planner

Add code
Sep 09, 2024
Viaarxiv icon

Situated Instruction Following

Add code
Jul 15, 2024
Figure 1 for Situated Instruction Following
Figure 2 for Situated Instruction Following
Figure 3 for Situated Instruction Following
Figure 4 for Situated Instruction Following
Viaarxiv icon

HEMM: Holistic Evaluation of Multimodal Foundation Models

Add code
Jul 03, 2024
Figure 1 for HEMM: Holistic Evaluation of Multimodal Foundation Models
Figure 2 for HEMM: Holistic Evaluation of Multimodal Foundation Models
Figure 3 for HEMM: Holistic Evaluation of Multimodal Foundation Models
Figure 4 for HEMM: Holistic Evaluation of Multimodal Foundation Models
Viaarxiv icon

Tree Search for Language Model Agents

Add code
Jul 01, 2024
Figure 1 for Tree Search for Language Model Agents
Figure 2 for Tree Search for Language Model Agents
Figure 3 for Tree Search for Language Model Agents
Figure 4 for Tree Search for Language Model Agents
Viaarxiv icon

Adversarial Attacks on Multimodal Agents

Add code
Jun 18, 2024
Viaarxiv icon

Understanding Visual Concepts Across Models

Add code
Jun 11, 2024
Viaarxiv icon