Picture for Kiana Ehsani

Kiana Ehsani

SAT: Spatial Aptitude Training for Multimodal Language Models

Add code
Dec 10, 2024
Viaarxiv icon

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Add code
Sep 25, 2024
Figure 1 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 2 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 3 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 4 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Viaarxiv icon

PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Add code
Jun 28, 2024
Viaarxiv icon

Manipulate-Anything: Automating Real-World Robots using Vision-Language Models

Add code
Jun 27, 2024
Viaarxiv icon

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Add code
Dec 14, 2023
Figure 1 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 2 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 3 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 4 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Viaarxiv icon

Harmonic Mobile Manipulation

Add code
Dec 11, 2023
Viaarxiv icon

Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World

Add code
Dec 05, 2023
Viaarxiv icon

Objaverse-XL: A Universe of 10M+ 3D Objects

Add code
Jul 11, 2023
Viaarxiv icon

Objaverse: A Universe of Annotated 3D Objects

Add code
Dec 15, 2022
Viaarxiv icon

Phone2Proc: Bringing Robust Robots Into Our Chaotic World

Add code
Dec 08, 2022
Viaarxiv icon