Kuo-Hao Zeng

SAT: Spatial Aptitude Training for Multimodal Language Models

Dec 10, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Sep 25, 2024

FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning

Sep 25, 2024

PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Jun 28, 2024

Seeing the Unseen: Visual Common Sense for Semantic Placement

Jan 15, 2024

Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World

Dec 05, 2023

Selective Visual Representations Improve Convergence and Generalization for Embodied AI

Nov 07, 2023

Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics

Apr 24, 2023

Pushing it out of the Way: Interactive Visual Navigation

May 02, 2021

AllenAct: A Framework for Embodied AI Research

Aug 28, 2020