Hiroki Furuta

Geometric-Averaged Preference Optimization for Soft Preference Labels

Sep 10, 2024
Via arXiv

Interpreting Grokked Transformers in Complex Modular Arithmetic

Feb 27, 2024
Via arXiv

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Feb 23, 2024
Via arXiv

Language Model Agents Suffer from Compositional Generalization in Web Automation

Nov 30, 2023
Via arXiv

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Oct 17, 2023
Via arXiv

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

Jul 24, 2023
Via arXiv

Multimodal Web Navigation with Instruction-Finetuned Foundation Models

May 19, 2023
Via arXiv

A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation

Nov 25, 2022
Via arXiv

Generalized Decision Transformer for Offline Hindsight Information Matching

Nov 23, 2021
Via arXiv

Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization

Oct 10, 2021
Via arXiv