Picture for Hao Bai

Hao Bai

InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning

Add code
Jan 20, 2026
Viaarxiv icon

WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Add code
Jan 07, 2026
Viaarxiv icon

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Add code
Oct 14, 2025
Viaarxiv icon

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

Add code
Jun 09, 2025
Figure 1 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 2 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 3 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 4 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Viaarxiv icon

Improving Neuron-level Interpretability with White-box Language Models

Add code
Oct 21, 2024
Viaarxiv icon

NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations

Add code
Jul 18, 2024
Figure 1 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Figure 2 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Figure 3 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Figure 4 for NODER: Image Sequence Regression Based on Neural Ordinary Differential Equations
Viaarxiv icon

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Add code
Jun 14, 2024
Viaarxiv icon

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Add code
May 17, 2024
Figure 1 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 2 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 3 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 4 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Viaarxiv icon

White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?

Add code
Nov 24, 2023
Figure 1 for White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
Figure 2 for White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
Figure 3 for White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
Figure 4 for White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
Viaarxiv icon

Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations

Add code
Oct 22, 2023
Figure 1 for Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
Figure 2 for Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
Figure 3 for Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
Figure 4 for Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
Viaarxiv icon