Picture for Shibo Hao

Shibo Hao

LLM Pretraining with Continuous Concepts

Add code
Feb 12, 2025
Viaarxiv icon

Linear Correlation in LM's Compositional Generalization and Hallucination

Add code
Feb 06, 2025
Viaarxiv icon

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Add code
Dec 20, 2024
Viaarxiv icon

Training Large Language Models to Reason in a Continuous Latent Space

Add code
Dec 09, 2024
Figure 1 for Training Large Language Models to Reason in a Continuous Latent Space
Figure 2 for Training Large Language Models to Reason in a Continuous Latent Space
Figure 3 for Training Large Language Models to Reason in a Continuous Latent Space
Figure 4 for Training Large Language Models to Reason in a Continuous Latent Space
Viaarxiv icon

Pandora: Towards General World Model with Natural Language Actions and Video States

Add code
Jun 12, 2024
Viaarxiv icon

Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

Add code
Jun 09, 2024
Figure 1 for Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking
Figure 2 for Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking
Figure 3 for Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking
Figure 4 for Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking
Viaarxiv icon

LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models

Add code
Apr 08, 2024
Viaarxiv icon

Reasoning with Language Model is Planning with World Model

Add code
May 24, 2023
Viaarxiv icon

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings

Add code
May 19, 2023
Viaarxiv icon

BertNet: Harvesting Knowledge Graphs from Pretrained Language Models

Add code
Jun 28, 2022
Figure 1 for BertNet: Harvesting Knowledge Graphs from Pretrained Language Models
Figure 2 for BertNet: Harvesting Knowledge Graphs from Pretrained Language Models
Figure 3 for BertNet: Harvesting Knowledge Graphs from Pretrained Language Models
Figure 4 for BertNet: Harvesting Knowledge Graphs from Pretrained Language Models
Viaarxiv icon