Guhao Feng

How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs

Oct 17, 2024
Via arXiv

DPO Meets PPO: Reinforced Token Optimization for RLHF

Apr 29, 2024
Via arXiv

Do Efficient Transformers Really Save Computation?

Feb 21, 2024
Via arXiv

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

Jan 29, 2024
Via arXiv

Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity

Dec 28, 2023
Via arXiv

Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective

May 24, 2023
Via arXiv

A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests

Feb 14, 2023
Via arXiv