Guhao Feng

Theoretical Benefit and Limitation of Diffusion Language Model

Feb 13, 2025

How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs

Oct 17, 2024

DPO Meets PPO: Reinforced Token Optimization for RLHF

Apr 29, 2024

Do Efficient Transformers Really Save Computation?

Feb 21, 2024

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

Jan 29, 2024

Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity

Dec 28, 2023

Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective

May 24, 2023

A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests

Feb 14, 2023