Picture for Yuandong Tian

Yuandong Tian

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Add code
Feb 18, 2025
Viaarxiv icon

Spectral Journey: How Transformers Predict the Shortest Path

Add code
Feb 12, 2025
Viaarxiv icon

LLM Pretraining with Continuous Concepts

Add code
Feb 12, 2025
Viaarxiv icon

SHARP: Accelerating Language Model Inference by SHaring Adjacent layers with Recovery Parameters

Add code
Feb 11, 2025
Viaarxiv icon

GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?

Add code
Feb 07, 2025
Viaarxiv icon

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Add code
Feb 05, 2025
Viaarxiv icon

ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization

Add code
Feb 04, 2025
Viaarxiv icon

Towards General-Purpose Model-Free Reinforcement Learning

Add code
Jan 27, 2025
Figure 1 for Towards General-Purpose Model-Free Reinforcement Learning
Figure 2 for Towards General-Purpose Model-Free Reinforcement Learning
Figure 3 for Towards General-Purpose Model-Free Reinforcement Learning
Figure 4 for Towards General-Purpose Model-Free Reinforcement Learning
Viaarxiv icon

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Add code
Jan 18, 2025
Viaarxiv icon

Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition

Add code
Jan 04, 2025
Figure 1 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Figure 2 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Figure 3 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Figure 4 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Viaarxiv icon