Haitao Mi

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Nov 26, 2024

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Oct 09, 2024

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search

Oct 04, 2024

HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

Sep 25, 2024

SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models

Aug 28, 2024

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Jun 30, 2024

LiteSearch: Efficacious Tree Search for LLM

Jun 29, 2024

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Jun 28, 2024

Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching

Jun 11, 2024

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Apr 18, 2024