Picture for Linfeng Song

Linfeng Song

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Add code
Oct 09, 2024
Figure 1 for Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Figure 2 for Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Figure 3 for Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Figure 4 for Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Viaarxiv icon

Mitigating the Negative Impact of Over-association for Conversational Query Production

Add code
Sep 29, 2024
Viaarxiv icon

SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models

Add code
Aug 28, 2024
Figure 1 for SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Figure 2 for SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Figure 3 for SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Figure 4 for SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Viaarxiv icon

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Add code
Jun 30, 2024
Viaarxiv icon

LiteSearch: Efficacious Tree Search for LLM

Add code
Jun 29, 2024
Viaarxiv icon

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Add code
Apr 18, 2024
Figure 1 for Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Figure 2 for Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Figure 3 for Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Figure 4 for Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Viaarxiv icon

Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models

Add code
Apr 14, 2024
Figure 1 for Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
Figure 2 for Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
Figure 3 for Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
Figure 4 for Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
Viaarxiv icon

Self-Consistency Boosts Calibration for Math Reasoning

Add code
Mar 14, 2024
Viaarxiv icon

A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation

Add code
Mar 06, 2024
Viaarxiv icon

Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal

Add code
Mar 02, 2024
Figure 1 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Figure 2 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Figure 3 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Figure 4 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Viaarxiv icon