Qiuzhi Liu

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

May 29, 2025

Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training

May 20, 2025

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Apr 15, 2025

Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique

Mar 21, 2025

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models

Mar 04, 2025

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Jan 30, 2025

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Dec 30, 2024

Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs

Oct 10, 2024

Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step

Oct 04, 2024

Simple and Scalable Nearest Neighbor Machine Translation

Feb 23, 2023