Picture for Wayne Xin Zhao

Wayne Xin Zhao

Do we Really Need Visual Instructions? Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models

Add code
Feb 17, 2025
Viaarxiv icon

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation

Add code
Feb 11, 2025
Viaarxiv icon

Holistically Guided Monte Carlo Tree Search for Intricate Information Seeking

Add code
Feb 07, 2025
Viaarxiv icon

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Add code
Jan 03, 2025
Viaarxiv icon

Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking

Add code
Jan 03, 2025
Viaarxiv icon

YuLan-Mini: An Open Data-efficient Language Model

Add code
Dec 24, 2024
Viaarxiv icon

RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement

Add code
Dec 17, 2024
Viaarxiv icon

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Add code
Dec 12, 2024
Figure 1 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Figure 2 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Figure 3 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Figure 4 for Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems
Viaarxiv icon

On Domain-Specific Post-Training for Multimodal Large Language Models

Add code
Nov 29, 2024
Figure 1 for On Domain-Specific Post-Training for Multimodal Large Language Models
Figure 2 for On Domain-Specific Post-Training for Multimodal Large Language Models
Figure 3 for On Domain-Specific Post-Training for Multimodal Large Language Models
Figure 4 for On Domain-Specific Post-Training for Multimodal Large Language Models
Viaarxiv icon

Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search

Add code
Nov 18, 2024
Viaarxiv icon