Picture for Minda Hu

Minda Hu

Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration

Add code
Feb 03, 2026
Viaarxiv icon

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Add code
Feb 02, 2026
Viaarxiv icon

ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning

Add code
Jan 08, 2026
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Figure 1 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 2 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 3 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 4 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Viaarxiv icon

WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback

Add code
May 26, 2025
Viaarxiv icon

A Survey of Personalized Large Language Models: Progress and Future Directions

Add code
Feb 17, 2025
Viaarxiv icon

NILE: Internal Consistency Alignment in Large Language Models

Add code
Dec 21, 2024
Viaarxiv icon

Purple-teaming LLMs with Adversarial Defender Training

Add code
Jul 01, 2024
Viaarxiv icon

Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization

Add code
Jun 17, 2024
Figure 1 for Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization
Figure 2 for Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization
Figure 3 for Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization
Figure 4 for Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization
Viaarxiv icon

Mitigating Large Language Model Hallucination with Faithful Finetuning

Add code
Jun 17, 2024
Figure 1 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 2 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 3 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Figure 4 for Mitigating Large Language Model Hallucination with Faithful Finetuning
Viaarxiv icon