Pengfei Liu

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Jan 11, 2025

DIVE: Diversified Iterative Self-Improvement

Jan 01, 2025

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Dec 24, 2024

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Dec 23, 2024

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Nov 25, 2024

Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model

Oct 24, 2024

Revisiting Benchmark and Assessment: An Agent-based Exploratory Dynamic Evaluation Framework for LLMs

Oct 15, 2024

ECon: On the Detection and Resolution of Evidence Conflicts

Oct 05, 2024

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Sep 25, 2024

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

Aug 15, 2024