Shichao Sun

RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

Aug 15, 2024
Via arXiv

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Aug 13, 2024
Via arXiv

FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models

Jul 01, 2024
Via arXiv

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Jun 18, 2024
Via arXiv

Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization

Jun 01, 2024
Via arXiv

Dissecting Human and LLM Preferences

Feb 17, 2024
Via arXiv

The Critique of Critique

Jan 09, 2024
Via arXiv

Evolving Large Language Model Assistant with Long-Term Conditional Memory

Dec 22, 2023
Via arXiv

Generative Judge for Evaluating Alignment

Oct 09, 2023
Via arXiv

Aligning Language Models with Human Preferences via a Bayesian Approach

Oct 09, 2023
Via arXiv