Picture for Jinming Nian

Jinming Nian

Evaluating Social Biases in LLM Reasoning

Add code
Feb 21, 2025
Viaarxiv icon

RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions

Add code
Oct 18, 2024
Figure 1 for RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions
Figure 2 for RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions
Figure 3 for RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions
Figure 4 for RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions
Viaarxiv icon

W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering

Add code
Aug 15, 2024
Viaarxiv icon