Picture for Boyi Deng

Boyi Deng

P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs

Add code
Nov 14, 2024
Figure 1 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Figure 2 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Figure 3 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Figure 4 for P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
Viaarxiv icon

CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG

Add code
Jun 17, 2024
Viaarxiv icon

Attack Prompt Generation for Red Teaming and Defending Large Language Models

Add code
Oct 19, 2023
Viaarxiv icon