Zifan Zheng

GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism

Jan 14, 2025
Via arXiv

TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles

Oct 07, 2024
Via arXiv

Attention Heads of Large Language Models: A Survey

Sep 05, 2024
Via arXiv

Internal Consistency and Self-Feedback in Large Language Models: A Survey

Jul 19, 2024
Via arXiv

xFinder: Robust and Pinpoint Answer Extraction for Large Language Models

May 23, 2024
Via arXiv