Junkai Chen

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

Oct 04, 2024
Via arXiv

Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models

Aug 18, 2024
Via arXiv

NLPerturbator: Studying the Robustness of Code LLMs to Natural Language Variations

Jun 28, 2024
Via arXiv

Evaluating Large Language Models with Runtime Behavior of Program Execution

Mar 25, 2024
Via arXiv