Picture for Wenfang Wu

Wenfang Wu

MM-BigBench: Evaluating Multimodal Models on Multimodal Content Comprehension Tasks

Add code
Oct 13, 2023
Viaarxiv icon

T-COL: Generating Counterfactual Explanations for General User Preferences on Variable Machine Learning Systems

Add code
Sep 28, 2023
Viaarxiv icon

RoCar: A Relationship Network-based Evaluation Method to Large Language Models

Add code
Jul 29, 2023
Viaarxiv icon