Picture for Mingyu Derek Ma

Mingyu Derek Ma

Are Large-Language Models Graph Algorithmic Reasoners?

Add code
Oct 29, 2024
Viaarxiv icon

GIVE: Structured Reasoning with Knowledge Graph Inspired Veracity Extrapolation

Add code
Oct 11, 2024
Viaarxiv icon

CLIMB: A Benchmark of Clinical Bias in Large Language Models

Add code
Jul 07, 2024
Viaarxiv icon

MIRAI: Evaluating LLM Agents for Event Forecasting

Add code
Jul 01, 2024
Viaarxiv icon

CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions

Add code
Jun 14, 2024
Viaarxiv icon

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Add code
Jun 13, 2024
Figure 1 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 2 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 3 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 4 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Viaarxiv icon

Improving Event Definition Following For Zero-Shot Event Detection

Add code
Mar 05, 2024
Viaarxiv icon

Instructional Fingerprinting of Large Language Models

Add code
Jan 21, 2024
Viaarxiv icon

From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media

Add code
Nov 16, 2023
Viaarxiv icon

Mitigating Bias for Question Answering Models by Tracking Bias Influence

Add code
Oct 13, 2023
Viaarxiv icon