Picture for Mingyu Derek Ma

Mingyu Derek Ma

MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design

Add code
Dec 20, 2024
Figure 1 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 2 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 3 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 4 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Viaarxiv icon

Are Large-Language Models Graph Algorithmic Reasoners?

Add code
Oct 29, 2024
Viaarxiv icon

GIVE: Structured Reasoning with Knowledge Graph Inspired Veracity Extrapolation

Add code
Oct 11, 2024
Viaarxiv icon

CLIMB: A Benchmark of Clinical Bias in Large Language Models

Add code
Jul 07, 2024
Figure 1 for CLIMB: A Benchmark of Clinical Bias in Large Language Models
Figure 2 for CLIMB: A Benchmark of Clinical Bias in Large Language Models
Figure 3 for CLIMB: A Benchmark of Clinical Bias in Large Language Models
Figure 4 for CLIMB: A Benchmark of Clinical Bias in Large Language Models
Viaarxiv icon

MIRAI: Evaluating LLM Agents for Event Forecasting

Add code
Jul 01, 2024
Viaarxiv icon

CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescriptions

Add code
Jun 14, 2024
Viaarxiv icon

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Add code
Jun 13, 2024
Figure 1 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 2 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 3 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 4 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Viaarxiv icon

Improving Event Definition Following For Zero-Shot Event Detection

Add code
Mar 05, 2024
Viaarxiv icon

Instructional Fingerprinting of Large Language Models

Add code
Jan 21, 2024
Viaarxiv icon

From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media

Add code
Nov 16, 2023
Viaarxiv icon