Picture for Jing Ma

Jing Ma

ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges

Add code
Nov 28, 2024
Viaarxiv icon

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Add code
Nov 20, 2024
Figure 1 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 2 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 3 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 4 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Viaarxiv icon

Invariant Shape Representation Learning For Image Classification

Add code
Nov 19, 2024
Viaarxiv icon

Multimodal Clinical Reasoning through Knowledge-augmented Rationale Generation

Add code
Nov 12, 2024
Viaarxiv icon

From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents

Add code
Nov 12, 2024
Figure 1 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Figure 2 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Figure 3 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Figure 4 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Viaarxiv icon

Towards Low-Resource Harmful Meme Detection with LMM Agents

Add code
Nov 08, 2024
Figure 1 for Towards Low-Resource Harmful Meme Detection with LMM Agents
Figure 2 for Towards Low-Resource Harmful Meme Detection with LMM Agents
Figure 3 for Towards Low-Resource Harmful Meme Detection with LMM Agents
Figure 4 for Towards Low-Resource Harmful Meme Detection with LMM Agents
Viaarxiv icon

Global Graph Counterfactual Explanation: A Subgraph Mapping Approach

Add code
Oct 25, 2024
Viaarxiv icon

AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation

Add code
Oct 01, 2024
Viaarxiv icon

A Survey of Out-of-distribution Generalization for Graph Machine Learning from a Causal View

Add code
Sep 15, 2024
Viaarxiv icon

Causal Inference with Large Language Model: A Survey

Add code
Sep 15, 2024
Viaarxiv icon