Picture for Ziyang Luo

Ziyang Luo

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Add code
Nov 20, 2024
Viaarxiv icon

From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents

Add code
Nov 12, 2024
Viaarxiv icon

Towards Low-Resource Harmful Meme Detection with LMM Agents

Add code
Nov 08, 2024
Viaarxiv icon

AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation

Add code
Oct 01, 2024
Viaarxiv icon

CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?

Add code
Aug 20, 2024
Viaarxiv icon

MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models

Add code
Jun 17, 2024
Viaarxiv icon

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Add code
Jun 11, 2024
Viaarxiv icon

CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models

Add code
May 01, 2024
Viaarxiv icon

CodeHalu: Code Hallucinations in LLMs Driven by Execution-based Verification

Add code
Apr 30, 2024
Viaarxiv icon

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

Add code
Apr 15, 2024
Viaarxiv icon