Picture for Yujie Lu

Yujie Lu

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Add code
Jun 16, 2024
Viaarxiv icon

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Add code
Jun 12, 2024
Viaarxiv icon

Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?

Add code
Jun 11, 2024
Viaarxiv icon

From Text to Pixel: Advancing Long-Context Understanding in MLLMs

Add code
May 23, 2024
Viaarxiv icon

Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)

Add code
Apr 05, 2024
Viaarxiv icon

Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes

Add code
Mar 03, 2024
Viaarxiv icon

VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following

Add code
Nov 29, 2023
Viaarxiv icon

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Add code
Nov 02, 2023
Viaarxiv icon

ImagenHub: Standardizing the evaluation of conditional image generation models

Add code
Oct 17, 2023
Viaarxiv icon

Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting

Add code
Oct 11, 2023
Viaarxiv icon