Picture for Zhelun Shi

Zhelun Shi

WorldSimBench: Towards Video Generation Models as World Simulators

Add code
Oct 23, 2024
Viaarxiv icon

RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents

Add code
Mar 28, 2024
Viaarxiv icon

Assessment of Multimodal Large Language Models in Alignment with Human Values

Add code
Mar 26, 2024
Viaarxiv icon

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Add code
Jan 29, 2024
Figure 1 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Figure 2 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Figure 3 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Figure 4 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Viaarxiv icon

ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models

Add code
Nov 05, 2023
Viaarxiv icon

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark

Add code
Jun 18, 2023
Viaarxiv icon