Picture for Zhelun Shi

Zhelun Shi

WorldSimBench: Towards Video Generation Models as World Simulators

Add code
Oct 23, 2024
Figure 1 for WorldSimBench: Towards Video Generation Models as World Simulators
Figure 2 for WorldSimBench: Towards Video Generation Models as World Simulators
Figure 3 for WorldSimBench: Towards Video Generation Models as World Simulators
Figure 4 for WorldSimBench: Towards Video Generation Models as World Simulators
Viaarxiv icon

RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents

Add code
Mar 28, 2024
Figure 1 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Figure 2 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Figure 3 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Figure 4 for RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Viaarxiv icon

Assessment of Multimodal Large Language Models in Alignment with Human Values

Add code
Mar 26, 2024
Viaarxiv icon

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Add code
Jan 29, 2024
Figure 1 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Figure 2 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Figure 3 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Figure 4 for From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Viaarxiv icon

ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models

Add code
Nov 05, 2023
Viaarxiv icon

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark

Add code
Jun 18, 2023
Viaarxiv icon