Zhenfei Yin

WorldSimBench: Towards Video Generation Models as World Simulators

Oct 23, 2024
Via arXiv

Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation

Oct 12, 2024
Via arXiv

GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing

Jun 30, 2024
Via arXiv

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

Jun 17, 2024
Via arXiv

RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents

Mar 28, 2024
Via arXiv

Assessment of Multimodal Large Language Models in Alignment with Human Values

Mar 26, 2024
Via arXiv

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Mar 19, 2024
Via arXiv

Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models

Feb 29, 2024
Via arXiv

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Jan 29, 2024
Via arXiv

Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models

Dec 14, 2023
Via arXiv