Picture for Fuwen Luo

Fuwen Luo

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Add code
Nov 06, 2024
Viaarxiv icon

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models

Add code
Feb 27, 2024
Viaarxiv icon

CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models

Add code
Feb 21, 2024
Viaarxiv icon

Model Composition for Multimodal Large Language Models

Add code
Feb 20, 2024
Viaarxiv icon

Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion

Add code
Feb 19, 2024
Viaarxiv icon

Towards Unified Alignment Between Agents, Humans, and Environment

Add code
Feb 14, 2024
Viaarxiv icon

Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models

Add code
Sep 14, 2023
Viaarxiv icon

Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf

Add code
Sep 09, 2023
Viaarxiv icon