Bohao Li

Video-R1: Reinforcing Video Reasoning in MLLMs

Mar 27, 2025

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Dec 03, 2024

VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI

Oct 15, 2024

SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension

Apr 25, 2024

EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models

Dec 11, 2023

SEED-Bench-2: Benchmarking Multimodal Large Language Models

Nov 28, 2023

SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension

Aug 02, 2023

Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Mar 03, 2023

Proposal Distribution Calibration for Few-Shot Object Detection

Dec 15, 2022

Collaboration of Pre-trained Models Makes Better Few-shot Learner

Sep 25, 2022