Picture for Zhenbang Sun

Zhenbang Sun

Evaluating and Advancing Multimodal Large Language Models in Ability Lens

Add code
Nov 22, 2024
Viaarxiv icon

Frame-Voyager: Learning to Query Frames for Video Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization

Add code
Sep 22, 2024
Viaarxiv icon

Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment

Add code
Sep 08, 2023
Viaarxiv icon

CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning

Add code
Sep 05, 2023
Viaarxiv icon

HIRL: A General Framework for Hierarchical Image Representation Learning

Add code
May 26, 2022
Figure 1 for HIRL: A General Framework for Hierarchical Image Representation Learning
Figure 2 for HIRL: A General Framework for Hierarchical Image Representation Learning
Figure 3 for HIRL: A General Framework for Hierarchical Image Representation Learning
Figure 4 for HIRL: A General Framework for Hierarchical Image Representation Learning
Viaarxiv icon

HCSC: Hierarchical Contrastive Selective Coding

Add code
Feb 01, 2022
Figure 1 for HCSC: Hierarchical Contrastive Selective Coding
Figure 2 for HCSC: Hierarchical Contrastive Selective Coding
Figure 3 for HCSC: Hierarchical Contrastive Selective Coding
Figure 4 for HCSC: Hierarchical Contrastive Selective Coding
Viaarxiv icon

Cross-category Video Highlight Detection via Set-based Learning

Add code
Aug 26, 2021
Figure 1 for Cross-category Video Highlight Detection via Set-based Learning
Figure 2 for Cross-category Video Highlight Detection via Set-based Learning
Figure 3 for Cross-category Video Highlight Detection via Set-based Learning
Figure 4 for Cross-category Video Highlight Detection via Set-based Learning
Viaarxiv icon